Geodesic distance based fuzzy clustering

Balazs Feil, Janos Abonyi

Research output: Chapter in Book/Report/Conference proceeding › Chapter

13 Citations (Scopus)

Abstract

Clustering is a widely applied data mining tool for detecting the hidden structure of complex multivariate datasets. It solves two kinds of problems simultaneously: it partitions the dataset into clusters of objects that are similar to each other, and it describes the clusters by cluster prototypes that provide some information about the distribution of the data. In most cases these cluster prototypes describe the clusters as simple geometrical objects, such as spheres, ellipsoids, lines, or linear subspaces, and each cluster prototype defines a special distance function. Unfortunately, in most cases the user has no prior knowledge about the number of clusters, or even about the proper shape of the prototypes. The real distribution of the data is generally much more complex than these simple geometrical objects, and the number of clusters depends much more on how well the chosen cluster prototypes fit the distribution of the data than on the real groups within the data. This is especially true when the clusters are used for local linear modeling. The aim of this paper is not to define a new distance norm based on a problem-dependent cluster prototype, but to show how the so-called geodesic distance, which is based on exploring the manifold the data lie on, can be used in clustering instead of the classical Euclidean distance. The paper shows how this distance measure can be integrated into fuzzy clustering and presents some examples to demonstrate the advantages of the proposed methods.
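
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm, which the abstract does not specify. It assumes that geodesic distances are approximated by shortest paths in a k-nearest-neighbour graph (as in Isomap) and that the fuzzy partition is computed with a fuzzy c-medoids scheme, chosen here only because it works directly on a distance matrix. The function names, the parameters `k`, `c`, `m`, and the half-circle test data are all illustrative assumptions.

```python
import numpy as np


def geodesic_distances(X, k=5):
    """Approximate geodesic distances (assumed approach: k-NN graph plus
    Floyd-Warshall shortest paths). Assumes the k-NN graph is connected."""
    n = len(X)
    euclid = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    graph = np.full((n, n), np.inf)
    np.fill_diagonal(graph, 0.0)
    # Column 0 of the sorted indices is the point itself, so skip it.
    nearest = np.argsort(euclid, axis=1)[:, 1:k + 1]
    rows = np.repeat(np.arange(n), k)
    cols = nearest.ravel()
    graph[rows, cols] = euclid[rows, cols]
    graph[cols, rows] = euclid[rows, cols]  # keep the graph symmetric
    # Floyd-Warshall: allow each point in turn as an intermediate stop.
    for mid in range(n):
        graph = np.minimum(graph, graph[:, mid:mid + 1] + graph[mid:mid + 1, :])
    return graph


def fuzzy_c_medoids(D, c=2, m=2.0, n_iter=50, seed=0):
    """Fuzzy c-medoids on a precomputed distance matrix D (illustrative
    stand-in for a geodesic-distance-based fuzzy clustering)."""
    rng = np.random.default_rng(seed)
    medoids = rng.choice(D.shape[0], size=c, replace=False)
    for _ in range(n_iter):
        # Distances from each medoid to every point, clipped to avoid 0-division.
        d = np.maximum(D[medoids, :], 1e-12)
        inv = d ** (-2.0 / (m - 1.0))
        U = inv / inv.sum(axis=0, keepdims=True)  # memberships, columns sum to 1
        # New medoid of cluster i: the point minimising sum_k U[i, k]^m * D[k, j].
        new_medoids = np.argmin((U ** m) @ D, axis=1)
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    return U, medoids


# Points on a half circle: the end points are close in the Euclidean sense
# (distance 2) but far apart along the curve.
t = np.linspace(0.0, np.pi, 50)
X = np.column_stack([np.cos(t), np.sin(t)])
D = geodesic_distances(X, k=2)
U, medoids = fuzzy_c_medoids(D, c=2)
print("Euclidean vs. geodesic end-to-end distance:",
      np.linalg.norm(X[0] - X[-1]), D[0, -1])
```

On curved data such as this half circle, the graph-based distance follows the curve rather than cutting across it, which is the property the abstract relies on when replacing the Euclidean distance. The Floyd-Warshall step costs O(n^3) time, so this sketch is suitable only for small datasets.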

Original language: English
Title of host publication: Soft Computing in Industrial Applications
Subtitle of host publication: Recent Trends
Editors: Ashraf Saad, Erel Avineri, Keshav Dahal, Muhammad Sarfraz, Rajkumar Roy
Pages: 50-59
Number of pages: 10
DOIs: https://doi.org/10.1007/978-3-540-70706-6_5
Publication status: Published - Dec 1 2007

Publication series

Name: Advances in Soft Computing
Volume: 39
ISSN (Print): 1615-3871
ISSN (Electronic): 1860-0794

ASJC Scopus subject areas

  • Computer Science (miscellaneous)
  • Computational Mechanics
  • Computer Science Applications

Cite this

Feil, B., & Abonyi, J. (2007). Geodesic distance based fuzzy clustering. In A. Saad, E. Avineri, K. Dahal, M. Sarfraz, & R. Roy (Eds.), Soft Computing in Industrial Applications: Recent Trends (pp. 50-59). (Advances in Soft Computing; Vol. 39). https://doi.org/10.1007/978-3-540-70706-6_5