Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data

  title={Cluster Tendency Assessment for Fuzzy Clustering of Incomplete Data},
  author={Ludmila Himmelspach and Daniel Hommers and Stefan Conrad},
  booktitle={EUSFLAT Conf.},
The quality of results for partitioning clustering algorithms depends on the assumption made on the number of clusters presented in the data set. Applying clustering methods on real data missing values turn out to be an additional challenging problem for clustering algorithms. Fuzzy clustering approaches adapted to incomplete data perform well for a given number of clusters. In this study, we analyse dierent cluster validity functions in terms of applicability on incomplete data on the one hand… 

Figures and Tables from this paper

On Cluster Validity for Fuzzy Clustering of Incomplete Data
This study addresses the problem of finding the optimal number of clusters on incomplete data using cluster validity functions and analyzes which fuzzy clustering algorithm for incomplete data produces better partitioning results for cluster validity.
Fuzzy Clustering of Incomplete Data
A new approach for adapting the fuzzy c-means clustering algorithm to incomplete data that takes the scatters of clusters into account is presented and is able to correctly detect the clusters closely located to each other and clusters with wide density variations.
A Three-Way Decisions Clustering Algorithm for Incomplete Data
A three-way decisions clustering algorithm for incomplete data based on attribute significance and miss rate is proposed and the experimental results show preliminarily the effectiveness of the proposed algorithm.
An imputation-based method for fuzzy clustering of incomplete data
  • S. Soni, I. Sharma
  • Computer Science
    2017 International Conference on Communication and Signal Processing (ICCSP)
  • 2017
This paper proposes an iterative technique to impute data and fine tune the estimates based on the idea that information from complete objects should be garnered to estimate missing values such that purpose of analysis is supported.
Incomplete data mining based on fuzzy tolerance quotient space
Model of fuzzy tolerance granularity is discussed, a hierarchical structure is given, and best approximation of incomplete data is researched, and the distance between set cover and set partition is defined.
New method of mining incomplete data
In this paper, optimal partitioning of incomplete data is researched and the proposed method improves correct rate greatly, and the experimental results demonstrate the method’s validity.
Discretization methods for Bayesian networks in the case of the earthquake
This result indicates that K-means is the best method for discretizing the data that the authors use in this study, and compares the three methods of discretization used a confusion matrix.


Fuzzy Clustering of Incomplete Data Based on Cluster Dispersion
This paper presents an extension for existing fuzzy c-means clustering algorithms for incomplete data, which uses the information about the dispersion of clusters, and shows that this approach outperforms other clustering methods for incompleteData.
Unsupervised Optimal Fuzzy Clustering
The unsupervised fuzzy partition-optimal number of classes algorithm performs well in situations of large variability of cluster shapes, densities, and number of data points in each cluster.
A Clustering Performance Measure Based on Fuzzy Set Decomposition
  • E. Backer, Anil K. Jain
  • Computer Science
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 1981
Several experiments reported in here show that the proposed performance measure puts an order on different partitions of the same data which is consistent with the error rate of a classifier designed on the basis of the obtained cluster labelings.
Clustering approaches for data with missing values: Comparison and evaluation
Different approaches proposed in the literature for adapting partitioning clustering algorithms for dealing withMissing values in data are discussed and the analysis of the accuracy of these methods depending on the different missing-data mechanisms and the percentage of missing values in the data sets are analyzed.
Fuzzy K-means clustering with missing values
This paper proposes a technique to exploit the information provided by the patterns with the missing values so that the clustering results are enhanced and is consolidated in this paper by fine-tuning theMissing values using the information from other attributes.
Fuzzy Cluster Analysis
The Grade of Membership model – which has a long history of usage in other contexts – is proposed as a straightforward way of performing fuzzy cluster analysis.
Fuzzy c-means clustering of incomplete data
Four strategies for doing FCM clustering of incomplete data sets are given, three of which involve modified versions of the FCM algorithm and numerical convergence properties of the new algorithms are discussed.
Different approaches to fuzzy clustering of incomplete datasets
A geometric approach to cluster validity for normal mixtures
Viewing mixture decomposition as probabilistic clustering as opposed to parametric estimation enables both fuzzy and crisp measures of cluster validity for this problem, and uses the expectation-maximization algorithm to find clusters in the data.
Finding Groups in Data: An Introduction to Chster Analysis
This book make understandable the cluster analysis is based notion of starsmodern treatment, which efficiently finds accurate clusters in data and discusses various types of study the user set explicitly but also proposes another.