Fuzzy c-means clustering of incomplete data
@article{Hathaway2001FuzzyCC,
title={Fuzzy c-means clustering of incomplete data},
author={Richard J. Hathaway and James C. Bezdek},
journal={IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society},
year={2001},
volume={31 5},
pages={
735-44
}
}The problem of clustering a real s-dimensional data set X={x(1 ),,,,,x(n)} subset R(s) is considered. Usually, each observation (or datum) consists of numerical values for all s features (such as height, length, etc.), but sometimes data sets can contain vectors that are missing one or more of the feature values. For example, a particular datum x(k) might be incomplete, having the form x(k)=(254.3, ?, 333.2, 47.45, ?)(T), where the second and fifth feature values are missing. The fuzzy c-means…
Figures, Tables, and Topics from this paper
427 Citations
Fuzzy C-Means clustering of incomplete data based on probabilistic information granules of missing values
- Computer ScienceKnowl. Based Syst.
- 2016
Fuzzy non-metric model for data with tolerance and its application to incomplete data clustering
- Computer Science2015 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE)
- 2015
New clustering algorithm to apply the concept of tolerance to FNM, called fuzzy non-metric model for data with tolerance (FNM-T) is proposed and it is shown that the proposed algorithm handle incomplete data sets.
Fuzzy c-means clustering of partially missing data sets based on statistical representation
- Computer Science2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery
- 2010
Comparisons and analysis of the clustering results of the incomplete IRIS data demonstrate that the proposed statistical representation can estimate missing attributes rationally and improve the fuzzy c-means clustering of incomplete data.
An attribute weighted fuzzy c-means algorithm for incomplete data sets
- Computer Science2012 International Conference on System Science and Engineering (ICSSE)
- 2012
An attribute weighted fuzzy c-means algorithm for incomplete data clustering is proposed in this paper, in which the important degree of each attribute are viewed as additional variables and optimized during clustering.
A New Algorithm for Fuzzy Clustering Handling Incomplete Dataset
- Computer ScienceInt. J. Artif. Intell. Tools
- 2014
The proposed algorithm is based on neural network and uses Optimal Completion Strategy for missing value estimation in incomplete dataset and an extension of the algorithm is proposed, to tackle the problem of estimating the number of clusters, by using a multi level OCS-FSOM method.
Interval Fuzzy C-means Approach for Incomplete Data Clustering Based on Neural Networks
- Computer Science
- 2018
This work proposes an efficient interval approach which utilizes a missing-data back propagation to estimate the error value of the complete property of the missing samples and convert the value ofThe missing attribute to the form of an interval, which performs clustering analysis on the recovered data set.
Fuzzy Clustering of Incomplete Data
- Computer Science
- 2016
A new approach for adapting the fuzzy c-means clustering algorithm to incomplete data that takes the scatters of clusters into account is presented and is able to correctly detect the clusters closely located to each other and clusters with wide density variations.
Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters
- Computer ScienceIPMU
- 2016
An approach for adapting the fuzzy c-means algorithm to incomplete data that uses the dimension-wise fuzzy variances of clusters for imputation of missing values is presented.
A Robust Self-Organizing Approach to Effectively Clustering Incomplete Data
- Computer Science2015 Seventh International Conference on Knowledge and Systems Engineering (KSE)
- 2015
iS nps can do clustering on incomplete data as well as estimate incomplete data using the nearest prototype strategy in an iterative manner and can produce the clusters of good quality and a better approximation of incomplete data via the experiments on benchmark data sets.
References
SHOWING 1-10 OF 33 REFERENCES
A possibilistic approach to clustering
- Computer ScienceIEEE Trans. Fuzzy Syst.
- 1993
An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function.
Characterization and detection of noise in clustering
- Computer SciencePattern Recognit. Lett.
- 1991
Fuzzy decision trees: issues and methods
- Computer ScienceIEEE Trans. Syst. Man Cybern. Part B
- 1998
This work presents another modification, aimed at combining symbolic decision trees with approximate reasoning offered by fuzzy representation, to exploit complementary advantages of both: popularity in applications to learning from examples, high knowledge comprehensibility of decision trees, and the ability to deal with inexact and uncertain information of fuzzy representation.
Convergence theory for fuzzy c-means: Counterexamples and repairs
- MathematicsIEEE Transactions on Systems, Man, and Cybernetics
- 1987
A counterexample to the original incorrect convergence theorem for the fuzzy c-means (FCM) clustering algorithms (see J.C. Bezdak, IEEE Trans. Pattern Anal. and Math. Intell., vol.PAMI-2, no.1,…
Pattern Recognition with Partly Missing Data
- Computer ScienceIEEE Transactions on Systems, Man, and Cybernetics
- 1979
An experimental comparison of several simple inexpensive ways of doing pattern recognition when some data elements are missing (blank) is presented and suggestions are given for choosing the best method for a particular application.
22 Inference and data tables with missing values
- PhysicsClassification, Pattern Recognition and Reduction of Dimensionality
- 1982
Pattern Recognition with Fuzzy Objective Function Algorithms
- Computer ScienceAdvanced Applications in Pattern Recognition
- 1981
Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.
Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values.
- Mathematics
- 2001
Estimating the mean and the covariance matrix of an incomplete dataset and filling in missing values with imputed values is generally a nonlinear problem, which must be solved iteratively. The…
Application of clustering to estimate missing data and improve data integrity
- Computer ScienceICSE '76
- 1976
Two problems in the use of computerized data base systems are: (1) How can we estimate the values for missing data? (2) How can we improve data integrity, that is, reduce the number of errors in the…












