Fuzzy c-means clustering of incomplete data

@article{Hathaway2001FuzzyCC,
  title={Fuzzy c-means clustering of incomplete data},
  author={Richard J. Hathaway and James C. Bezdek},
  journal={IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society},
  year={2001},
  volume={31 5},
  pages={
          735-44
        }
}
  • R. Hathaway, J. Bezdek
  • Published 1 October 2001
  • Computer Science
  • IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics : a publication of the IEEE Systems, Man, and Cybernetics Society
The problem of clustering a real s-dimensional data set X={x(1 ),,,,,x(n)} subset R(s) is considered. Usually, each observation (or datum) consists of numerical values for all s features (such as height, length, etc.), but sometimes data sets can contain vectors that are missing one or more of the feature values. For example, a particular datum x(k) might be incomplete, having the form x(k)=(254.3, ?, 333.2, 47.45, ?)(T), where the second and fifth feature values are missing. The fuzzy c-means… 
Interval kernel Fuzzy C-Means clustering of incomplete data
Fuzzy non-metric model for data with tolerance and its application to incomplete data clustering
TLDR
New clustering algorithm to apply the concept of tolerance to FNM, called fuzzy non-metric model for data with tolerance (FNM-T) is proposed and it is shown that the proposed algorithm handle incomplete data sets.
Fuzzy c-means clustering of partially missing data sets based on statistical representation
TLDR
Comparisons and analysis of the clustering results of the incomplete IRIS data demonstrate that the proposed statistical representation can estimate missing attributes rationally and improve the fuzzy c-means clustering of incomplete data.
An attribute weighted fuzzy c-means algorithm for incomplete data sets
  • Dan Li, C. Zhong, Jinhua Li
  • Computer Science
    2012 International Conference on System Science and Engineering (ICSSE)
  • 2012
TLDR
An attribute weighted fuzzy c-means algorithm for incomplete data clustering is proposed in this paper, in which the important degree of each attribute are viewed as additional variables and optimized during clustering.
A New Algorithm for Fuzzy Clustering Handling Incomplete Dataset
TLDR
The proposed algorithm is based on neural network and uses Optimal Completion Strategy for missing value estimation in incomplete dataset and an extension of the algorithm is proposed, to tackle the problem of estimating the number of clusters, by using a multi level OCS-FSOM method.
Interval Fuzzy C-means Approach for Incomplete Data Clustering Based on Neural Networks
TLDR
This work proposes an efficient interval approach which utilizes a missing-data back propagation to estimate the error value of the complete property of the missing samples and convert the value ofThe missing attribute to the form of an interval, which performs clustering analysis on the recovered data set.
Fuzzy Clustering of Incomplete Data
TLDR
A new approach for adapting the fuzzy c-means clustering algorithm to incomplete data that takes the scatters of clusters into account is presented and is able to correctly detect the clusters closely located to each other and clusters with wide density variations.
Fuzzy c-Means Clustering of Incomplete Data Using Dimension-Wise Fuzzy Variances of Clusters
TLDR
An approach for adapting the fuzzy c-means algorithm to incomplete data that uses the dimension-wise fuzzy variances of clusters for imputation of missing values is presented.
A Robust Self-Organizing Approach to Effectively Clustering Incomplete Data
  • Vo Thi Ngoc Chau
  • Computer Science
    2015 Seventh International Conference on Knowledge and Systems Engineering (KSE)
  • 2015
TLDR
iS nps can do clustering on incomplete data as well as estimate incomplete data using the nearest prototype strategy in an iterative manner and can produce the clusters of good quality and a better approximation of incomplete data via the experiments on benchmark data sets.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 33 REFERENCES
A possibilistic approach to clustering
TLDR
An appropriate objective function whose minimum will characterize a good possibilistic partition of the data is constructed, and the membership and prototype update equations are derived from necessary conditions for minimization of the criterion function.
Characterization and detection of noise in clustering
  • R. Davé
  • Computer Science
    Pattern Recognit. Lett.
  • 1991
Fuzzy decision trees: issues and methods
  • C. Janikow
  • Computer Science
    IEEE Trans. Syst. Man Cybern. Part B
  • 1998
TLDR
This work presents another modification, aimed at combining symbolic decision trees with approximate reasoning offered by fuzzy representation, to exploit complementary advantages of both: popularity in applications to learning from examples, high knowledge comprehensibility of decision trees, and the ability to deal with inexact and uncertain information of fuzzy representation.
Convergence theory for fuzzy c-means: Counterexamples and repairs
A counterexample to the original incorrect convergence theorem for the fuzzy c-means (FCM) clustering algorithms (see J.C. Bezdak, IEEE Trans. Pattern Anal. and Math. Intell., vol.PAMI-2, no.1,
Pattern Recognition with Partly Missing Data
  • J. K. Dixon
  • Computer Science
    IEEE Transactions on Systems, Man, and Cybernetics
  • 1979
TLDR
An experimental comparison of several simple inexpensive ways of doing pattern recognition when some data elements are missing (blank) is presented and suggestions are given for choosing the best method for a particular application.
22 Inference and data tables with missing values
Pattern Recognition with Fuzzy Objective Function Algorithms
  • J. Bezdek
  • Computer Science
    Advanced Applications in Pattern Recognition
  • 1981
TLDR
Books, as a source that may involve the facts, opinion, literature, religion, and many others are the great friends to join with, becomes what you need to get.
Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values.
Estimating the mean and the covariance matrix of an incomplete dataset and filling in missing values with imputed values is generally a nonlinear problem, which must be solved iteratively. The
Application of clustering to estimate missing data and improve data integrity
Two problems in the use of computerized data base systems are: (1) How can we estimate the values for missing data? (2) How can we improve data integrity, that is, reduce the number of errors in the
...
1
2
3
4
...