Clustering Using Monte Carlo Cross-Validation

@inproceedings{Smyth1996ClusteringUM,
  title={Clustering Using Monte Carlo Cross-Validation},
  author={Padhraic Smyth},
  booktitle={KDD},
  year={1996}
}
Finding the “right” number of clusters, Ic, for a data set is a difficult, and often ill-posed, problem. In a probabilistic clustering context, likelihood-ratios, penalized likelihoods, and Bayesian techniques are among the more popular techniques. In this paper a new cross-validated likelihood criterion is investigated for determining cluster structure. A practical clustering algorithm based on Monte Carlo crossvalidation (MCCV) is introduced. The algorithm permits the data analyst to judge if… CONTINUE READING
Highly Influential
This paper has highly influenced 15 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 114 extracted citations

Estimating the predominant number of clusters in a dataset

Intell. Data Anal. • 2013
View 8 Excerpts
Highly Influenced

Identifying Mycobacterium tuberculosis complex strain families using spoligotypes.

Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases • 2006
View 5 Excerpts
Highly Influenced

A density-based cluster validity approach using multi-representatives

Pattern Recognition Letters • 2008
View 3 Excerpts
Highly Influenced

Analysis of fully polarimetric SAR data based on the Cloude-Pottier decomposition and the complex Wishart classifier

2007 IEEE International Geoscience and Remote Sensing Symposium • 2007
View 4 Excerpts
Highly Influenced

References

Publications referenced by this paper.
Showing 1-10 of 19 references

Modelbased Gaussian and non-Gaussian clustering,

J. D. Banfield, A. E. Raftery
1993
View 6 Excerpts
Highly Influenced

A comparative study of ordinary cross-validation, v-fold cross-validation, and the repeated learning-testing methods,

P. Burman
Biometrika • 1989
View 2 Excerpts
Highly Influenced

Bayesian Classification (AutoClass): Theory and Results

Advances in Knowledge Discovery and Data Mining • 1996
View 2 Excerpts

Bayesian Data Analysis, London, UK

A. Gelman, J. B. Carlin, H. S. Stern, D. B. Rubin
1995

Gaussian parsimonious clustering models

Pattern Recognition • 1995
View 3 Excerpts

Bayesian estimation of finite mixture distributions,

J. Diebolt, C. P. Robert
J. R. Stat. Sot. B, • 1994
View 1 Excerpt

Neural networks and related methods for classification (with discussion) ,

B. D. Ripley
J. Roy. Stat. Sot. B, • 1994
View 2 Excerpts

Similar Papers

Loading similar papers…