• Corpus ID: 7099295

Intrinsic classification by MML - the Snob program

@inproceedings{Wallace1994IntrinsicCB,
  title={Intrinsic classification by MML - the Snob program},
  author={Chris S. Wallace and David L. Dowe},
  year={1994}
}
We provide a brief overviewo fM inimum Message Length (MML) inductive inference (Wallace and Boulton (1968), Wallace and Freeman (1987)). We then outline howMML is used for statistical parameter estimation, and howt he MML intrinsic classification program, Snob (Wallace and Boulton (1968), Wallace (1986), Wallace (1990)) uses the message lengths from various parameter estimates to enable it to combine parameter estimation with model selection in intrinsic classification. Wem ention here the… 

MML mixture modelling of multi-state, Poisson, von Mises circular and Gaussian distributions

TLDR
The MML theory can be regarded as the theory with the highest posterior probability, and the MML mixture modelling program, Snob, uses the message lengths from various parameter estimates to enable it to combine parameter estimation with selection of the number of components.

MML clustering of multi-state, Poisson, von Mises circular and Gaussian distributions

TLDR
This work outlines how MML is used for statistical parameter estimation, and how the MML mixture modelling program, Snob, uses the message lengths from various parameter estimates to enable it to combine parameter estimation with selection of the number of components and estimation of the relative abundances of the components.

Unsupervised learning of Gamma mixture models using Minimum Message Length

TLDR
This research considers MML mixture modelling using the Gammadistribution and shows that the proposed MML method performs better than AIC and BIC, and works well with datasetscontaining left-skewed components such as PalmValley (Australia) imagedataset.

Minimum message length inference : theory and applications

TLDR
This thesis is that Minimum Message Length inductive inference provides a principled method for finding estimates of complex models with real (or continuous) parameters on small data sets by optimally discretising real parameters.

Unsupervised Learning Using MML

TLDR
The Minimum Message Length (MML)criterion is applied to the unsup ervised learning problem and an empirical comparison ofcriteriaprominentintheliteratureforesti-mating the numb er of comp onents in a dataset is given.

Model Selection and the Principle of Minimum Description Length

TLDR
This article reviews the principle of minimum description length (MDL) for problems of model selection, and illustrates the MDL principle by considering problems in regression, nonparametric curve estimation, cluster analysis, and time series analysis.

Model Selection and the Principleof Minimum Description

TLDR
This paper reviews the principle of Minimum Description Length (MDL) for problems of model selection and illustrates the MDL principle by considering problems in regression, nonparametric curve estimation, cluster analysis, and time series analysis.

MML inference of Finite State Automata for probabilistic spam detection

TLDR
This research is a two-fold solution to a problem where one part focusses on the best inferred PFSA using MML and the second part focuses on the classification problem of Spam Detection.

Comparison of Unsupervised Classi ersMuhammad

TLDR
An empirical comparison of ART2, Autoclass and Snob shows that statistical classiiers, especially Snob, perform better than their neural network counterpart ART2.
...

References

SHOWING 1-10 OF 33 REFERENCES

Classification by Minimum-Message-Length Inference

TLDR
A solution has been found using a novel coding trick, which could be useful in many inductive inferences in any MML inference which infers many discrete "nuisance" parameters.

Estimation and Inference by Compact Coding

SUMMARY The systematic variation within a set of data, as represented by a usual statistical model, may be used to encode the data in a more compact form than would be possible if they were

Minimum complexity density estimation

TLDR
An index of resolvability is proved to bound the rate of convergence of minimum complexity density estimators as well as the information-theoretic redundancy of the corresponding total description length to demonstrate the statistical effectiveness of the minimum description-length principle as a method of inference.

The information content of a multistate distribution.

Eye-movement data: Cumulative fixation time and cluster analysis

TLDR
Methods for two- and three-dimensional representation of cumulative fixation time (CFT) and ways of enhancing the peaks in CFT distributions are outlined and illustrated by reference to examples from eye-movement studies of cognitive processes.

Statistical Analysis of Circular Data

TLDR
This book presents a meta-modelling framework for analysing two or more samples of unimodal data from von Mises distributions, and some modern Statistical Techniques for Testing and Estimation used in this study.

The Classification of Depression by Numerical Taxonomy

TLDR
The findings of this study suggest that endogenous depression is a syndrome which provides a useful basis for classifying depressed patients and is not merely the result of observer bias.