Mixtures of probabilistic principal component analyzers.

Abstract

Principal component analysis (PCA) is one of the most popular techniques for processing, compressing, and visualizing data, although its effectiveness is limited by its global linearity. While nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data complexity by a combination of local linear PCA projections. However, conventional PCA does not correspond to a probability density, and so there is no unique way to combine PCA models. Therefore, previous attempts to formulate mixture models for PCA have been ad hoc to some extent. In this article, PCA is formulated within a maximum likelihood framework, based on a specific form of gaussian latent variable model. This leads to a well-defined mixture model for probabilistic principal component analyzers, whose parameters can be determined using an expectation-maximization algorithm. We discuss the advantages of this model in the context of clustering, density modeling, and local dimensionality reduction, and we demonstrate its application to image compression and handwritten digit recognition.

Extracted Key Phrases

11 Figures and Tables

Showing 1-10 of 435 extracted citations
050100'98'00'02'04'06'08'10'12'14'16
Citations per Year

1,121 Citations

Semantic Scholar estimates that this publication has received between 917 and 1,363 citations based on the available data.

See our FAQ for additional information.