Probabilistic Principal Component Analysis

descriptionPublicationkeyboard_double_arrow_right Article 01 Sep 1999 English Publisher:Oxford University Press (OUP)Journal:Journal of the Royal Statistical Society Series B: Statistical Methodology, volume 61, pages 611-622 (issn: 1369-7412, eissn: 1467-9868,

Copyright policy )

Authors: Tipping, Michael E.; Bishop, Christopher M.;

doi: 10.1111/1467-9868.00196

Probabilistic Principal Component Analysis

- Summary
- Subjects
- Metrics

Abstract

Summary Principal component analysis (PCA) is a ubiquitous technique for data analysis and processing, but one which is not based on a probability model. We demonstrate how the principal axes of a set of observed data vectors may be determined through maximum likelihood estimation of parameters in a latent variable model that is closely related to factor analysis. We consider the properties of the associated likelihood function, giving an EM algorithm for estimating the principal subspace iteratively, and discuss, with illustrative examples, the advantages conveyed by this probabilistic approach to PCA.

Related Organizations

Microsoft (United States)
United States
Microsoft Research (United Kingdom)
United Kingdom

Keywords

density estimation, principal components analysis, factor analysis, Factor analysis and principal components; correspondence analysis, Gaussian mixtures, maximum likelihood, EM algorithm, dimensionality reduction, probability model

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2K
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 0.01%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.01%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

2K

Top 0.01%

Top 10%

hybrid

Fields of Science (4) View all

Fields of Science