
arXiv: 1301.3558
handle: 21.11116/0000-0000-FC73-3
This paper is concerned with an important issue in finite mixture modelling, the selection of the number of mixing components. We propose a new penalized likelihood method for model selection of finite multivariate Gaussian mixture models. The proposed method is shown to be statistically consistent in determining of the number of components. A modified EM algorithm is developed to simultaneously select the number of components and to estimate the mixing weights, i.e. the mixing probabilities, and unknown parameters of Gaussian distributions. Simulations and a real data analysis are presented to illustrate the performance of the proposed method.
FOS: Computer and information sciences, model selection, Classification and discrimination; cluster analysis (statistical aspects), Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), Methodology (stat.ME), Statistical ranking and selection procedures, Statistics - Machine Learning, FOS: Mathematics, Gaussian mixture models, EM algorithm, penalized likelihood, Statistics - Methodology
FOS: Computer and information sciences, model selection, Classification and discrimination; cluster analysis (statistical aspects), Mathematics - Statistics Theory, Machine Learning (stat.ML), Statistics Theory (math.ST), Methodology (stat.ME), Statistical ranking and selection procedures, Statistics - Machine Learning, FOS: Mathematics, Gaussian mixture models, EM algorithm, penalized likelihood, Statistics - Methodology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 31 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
