On Clustering: Mixture Model Averaging with the Generalized Hyperbolic Distribution

Ricciuti, Sarah

Thesis . 2017

Data sources: Canada Research, MacSphere

Publication: Thesis, 12 Oct 2017, Canada, English

Authors: Ricciuti, Sarah

handle: 11375/22147

Abstract

Cluster analysis is commonly described as the classification of unlabeled observations into groups such that observations within a group are more similar to one another than to observations in other groups. Model-based clustering assumes that the data arise from a statistical (mixture) model. Typically, many candidate models are fit to the data, and the 'best' model is selected by a model selection criterion (often the BIC in mixture model applications). This chosen model is then the only model used for making inferences on the data. Although this is common practice, it ignores a large component of model selection uncertainty, especially when the difference in the model selection criterion between two competing models is small. For this reason, recent interest has focused on selecting a subset of models that are close to the selected best model and using a weighted averaging approach to incorporate information from multiple models in this set. Model averaging is not a novel approach, yet it has seen little use in a clustering framework. Here, we use Occam's window to select a subset of models eligible for two types of averaging techniques: averaging a posteriori probabilities and direct averaging of model parameters. The efficacy of these model-based averaging approaches is demonstrated for a family of generalized hyperbolic mixture models using real and simulated data.
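The selection and averaging steps described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation. It assumes that BIC is defined so that lower is better (-2 log L + k log n), that posterior model probabilities are approximated by exp(-BIC/2), that the window ratio c = 20 is a conventional but arbitrary choice, and that all retained models have the same number of components with cluster labels already aligned. The function names `occams_window` and `average_memberships` are hypothetical.

```python
import numpy as np


def occams_window(bic, c=20.0):
    """Select models inside Occam's window and return their weights.

    Assumption: `bic` uses the lower-is-better convention, so that
    exp(-0.5 * BIC) approximates the posterior model probability
    up to a constant. `c` is the assumed window ratio.
    """
    bic = np.asarray(bic, dtype=float)
    # Posterior probability of each model relative to the best one.
    rel = np.exp(-0.5 * (bic - bic.min()))
    # Keep models whose relative probability is at least 1/c.
    keep = np.flatnonzero(rel >= 1.0 / c)
    # Normalize the weights over the retained models only.
    weights = rel[keep] / rel[keep].sum()
    return keep, weights


def average_memberships(z_list, weights):
    """Weighted average of a posteriori membership matrices.

    Assumption: every matrix in `z_list` has shape (n, G) and the
    component labels have already been aligned across models.
    """
    z = np.stack(z_list)                      # shape (M, n, G)
    z_avg = np.tensordot(weights, z, axes=1)  # shape (n, G)
    return z_avg, z_avg.argmax(axis=1)


# Illustrative values only, not results from the thesis.
bic = [1000.0, 1002.0, 1020.0]
z_models = [
    np.array([[0.9, 0.1], [0.4, 0.6]]),
    np.array([[0.8, 0.2], [0.7, 0.3]]),
    np.array([[0.5, 0.5], [0.5, 0.5]]),
]
keep, weights = occams_window(bic)
z_avg, labels = average_memberships([z_models[i] for i in keep], weights)
```

In this toy input the third model falls outside the window, so only the first two contribute to the averaged memberships. Direct averaging of model parameters would reuse the same weights but additionally requires that the retained models share a comparable parameterization.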

Master of Science (MSc)

Thesis

Country

Canada

Related Organizations

McMaster University
Canada

Keywords

Bayesian model averaging, Statistics, Occam's window, model averaging, finite mixture model, generalized hyperbolic distribution, clustering
