
A sample of multivariate observations is assumed to be drawn from a mixture of a given number of underlying populations. The mixture likelihood approach to clustering is used to allocate each individual in the sample to its population of origin on the basis of the estimated posterior probabilities of population membership. Estimation of the correct allocation rate is considered for individual populations as well as for the overall mixture by averaging functions of the maximum of these posterior probabilities. The estimates of the correct allocation rates provide a means of assessing the performance of the mixture approach to clustering. The bootstrap technique is investigated for its effectiveness in reducing the bias of the estimates so obtained. Results are reported for three real data sets and a simulation study. It is demonstrated that the proposed estimates generally provide useful information on the unobservable allocation rates of the mixture approach. Encouraging results are obtained for the bootstrap method of bias correction applied to the estimates of the individual and overall allocation rates.
Allocation rate correct, Mixture maximum likelihood method, Bootstrap method, Posterior probability estimate, 310, 1804 Statistics, Bias correction, Probability and Uncertainty, 1804 Statistics, Probability and Uncertainty, 2613 Statistics and Probability
Allocation rate correct, Mixture maximum likelihood method, Bootstrap method, Posterior probability estimate, 310, 1804 Statistics, Bias correction, Probability and Uncertainty, 1804 Statistics, Probability and Uncertainty, 2613 Statistics and Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 22 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
