
arXiv: 2309.06264
The spectral clustering algorithm is often used as a binary clustering method for unclassified data by applying the principal component analysis. To study theoretical properties of the algorithm, the assumption of conditional homoscedasticity is often supposed in existing studies. However, this assumption is restrictive and often unrealistic in practice. Therefore, in this paper, we consider the allometric extension model, that is, the directions of the first eigenvectors of two covariance matrices and the direction of the difference of two mean vectors coincide, and we provide a non-asymptotic bound of the error probability of the spectral clustering algorithm for the allometric extension model. As a byproduct of the result, we obtain the consistency of the clustering method in high-dimensional settings.
20 pages
Methodology (stat.ME), FOS: Computer and information sciences, non-asymptotic bound, high-dimension, principal component analysis, Statistics, FOS: Mathematics, 62H25, 62H30, Mathematics - Statistics Theory, Statistics Theory (math.ST), Statistics - Methodology
Methodology (stat.ME), FOS: Computer and information sciences, non-asymptotic bound, high-dimension, principal component analysis, Statistics, FOS: Mathematics, 62H25, 62H30, Mathematics - Statistics Theory, Statistics Theory (math.ST), Statistics - Methodology
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
