On the Incommensurability Phenomenon

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jul 2016Embargo end date: 01 Jan 2013 English Publisher:Springer Science and Business Media LLCJournal:Journal of Classification, volume 33, pages 185-209 (issn: 0176-4268, eissn: 1432-1343,

Copyright policy )

Authors: Donniell E. Fishkind; Cencheng Shen; Youngser Park; Carey E. Priebe;

doi: 10.1007/s00357-016-9203-9 , 10.48550/arxiv.1301.1954

arXiv: 1301.1954

On the Incommensurability Phenomenon

- Summary
- Subjects
- Metrics

Abstract

Suppose that two large, multi-dimensional data sets are each noisy measurements of the same underlying random process, and principle components analysis is performed separately on the data sets to reduce their dimensionality. In some circumstances it may happen that the two lower-dimensional data sets have an inordinately large Procrustean fitting-error between them. The purpose of this manuscript is to quantify this "incommensurability phenomenon." In particular, under specified conditions, the square Procrustean fitting-error of the two normalized lower-dimensional data sets is (asymptotically) a convex combination (via a correlation parameter) of the Hausdorff distance between the projection subspaces and the maximum possible value of the square Procrustean fitting-error for normalized data. We show how this gives rise to the incommensurability phenomenon, and we employ illustrative simulations as well as a real data experiment to explore how the incommensurability phenomenon may have an appreciable impact.

Related Organizations

View all View all

Keywords

FOS: Computer and information sciences, Classification and discrimination; cluster analysis (statistical aspects), Grassmannian, Hausdorff distance, Machine Learning (stat.ML), incommensurability phenomenon, Library and Information Sciences, Factor analysis and principal components; correspondence analysis, Mathematics (miscellaneous), Procrustes fitting, Statistics - Machine Learning, principal components analysis, Psychology (miscellaneous), Statistics, Probability and Uncertainty

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Green

hybrid