
arXiv: 2209.12079
The purpose of this paper is to study the fractal phenomena in large data sets and the associated questions of dimension reduction. We examine situations where the classical Principal Component Analysis is not effective in identifying the salient underlying fractal features of the data set. Instead, we employ the discrete energy, a technique borrowed from geometric measure theory, to limit the number of points of a given data set that lie near a $k$-dimensional hyperplane, or, more generally, near a set of a given upper Minkowski dimension. Concrete motivations stemming from naturally arising data sets are described and future directions outlined.
Mathematics - Classical Analysis and ODEs, Classical Analysis and ODEs (math.CA), FOS: Mathematics, 28A75, 62R07, Mathematics - Statistics Theory, Statistics Theory (math.ST)
Mathematics - Classical Analysis and ODEs, Classical Analysis and ODEs (math.CA), FOS: Mathematics, 28A75, 62R07, Mathematics - Statistics Theory, Statistics Theory (math.ST)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
