
Abstract The paper presents a novel spectral algorithm EVSA (eigenvector structure analysis), which uses eigenvalues and eigenvectors of the adjacency matrix in order to discover clusters. Based on matrix perturbation theory and properties of graph spectra we show that the adjacency matrix can be more suitable for partitioning than other Laplacian matrices. The main problem concerning the use of the adjacency matrix is the selection of the appropriate eigenvectors. We thus propose an approach based on analysis of the adjacency matrix spectrum and eigenvector pairwise correlations. Formulated rules and heuristics allow choosing the right eigenvectors representing clusters, i.e., automatically establishing the number of groups. The algorithm requires only one parameter-the number of nearest neighbors. Unlike many other spectral methods, our solution does not need an additional clustering algorithm for final partitioning. We evaluate the proposed approach using real-world datasets of different sizes. Its performance is competitive to other both standard and new solutions, which require the number of clusters to be given as an input parameter.
spectral clustering, Eigenvalues, singular values, and eigenvectors, Classification and discrimination; cluster analysis (statistical aspects), Graphs and linear algebra (matrices, eigenvalues, etc.), QA75.5-76.95, Electronic computers. Computer science, QA1-939, graph perturbation theory, adjacency matrix eigenvalues/eigenvectors, Mathematics, eigengap heuristics
spectral clustering, Eigenvalues, singular values, and eigenvectors, Classification and discrimination; cluster analysis (statistical aspects), Graphs and linear algebra (matrices, eigenvalues, etc.), QA75.5-76.95, Electronic computers. Computer science, QA1-939, graph perturbation theory, adjacency matrix eigenvalues/eigenvectors, Mathematics, eigengap heuristics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 5 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
