On the Robustness of the Successive Projection Algorithm

Name: On the Robustness of the Successive Projection Algorithm
Keywords: Machine Learning, FOS: Computer and information sciences, Numerical Analysis, Data Structures and Algorithms, FOS: Mathematics, Data Structures and Algorithms (cs.DS), Machine Learning (stat.ML), Numerical Analysis (math.NA), Machine Learning (cs.LG)

Giovanni Barbarino; Nicolas Gillis

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

SIAM Journal on Matrix Analysis and Applications

Article . 2025 . Peer-reviewed

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2024

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

DBLP

Article

Data sources: DBLP

SIAM Journal on Matrix Analysis and Applications

Article . 2025 . Peer-reviewed

Data sources: European Union Open Data Portal

On the Robustness of the Successive Projection Algorithm

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 19 Sep 2025Embargo end date: 01 Jan 2024 English Publisher:Society for Industrial & Applied Mathematics (SIAM)Journal:SIAM Journal on Matrix Analysis and Applications, volume 46, pages 2,140-2,170 (issn: 0895-4798, eissn: 1095-7162,

Copyright policy )Funded by:EC | eLinoR

Authors: Giovanni Barbarino; Nicolas Gillis;

doi: 10.1137/24m171293x , 10.48550/arxiv.2411.16195

arXiv: 2411.16195

On the Robustness of the Successive Projection Algorithm

- Summary
- Subjects
- Metrics

Abstract

The successive projection algorithm (SPA) is a workhorse algorithm to learn the $r$ vertices of the convex hull of a set of $(r-1)$-dimensional data points, a.k.a. a latent simplex, which has numerous applications in data science. In this paper, we revisit the robustness to noise of SPA and several of its variants. In particular, when $r \geq 3$, we prove the tightness of the existing error bounds for SPA and for two more robust preconditioned variants of SPA. We also provide significantly improved error bounds for SPA, by a factor proportional to the conditioning of the $r$ vertices, in two special cases: for the first extracted vertex, and when $r \leq 2$. We then provide further improvements for the error bounds of a translated version of SPA proposed by Arora et al. (''A practical algorithm for topic modeling with provable guarantees'', ICML, 2013) in two special cases: for the first two extracted vertices, and when $r \leq 3$. Finally, we propose a new more robust variant of SPA that first shifts and lifts the data points in order to minimize the conditioning of the problem. We illustrate our results on synthetic data.

26 pages, revised version, new experiments to study the conditioning of the preprocessed vertices

Related Organizations

Université de Mons (UMONS)
Belgium
University of Mons
Belgium

Keywords

Machine Learning, FOS: Computer and information sciences, Numerical Analysis, Data Structures and Algorithms, FOS: Mathematics, Data Structures and Algorithms (cs.DS), Machine Learning (stat.ML), Numerical Analysis (math.NA), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Funded by

EC| eLinoR