Hyperparameter tuning via trajectory predictions: stochastic prox-linear methods in matrix sensing

Name: Hyperparameter tuning via trajectory predictions: stochastic prox-linear methods in matrix sensing
Keywords: Statistics - Machine Learning, Mathematics - Statistics Theory, Mathematics - Optimization and Control

Mengqi Lou; Kabir Aladin Verchand; Ashwin Pananjady

Found an issue? Give us feedback

Mathematical Program...arrow_drop_down

Mathematical Programming

Article . 2025 . Peer-reviewed

License: CC BY

Data sources: Crossref

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

Research Collection

Conference object . 2024

License: http://rightsstatements.org/page/InC-NC/1.0/

Data sources: Research Collection

ETH Zürich Research Collection

Conference object . 2024

Data sources: Datacite

Hyperparameter tuning via trajectory predictions: stochastic prox-linear methods in matrix sensing

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Preprint 17 Sep 2025Embargo end date: 06 Mar 2024 English Publisher:Springer Science and Business Media LLCJournal:Mathematical Programming (issn: 0025-5610, eissn: 1436-4646,

Copyright policy )

Authors: Mengqi Lou; Kabir Aladin Verchand; Ashwin Pananjady;

doi: 10.1007/s10107-025-02279-0 , 10.3929/ethz-b-000664567

arXiv: 2402.01599

handle: 20.500.11850/664567

Hyperparameter tuning via trajectory predictions: stochastic prox-linear methods in matrix sensing

- Summary
- Subjects
- Metrics

Abstract

Abstract Motivated by the desire to understand stochastic algorithms for nonconvex optimization that are robust to their hyperparameter choices, we analyze a mini-batched prox-linear iterative algorithm for the canonical problem of recovering an unknown rank-1 matrix from rank-1 Gaussian measurements corrupted by noise. We derive a deterministic recursion that predicts the error of this method and show, using a non-asymptotic framework, that this prediction is accurate for any batch-size and a large range of step-sizes. In particular, our analysis reveals that this method, though stochastic, converges linearly from a local initialization with a fixed step-size to a statistical error floor. Our analysis also exposes how the batch-size, step-size, and noise level affect the (linear) convergence rate and the eventual statistical estimation error, and we demonstrate how to use our deterministic predictions to perform hyperparameter tuning (e.g. step-size and batch-size selection) without ever running the method. On a technical level, our analysis is enabled in part by showing that the fluctuations of the empirical iterates around our deterministic predictions scale with the error of the previous iterate.

Related Organizations

University of California System
United States
UNIVERSITY OF CAMBRIDGE
University of Cambridge
Statistical Laboratory University of Cambridge
United Kingdom
University of Cambridge
United Kingdom

View all View all

Keywords

Statistics - Machine Learning, Mathematics - Statistics Theory, Mathematics - Optimization and Control

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

hybrid