Accelerated optimization landscape of linear–quadratic regulator

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2025Embargo end date: 01 Jan 2023 English Publisher:Elsevier BVJournal:Automatica, volume 171, page 111,927 (issn: 0005-1098,

Copyright policy )

Authors: Lechen Feng; Yuan-Hua Ni;

doi: 10.1016/j.automatica.2024.111927 , 10.48550/arxiv.2307.03590

arXiv: 2307.03590

Accelerated optimization landscape of linear–quadratic regulator

- Summary
- Subjects
- Metrics

Abstract

Linear-quadratic regulator (LQR) is a landmark problem in the field of optimal control, which is the concern of this paper. Generally, LQR is classified into state-feedback LQR (SLQR) and output-feedback LQR (OLQR) based on whether the full state is obtained. It has been suggested in existing literature that both SLQR and OLQR could be viewed as \textit{constrained nonconvex matrix optimization} problems in which the only variable to be optimized is the feedback gain matrix. In this paper, we introduce a first-order accelerated optimization framework of handling the LQR problem, and give its convergence analysis for the cases of SLQR and OLQR, respectively. Specifically, a Lipschiz Hessian property of LQR performance criterion is presented, which turns out to be a crucial property for the application of modern optimization techniques. For the SLQR problem, a continuous-time hybrid dynamic system is introduced, whose solution trajectory is shown to converge exponentially to the optimal feedback gain with Nesterov-optimal order $1-\frac{1}{\sqrtκ}$ ($κ$ the condition number). Then, the symplectic Euler scheme is utilized to discretize the hybrid dynamic system, and a Nesterov-type method with a restarting rule is proposed that preserves the continuous-time convergence rate, i.e., the discretized algorithm admits the Nesterov-optimal convergence order. For the OLQR problem, a Hessian-free accelerated framework is proposed, which is a two-procedure method consisting of semiconvex function optimization and negative curvature exploitation. In a time $\mathcal{O}(ε^{-7/4}\log(1/ε))$, the method can find an $ε$-stationary point of the performance criterion; this entails that the method improves upon the $\mathcal{O}(ε^{-2})$ complexity of vanilla gradient descent. Moreover, our method provides the second-order guarantee of stationary point.

Related Organizations

Nankai University
China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Nesterov-type method, Optimization and Control (math.OC), Linear-quadratic optimal control problems, FOS: Mathematics, Optimal feedback synthesis, linear-quadratic regulator, accelerated optimization, Mathematics - Optimization and Control, second-order guarantee, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Green