
arXiv: 2505.08122
This study presents an approach to Model Predictive Control (MPC) that combines Koopman theory with Deep Reinforcement Learning (DRL). By lifting nonlinear dynamical systems into a higher-dimensional space in which their dynamics evolve linearly, the Koopman operator enables linear treatment of nonlinear behavior and thus more efficient control strategies. Our method pairs the predictive accuracy of Koopman-based models with the optimization capabilities of DRL, specifically the Proximal Policy Optimization (PPO) algorithm, to improve the controller's performance. The resulting end-to-end learning framework tailors the predictive control policies to specific operational tasks, optimizing both control performance and economic efficiency. We validate the approach through NMPC and eNMPC case studies, in which the Koopman-RL controller outperforms traditional controllers, achieving higher stability, better constraint satisfaction, and significant cost savings. These findings suggest that the model is a robust tool for complex control tasks and offer insight into future applications of RL in MPC.
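The Koopman lifting described above can be illustrated with a minimal EDMD-style sketch. This is not the paper's implementation: the toy dynamics, the dictionary of observables `lift`, and all names below are assumptions chosen so that the lifted dynamics are exactly linear, making the idea easy to verify.

```python
import numpy as np

def step(x):
    # Hypothetical nonlinear system: x1' = 0.9*x1, x2' = 0.5*x2 + x1**2
    x1, x2 = x
    return np.array([0.9 * x1, 0.5 * x2 + x1 ** 2])

def lift(x):
    # Dictionary of observables psi(x) = [x1, x2, x1**2]; chosen (an
    # assumption) so the dynamics become linear in the lifted space.
    x1, x2 = x
    return np.array([x1, x2, x1 ** 2])

rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 2))   # sampled states
Y = np.array([step(x) for x in X])          # one-step successors

Psi_X = np.array([lift(x) for x in X])      # lifted snapshot matrices
Psi_Y = np.array([lift(y) for y in Y])

# Least-squares fit of the finite-dimensional Koopman matrix K,
# solving Psi_Y ~= Psi_X @ K.T row-wise.
K = np.linalg.lstsq(Psi_X, Psi_Y, rcond=None)[0].T

# Linear one-step prediction in the lifted space; the first two
# observables are the state itself, so they recover the nonlinear step.
x0 = np.array([0.3, -0.2])
pred = (K @ lift(x0))[:2]
```

Because the chosen observables span an invariant subspace here, the linear model is exact; for general systems the dictionary only approximates the Koopman operator, and the resulting linear model is what an MPC layer would optimize over.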
arXiv admin note: This version has been removed by arXiv administrators due to copyright infringement and inappropriate text reuse from external sources
93A30 (Mathematical Systems Theory), 68T05 (Learning and Adaptive Systems), 68Q32 (Computational Learning Theory), FOS: Electrical engineering, electronic engineering, information engineering, Systems and Control (eess.SY), Electrical Engineering and Systems Science - Systems and Control
