Data‐driven policy iteration algorithm for optimal control of continuous‐time Itô stochastic systems with Markovian jumps

Publication: Article, 01 Aug 2016, English
Publisher: Institution of Engineering and Technology (IET)
Journal: IET Control Theory & Applications, volume 10, pages 1431-1439 (ISSN: 1751-8644, eISSN: 1751-8652)

Authors: Song, Jun; He, Shuping; Liu, Fei; Niu, Yugang; Ding, Zhengtao

doi: 10.1049/iet-cta.2015.0973

Abstract

This study investigates the infinite-horizon optimal control problem for a class of continuous‐time systems subject to multiplicative noise and Markovian jumps, using a data‐driven policy iteration algorithm. The optimal control problem is equivalent to solving a stochastic coupled algebraic Riccati equation (CARE). An off‐line iteration algorithm, generalised from an implicit iterative algorithm, is first established and shown to converge to the solution of the stochastic CARE. By applying the subsystems transformation (ST) technique, the off‐line iterative algorithm is decoupled into N parallel Kleinman iterative equations. To learn the solution of the stochastic CARE from the data of the N decomposed linear subsystems, an ST‐based data‐driven policy iteration algorithm is proposed and its convergence is proved. Finally, a numerical example is given to illustrate the effectiveness and applicability of the two proposed iterative algorithms.
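
As a point of reference for the Kleinman iteration mentioned in the abstract, the following is a minimal sketch of the classical, model-based Kleinman policy iteration for a scalar deterministic system. It is not the paper's ST-based or data-driven algorithm: there is no multiplicative noise, no Markovian jump, and the model is assumed known. The system dx/dt = a*x + b*u, the cost weights q and r, and the function names are illustrative assumptions.

```python
import math


def kleinman_scalar(a, b, q, r, k0, iters=50, tol=1e-12):
    """Classical Kleinman policy iteration for a scalar LQR problem.

    Assumed (hypothetical) setting: dx/dt = a*x + b*u with u = -k*x and
    cost integral of (q*x^2 + r*u^2) dt. The initial gain k0 must be
    stabilizing, i.e. a - b*k0 < 0.
    """
    if a - b * k0 >= 0:
        raise ValueError("k0 must be stabilizing: a - b*k0 < 0")
    k = k0
    p_prev = math.inf
    p = p_prev
    for _ in range(iters):
        # Policy evaluation: scalar Lyapunov equation
        #   2*(a - b*k)*p + q + r*k^2 = 0
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: k = b*p / r
        k = b * p / r
        if abs(p - p_prev) < tol:
            break
        p_prev = p
    return p, k


def riccati_scalar(a, b, q, r):
    """Closed-form stabilizing root of 2*a*p - (b^2 / r)*p^2 + q = 0."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    p, k = kleinman_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
    print("iterated p:", p, "gain k:", k)
    print("closed-form p:", riccati_scalar(1.0, 1.0, 1.0, 1.0))
```

Each pass solves a linear (Lyapunov) equation for the current gain and then updates the gain, so the nonlinear Riccati equation is never solved directly. According to the abstract, the paper's ST technique reduces the coupled stochastic problem to N parallel iterations of this Kleinman type.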

Related Organizations

Chinese Academy of Sciences, China (People's Republic of)
Anhui University, China (People's Republic of)
Ministry of Education of the People's Republic of China, China (People's Republic of)
University of Salford, United Kingdom
Institute of Automation, China (People's Republic of)


Impact by BIP!

Selected citations: 36. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.