Adaptive Dynamic Programming and Optimal Control of Unknown Multiplayer Systems Based on Game Theory

Name: Adaptive Dynamic Programming and Optimal Control of Unknown Multiplayer Systems Based on Game Theory
Creator: Jingang Zhao
Keywords: nonzero-sum (NZS) games, neural network (NN), 0202 electrical engineering, electronic engineering, information engineering, Electrical engineering. Electronics. Nuclear engineering, 02 engineering and technology, multi-player systems, coupled Hamilton-Jacobi (HJ) equations, Adaptive dynamic programming, TK1-9971

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2022Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 10, pages 77,695-77,706 (eissn: 2169-3536,

Authors: Jingang Zhao;

doi: 10.1109/access.2022.3193505

Adaptive Dynamic Programming and Optimal Control of Unknown Multiplayer Systems Based on Game Theory

- Summary
- Subjects
- Metrics

Abstract

In this paper, we present a new adaptive dynamic programming (ADP) scheme to solve the optimal control problem of multi-player systems with unknown dynamics from the perspective of nonzero-sum (NZS) games. In the presented scheme, a new iterative equation is given. On the basis of the given iterative equation, the control policy and corresponding value function for each player can be learned by using the state and input data, which does not need to identify the system dynamics. To overcome the difficulty of unknown system dynamics, neural network (NN)-based function approximation techniques are employed in the implementation. Based on the given iterative equation and NN-based function approximation techniques, a new non-model-based ADP algorithm is developed. The convergence of the developed non-model-based ADP algorithm is rigorously analyzed and proved. Finally, two numerical simulation examples are provided to demonstrate the performance of the developed non-model-based ADP algorithm.

Related Organizations

Weifang University
China (People's Republic of)

Keywords

nonzero-sum (NZS) games, neural network (NN), Electrical engineering. Electronics. Nuclear engineering, multi-player systems, coupled Hamilton-Jacobi (HJ) equations, Adaptive dynamic programming, TK1-9971

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

Top 10%

Average

gold

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering