Two-Phase Iteration for Value Function Approximation and Hyperparameter Optimization in Gaussian-Kernel-Based Adaptive Critic Design

Article English OPEN
Xin Chen; Penghuan Xie; Yonghua Xiong; Yong He; Min Wu;
(2015)

Adaptive Dynamic Programming (ADP) with critic-actor architecture is an effective way to perform online learning control. To avoid the subjectivity in the design of a neural network that serves as a critic network, kernel-based adaptive critic design (ACD) was developed... View more
  • References (30)
    30 references, page 1 of 3

    Sutton, R., Barto, A.. Reinforcement Learning: An Introduction. 1998

    Lewis, F. L., Vrabie, D.. Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits and Systems Magazine. 2009; 9 (3): 32-50

    Lewis, F. L., Vrabie, D., Vamvoudakis, K.. Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers. IEEE Control Systems Magazine. 2012; 32 (6): 76-105

    Zhang, H., Luo, Y., Liu, D.. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Transactions on Neural Networks. 2009; 20 (9): 1490-1503

    Huang, Y., Liu, D.. Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm. Neurocomputing. 2014; 125: 46-56

    Xu, X., Zuo, L., Huang, Z.. Reinforcement learning algorithms with function approximation: recent advances and applications. Information Sciences. 2014; 261: 1-31

    Al-Tamimi, A., Lewis, F. L., Abu-Khalaf, M.. Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics. 2008; 38 (4): 943-949

    Ferrari, S., Steck, J. E., Chandramohan, R.. Adaptive feedback control by constrained approximate dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics. 2008; 38 (4): 982-987

    Wang, F.-Y., Zhang, H., Liu, D.. Adaptive dynamic programming: an introduction. IEEE Computational Intelligence Magazine. 2009; 4 (2): 39-47

    Wang, D., Liu, D., Wei, Q., Zhao, D., Jin, N.. Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming. Automatica. 2012; 48 (8): 1825-1832

  • Metrics
Share - Bookmark