Adaptive Dynamic Programming (ADP) with critic-actor architecture is an effective way to perform online learning control. To avoid the subjectivity in the design of a neural network that serves as a critic network, kernel-based adaptive critic design (ACD) was developed... View more
Sutton, R., Barto, A..
Reinforcement Learning: An Introduction. 1998
Lewis, F. L., Vrabie, D.. Reinforcement learning and adaptive dynamic programming for feedback control.
IEEE Circuits and Systems Magazine. 2009; 9 (3): 32-50
Lewis, F. L., Vrabie, D., Vamvoudakis, K.. Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers.
IEEE Control Systems Magazine. 2012; 32 (6): 76-105
Zhang, H., Luo, Y., Liu, D.. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints.
IEEE Transactions on Neural Networks. 2009; 20 (9): 1490-1503
Huang, Y., Liu, D.. Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm.
Neurocomputing. 2014; 125: 46-56
Xu, X., Zuo, L., Huang, Z.. Reinforcement learning algorithms with function approximation: recent advances and applications.
Information Sciences. 2014; 261: 1-31
Al-Tamimi, A., Lewis, F. L., Abu-Khalaf, M.. Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof.
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics. 2008; 38 (4): 943-949
Ferrari, S., Steck, J. E., Chandramohan, R.. Adaptive feedback control by constrained approximate dynamic programming.
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics. 2008; 38 (4): 982-987