2.2. Preliminary For the approximation of a continuous function h(Z) : Rr → R, the following Radial Function Basis (RBF) Neural Networks (NN) are used :
 J. P. Merlet (Ed.), Parallel robots, Vol. 74, Springer, 2001.
 R. Bonitz, T. Hsia, Robust internal-force based impedance control for coordinating manipulators-theory and experiments, in: Proceedings of IEEE International Conference on Robotics and Automation, Vol. 1, 1996, pp. 622-628.
 Z. Li, P. Y. Tao, S. S. Ge, M. Adams, W. Wijesoma, Robust adaptive control of cooperating mobile manipulators with relative motion, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics 39 (1) (2009) 103-116.
 Y. Li, S. S. Ge, Impedance learning for robots interacting with unknown environments, IEEE Transactions on Control Systems Technology 22 (4) (2014) 1422-1432.
 H. Kwakernaak, R. Sivan, Linear Optimal Control Systems, John Wiley & Sons, Inc. New York, NY, USA, 1972.
 R. S. Sutton, A. G. Barto, R. J. Williams, Reinforcement learning is direct adaptive optimal control, IEEE Control Systems 12 (2) (1992) 19-22.
 X. Liu, S. Balakrishnan, Convergence analysis of adaptive critic based optimal control, in: Proceedings of American Control Conference, Vol. 3, 2000, pp. 1929-1933.
 J. J. Murray, C. J. Cox, G. G. Lendaris, R. Saeks, Adaptive dynamic programming, IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 32 (2) (2002) 140-153.
 M. T. Rosenstein, A. G. Barto, Supervised actor-critic reinforcement learning, Learning and Approximate Dynamic Programming: Scaling Up to the Real World (2004) 359-380.
 P. J. Werbos, Using ADP to understand and replicate brain intelligence: The next level design, Proceedings of the 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (2007) 209-216. [OpenAIRE]
 P. J. Werbos, Intelligence in the brain: A theory of how it works and how to build it, Neural Networks 22 (3) (2009) 200-212. [OpenAIRE]
 D. Vrabie, F. Lewis, Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems, Neural Networks 22 (3) (2009) 237-246.
 D. P. Bertsekas, D. P. Bertsekas, D. P. Bertsekas, D. P. Bertsekas, Dynamic programming and optimal control, Vol. 1, Athena Scientific Belmont, MA, 1995.
 K. Doya, Reinforcement learning in continuous time and space, Neural computation 12 (1) (2000) 219-245. [OpenAIRE]