
In emerging manufacturing facilities, robots must enhance their flexibility. They are expected to perform complex jobs, showing different behaviors on the need, all within unstructured environments, and without requiring reprogramming or setup adjustments. To address this challenge, we introduce the A3CQP, a non-strict hierarchical Quadratic Programming (QP) controller. This controller seamlessly combines both motion and interaction functionalities, with priorities dynamically and autonomously adapted through a Reinforcement Learningbased adaptation module. This module utilizes the Asynchronous Advantage Actor-Critic algorithm (A3C) to ensure rapid convergence and stable training within continuous action and observation spaces. The experimental validation, involving a collaborative peg-in-hole assembly and the polishing of a wooden plate, demonstrates the effectiveness of the proposed solution in terms of its automatic adaptability, responsiveness, and safety.
[INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Optimization and Optimal Control, Machine Learning for Robot Control, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], Optimization and Optimal Control Reinforcement Learning Machine Learning for Robot Control, Reinforcement Learning
[INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Optimization and Optimal Control, Machine Learning for Robot Control, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], Optimization and Optimal Control Reinforcement Learning Machine Learning for Robot Control, Reinforcement Learning
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
