descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2024Publisher:Elsevier BVJournal:Robotics and Computer-Integrated Manufacturing, volume 91, page 102,857 (issn: 0736-5845,

Authors: Andrea Testa; Marco Laghi; Edoardo Del Bianco; Gennaro Raiola; Enrico Mingo Hoffman; Arash Ajoudani;

doi: 10.2139/ssrn.4745571 , 10.1016/j.rcim.2024.102857

A Stable Method for Task Priority Adaptation in Quadratic Programming Via Reinforcement Learning

- Summary
- Subjects
- Metrics

Abstract

In emerging manufacturing facilities, robots must enhance their flexibility. They are expected to perform complex jobs, showing different behaviors on the need, all within unstructured environments, and without requiring reprogramming or setup adjustments. To address this challenge, we introduce the A3CQP, a non-strict hierarchical Quadratic Programming (QP) controller. This controller seamlessly combines both motion and interaction functionalities, with priorities dynamically and autonomously adapted through a Reinforcement Learningbased adaptation module. This module utilizes the Asynchronous Advantage Actor-Critic algorithm (A3C) to ensure rapid convergence and stable training within continuous action and observation spaces. The experimental validation, involving a collaborative peg-in-hole assembly and the polishing of a wooden plate, demonstrates the effectiveness of the proposed solution in terms of its automatic adaptability, responsiveness, and safety.

Related Organizations

French National Centre for Scientific Research
France
University of Lorraine
France
Inria Centre at Université de Lorraine
France
Defence Research and Development Organisation
India
Italian Institute of Technology
Italy

View all View all

Keywords

[INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Optimization and Optimal Control, Machine Learning for Robot Control, [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG], Optimization and Optimal Control Reinforcement Learning Machine Learning for Robot Control, Reinforcement Learning

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average