Downloads provided by UsageCounts
arXiv: 1910.14475
handle: 10261/219797 , 2117/335534
In this paper we present a Deep Reinforcement Learning approach to solve dynamic cloth manipulation tasks. Differing from the case of rigid objects, we stress that the followed trajectory (including speed and acceleration) has a decisive influence on the final state of cloth, which can greatly vary even if the positions reached by the grasped points are the same. We explore how goal positions for non-grasped points can be attained through learning adequate trajectories for the grasped points. Our approach uses few demonstrations to improve control policy learning, and a sparse reward approach to avoid engineering complex reward functions. Since perception of textiles is challenging, we also study different state representations to assess the minimum observation space required for learning to succeed. Finally, we compare different combinations of control policy encodings, demonstrations, and sparse reward learning techniques, and show that our proposed approach can learn dynamic cloth manipulation in an efficient way, i.e., using a reduced observation space, a few demonstrations, and a sparse reward.
6 pages, 5 figures, accepted at International Conference on Robotics and Automation ICRA'2020
Deep reinforcement learning, FOS: Computer and information sciences, Àrees temàtiques de la UPC::Informàtica::Robòtica, Classificació INSPEC::Automation::Robots, Dynamic manipulation, Intelligent robots, Learning in simulation, Manipulators, :Automation::Robots [Classificació INSPEC], Computer Science - Robotics, Deformable object manipulation, :Informàtica::Robòtica [Àrees temàtiques de la UPC], Robotics (cs.RO)
Deep reinforcement learning, FOS: Computer and information sciences, Àrees temàtiques de la UPC::Informàtica::Robòtica, Classificació INSPEC::Automation::Robots, Dynamic manipulation, Intelligent robots, Learning in simulation, Manipulators, :Automation::Robots [Classificació INSPEC], Computer Science - Robotics, Deformable object manipulation, :Informàtica::Robòtica [Àrees temàtiques de la UPC], Robotics (cs.RO)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 64 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 1% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 1% |
| views | 58 | |
| downloads | 117 |

Views provided by UsageCounts
Downloads provided by UsageCounts