
handle: 11382/567734 , 11589/284881
Soft robots can adaptively interact with unstructured environments. However, nonlinear soft material properties challenge modeling and control. Learning‐based controllers that leverage efficient mechanical models are promising for solving complex interaction tasks. This article develops a closed‐loop pose/force controller for a dexterous soft manipulator enabling dynamic pushing tasks using deep reinforcement learning. Force tests investigate the mechanical properties of a soft robot module, resulting in orthogonal forces of N. Then, the policy is trained in simulation leveraging a dynamic Cosserat rod model of the soft robot. Domain randomization mitigate the sim‐to‐real gap while careful reward engineering induced pose and force control even without explicit force inputs. Despite the approximate simulation, the sim‐to‐real transfer achieved an average reaching distance of mm (), an average orientation error of rad () and applied pushing forces up to N. Such performance is reasonable for the intended assistive tasks of the manipulator. The experiments uncovered that the soft robot interacting with the environment exhibited torsional and counter‐balancing movements. Although not explicitly enforced, they emerged from the mechanical intelligence of the manipulator. The results demonstrate the potential of soft robotic manipulation via reinforcement learning.
reinforcement learning, Computer engineering. Computer hardware, Control engineering systems. Automatic machinery (General), soft robots, Dynamic control; manipulation; reinforcement learning; sim-to-real; soft robots; system modeling, TK7885-7895, manipulation, TJ212-225, sim‐to‐real, system modeling, dynamic control
reinforcement learning, Computer engineering. Computer hardware, Control engineering systems. Automatic machinery (General), soft robots, Dynamic control; manipulation; reinforcement learning; sim-to-real; soft robots; system modeling, TK7885-7895, manipulation, TJ212-225, sim‐to‐real, system modeling, dynamic control
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 17 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
