DTC: Deep Tracking Control

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 17 Jan 2024Embargo end date: 01 Jan 2023 English Publisher:American Association for the Advancement of Science (AAAS)Journal:Science Robotics, volume 9 (eissn: 2470-9476,

Copyright policy )Funded by:SNSF | Perceptive Dynamic Locomo..., EC | LeMo, SNSF | Data-driven control appro...

Authors: Fabian Jenelten; Junzhe He; Farbod Farshidian; Marco Hutter 0001;

doi: 10.1126/scirobotics.adh5401 , 10.48550/arxiv.2309.15462 , 10.3929/ethz-b-000654755

pmid: 38232148

arXiv: 2309.15462

DTC: Deep Tracking Control

- Summary
- Subjects
- Metrics

Abstract

Legged locomotion is a complex control problem that requires both accuracy and robustness to cope with real-world challenges. Legged systems have traditionally been controlled using trajectory optimization with inverse dynamics. Such hierarchical model-based methods are appealing because of intuitive cost function tuning, accurate planning, generalization, and, most importantly, the insightful understanding gained from more than one decade of extensive research. However, model mismatch and violation of assumptions are common sources of faulty operation. Simulation-based reinforcement learning, on the other hand, results in locomotion policies with unprecedented robustness and recovery skills. Yet, all learning algorithms struggle with sparse rewards emerging from environments where valid footholds are rare, such as gaps or stepping stones. In this work, we propose a hybrid control architecture that combines the advantages of both worlds to simultaneously achieve greater robustness, foot-placement accuracy, and terrain generalization. Our approach uses a model-based planner to roll out a reference motion during training. A deep neural network policy is trained in simulation, aiming to track the optimized footholds. We evaluated the accuracy of our locomotion pipeline on sparse terrains, where pure data-driven methods are prone to fail. Furthermore, we demonstrate superior robustness in the presence of slippery or deformable ground when compared with model-based counterparts. Last, we show that our proposed tracking controller generalizes across different trajectory optimization methods not seen during training. In conclusion, our work unites the predictive capabilities and optimality guarantees of online planning with the inherent robustness attributed to offline learning.

Related Organizations

ETH Zurich
Switzerland
ETH-Zurich
Switzerland

Keywords

Optimization, FOS: Computer and information sciences, Computer Science - Machine Learning, ANYmal, Technology (applied sciences), Robotics, Systems and Control (eess.SY), Electrical Engineering and Systems Science - Systems and Control, Machine Learning (cs.LG), Computer Science - Robotics, Locomotion Control, Legged Robots, FOS: Electrical engineering, electronic engineering, information engineering, Legged locomotion, info:eu-repo/classification/ddc/600, REINFORCEMENT LEARNING (ARTIFICIAL INTELLIGENCE), MPC (Model-based Predictive Control), Robotics (cs.RO)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	76
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 1%

Found an issue? Give us feedback

76

Top 1%

Green

Fields of Science (3) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all

Funded by

SNSF| Perceptive Dynamic Locomotion on Rough Terrain, EC| LeMo, SNSF| Data-driven control approaches for advanced legged locomotion