Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

Name: Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control
Keywords: FOS: Computer and information sciences, Computer Science - Robotics, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Robotics (cs.RO), Machine Learning (cs.LG)

Wang, Haochen; Shi, Zhiwei; Zhu, Chengxi; Qiao, Yafei; Zhang, Cheng; Yang, Fan; Ren, Pengjie; Lu, Lan; Xuan, Dong

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2025

Data sources: arXiv.org e-Print Archive

https://doi.org/10.1109/icra55...

Article . 2025 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2025

License: arXiv Non-Exclusive Distribution

Data sources: Datacite

Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 19 May 2025Embargo end date: 01 Jan 2025Publisher:IEEEJournal:2025 IEEE International Conference on Robotics and Automation (ICRA)

Authors: Wang, Haochen; Shi, Zhiwei; Zhu, Chengxi; Qiao, Yafei; Zhang, Cheng; Yang, Fan; Ren, Pengjie; +2 Authors

doi: 10.1109/icra55743.2025.11127940 , 10.48550/arxiv.2504.17771

arXiv: 2504.17771

Integrating Learning-Based Manipulation and Physics-Based Locomotion for Whole-Body Badminton Robot Control

- Summary
- Subjects
- Metrics

Abstract

Learning-based methods, such as imitation learning (IL) and reinforcement learning (RL), can produce excel control policies over challenging agile robot tasks, such as sports robot. However, no existing work has harmonized learning-based policy with model-based methods to reduce training complexity and ensure the safety and stability for agile badminton robot control. In this paper, we introduce Hamlet, a novel hybrid control system for agile badminton robots. Specifically, we propose a model-based strategy for chassis locomotion which provides a base for arm policy. We introduce a physics-informed "IL+RL" training framework for learning-based arm policy. In this train framework, a model-based strategy with privileged information is used to guide arm policy training during both IL and RL phases. In addition, we train the critic model during IL phase to alleviate the performance drop issue when transitioning from IL to RL. We present results on our self-engineered badminton robot, achieving 94.5% success rate against the serving machine and 90.7% success rate against human players. Our system can be easily generalized to other agile mobile manipulation tasks such as agile catching and table tennis. Our project website: https://dreamstarring.github.io/HAMLET/.

Accepted to ICRA 2025. Project page: https://dreamstarring.github.io/HAMLET/

Related Organizations

Carnegie Mellon University
United States
Shandong Women’s University
China (People's Republic of)
Shanghai Jiao Tong University
China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Robotics, Computer Science - Machine Learning, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Robotics (cs.RO), Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green