Impact of Continuous Latent Variables on Imitation Learning Efficiency and Stability

SOVEREIGN Research Kernel

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Report

Data sources: ZENODO

Impact of Continuous Latent Variables on Imitation Learning Efficiency and Stability

descriptionPublicationkeyboard_double_arrow_right Report Under curation English Publisher:Zenodo

Authors: SOVEREIGN Research Kernel;

doi: 10.5281/zenodo.20651592

Impact of Continuous Latent Variables on Imitation Learning Efficiency and Stability

- Summary

Abstract

Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as robots, where many interactions can be impractical and time consuming. To address this problem, current learning approaches typically require task-specific knowledge in form of expert demonstrations, reResearch goal: What is the impact of replacing discrete action tokens with continuous latent variables on the sample efficiency and convergence stability of imitation learning policies trained on noisy, unlabeled video demonstrations?Autonomous synthesis report generated by SOVEREIGN Research Kernel. Tribunal consensus score: 8.7/10.

Found an issue? Give us feedback