Adaptive Prior Selection for Repertoire-Based Online Adaptation in Robotics

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type , Preprint 20 Jan 2020Embargo end date: 01 Jan 2019 France Publisher:Frontiers Media SAJournal:Frontiers in Robotics and AI, volume 6 (eissn: 2296-9144,

Copyright policy )Funded by:EC | ResiBots, CHIST-ERA | Heap

Authors: Kaushik, Rituraj; Desreumaux, Pierre; Mouret, Jean-Baptiste;

APC: 1,761.5 EUR

doi: 10.3389/frobt.2019.00151 , 10.48550/arxiv.1907.07029

pmid: 33501166

pmc: PMC7805922

arXiv: 1907.07029

Adaptive Prior Selection for Repertoire-Based Online Adaptation in Robotics

- Summary
- Subjects
- Related research
  (4)
- Metrics

Abstract

Repertoire-based learning is a data-efficient adaptation approach based on a two-step process in which (1) a large and diverse set of policies is learned in simulation, and (2) a planning or learning algorithm chooses the most appropriate policies according to the current situation (e.g., a damaged robot, a new object, etc.). In this paper, we relax the assumption of previous works that a single repertoire is enough for adaptation. Instead, we generate repertoires for many different situations (e.g., with a missing leg, on different floors, etc.) and let our algorithm selects the most useful prior. Our main contribution is an algorithm, APROL (Adaptive Prior selection for Repertoire-based Online Learning) to plan the next action by incorporating these priors when the robot has no information about the current situation. We evaluate APROL on two simulated tasks: (1) pushing unknown objects of various shapes and sizes with a robotic arm and (2) a goal reaching task with a damaged hexapod robot. We compare with "Reset-free Trial and Error" (RTE) and various single repertoire-based baselines. The results show that APROL solves both the tasks in less interaction time than the baselines. Additionally, we demonstrate APROL on a real, damaged hexapod that quickly learns to pick compensatory policies to reach a goal by avoiding obstacles in the path.

Frontiers in Robotics and AI. Vol. 6, p. 151, 2020. Video : http://tiny.cc/aprol_video

Country

France

Related Organizations

French National Centre for Scientific Research
France
University of York
United Kingdom
Université de Lorraine
France
French Institute for Research in Computer Science and Automation
France
Université de Lorraine
France

View all View all

Keywords

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Robotics and AI, FOS: Computer and information sciences, Computer Science - Machine Learning, repertoire-based robot learning, Computer Science - Artificial Intelligence, [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Computer Science - Neural and Evolutionary Computing, QA75.5-76.95, Machine Learning (cs.LG), Computer Science - Robotics, [SPI.AUTO] Engineering Sciences [physics]/Automatic, Artificial Intelligence (cs.AI), Electronic computers. Computer science, TJ1-1570, data-efficient robot learning, fault tolerance in robotics, Mechanical engineering and machinery, Neural and Evolutionary Computing (cs.NE), Robotics (cs.RO), model-based learning, evolutionary robotics

4 Research products, page 1 of 1

Repertoire-Based Methods for Detecting and Quantifying Social Transmission
2013IsAmongTopNSimilarDocuments
kaushik_2019_aprol software on GitHub
IsRelatedTo
spyn software on GitHub
IsRelatedTo
chatzilygeroudis_2018_rte software on GitHub
IsRelatedTo

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	24
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%