
Repertoire-based learning is a data-efficient adaptation approach based on a two-step process in which (1) a large and diverse set of policies is learned in simulation, and (2) a planning or learning algorithm chooses the most appropriate policies according to the current situation (e.g., a damaged robot, a new object, etc.). In this paper, we relax the assumption of previous works that a single repertoire is enough for adaptation. Instead, we generate repertoires for many different situations (e.g., with a missing leg, on different floors, etc.) and let our algorithm selects the most useful prior. Our main contribution is an algorithm, APROL (Adaptive Prior selection for Repertoire-based Online Learning) to plan the next action by incorporating these priors when the robot has no information about the current situation. We evaluate APROL on two simulated tasks: (1) pushing unknown objects of various shapes and sizes with a robotic arm and (2) a goal reaching task with a damaged hexapod robot. We compare with "Reset-free Trial and Error" (RTE) and various single repertoire-based baselines. The results show that APROL solves both the tasks in less interaction time than the baselines. Additionally, we demonstrate APROL on a real, damaged hexapod that quickly learns to pick compensatory policies to reach a goal by avoiding obstacles in the path.
Frontiers in Robotics and AI. Vol. 6, p. 151, 2020. Video : http://tiny.cc/aprol_video
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Robotics and AI, FOS: Computer and information sciences, Computer Science - Machine Learning, repertoire-based robot learning, Computer Science - Artificial Intelligence, [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Computer Science - Neural and Evolutionary Computing, QA75.5-76.95, Machine Learning (cs.LG), Computer Science - Robotics, [SPI.AUTO] Engineering Sciences [physics]/Automatic, Artificial Intelligence (cs.AI), Electronic computers. Computer science, TJ1-1570, data-efficient robot learning, fault tolerance in robotics, Mechanical engineering and machinery, Neural and Evolutionary Computing (cs.NE), Robotics (cs.RO), model-based learning, evolutionary robotics
[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI], Robotics and AI, FOS: Computer and information sciences, Computer Science - Machine Learning, repertoire-based robot learning, Computer Science - Artificial Intelligence, [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO], Computer Science - Neural and Evolutionary Computing, QA75.5-76.95, Machine Learning (cs.LG), Computer Science - Robotics, [SPI.AUTO] Engineering Sciences [physics]/Automatic, Artificial Intelligence (cs.AI), Electronic computers. Computer science, TJ1-1570, data-efficient robot learning, fault tolerance in robotics, Mechanical engineering and machinery, Neural and Evolutionary Computing (cs.NE), Robotics (cs.RO), model-based learning, evolutionary robotics
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 24 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
