Downloads provided by UsageCounts
handle: 10261/18069
This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and Reinforcement Learning (RL) techniques. This approach, called Case Based Heuristically Accelerated Reinforcement Learning (CB-HARL), builds upon an emerging technique, the Heuristic Accelerated Reinforcement Learning (HARL), in which RL methods are accelerated by making use of heuristic information. CB-HARL is a subset of RL that makes use of a heuristic function derived from a case base, in a Case Based Reasoning manner. An algorithm that incorporates CBR techniques into the Heuristically Accelerated Q–Learning is also proposed. Empirical evaluations were conducted in a simulator for the RoboCup Four-Legged Soccer Competition, and results obtained shows that using CB-HARL, the agents learn faster than using either RL or HARL methods.
This work has been partially funded by the FI grant and the BE grant from the AGAUR, the 2005-SGR-00093 project, supported by the Generalitat de Catalunya, the MID-CBR project grant TIN 2006-15140-C03-01 and FEDER funds. Reinaldo Bianchi is supported by CNPq grant 201591/2007-3 and FAPESP grant 2009/01610-1.
The original publication is available at www.springerlink.com
Peer reviewed
Multiagent Learning, Case Based Heuristically Accelerated Reinforcement Learning, Case based reasoning, Reinforcement Learning, CBR, 004, Case-based heuristically accelerated reinforcement learning, Case-based reasoning, XXXXXX - Unknown, Reinforcement learning, Multiagent learning
Multiagent Learning, Case Based Heuristically Accelerated Reinforcement Learning, Case based reasoning, Reinforcement Learning, CBR, 004, Case-based heuristically accelerated reinforcement learning, Case-based reasoning, XXXXXX - Unknown, Reinforcement learning, Multiagent learning
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 20 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 43 | |
| downloads | 81 |

Views provided by UsageCounts
Downloads provided by UsageCounts