Improving Reinforcement Learning by Using Case Based Heuristics

Publication type: Part of book or chapter of book, Article, Conference object

Date: 01 Jan 2009

Publisher: Springer Berlin Heidelberg

Authors: Reinaldo A. C. Bianchi; Raquel Ros; Ramón López de Mántaras

doi: 10.1007/978-3-642-02998-1_7

handle: 10261/18069

Abstract

This work presents a new approach that uses the cases in a case base as heuristics to speed up Reinforcement Learning algorithms, combining Case Based Reasoning (CBR) and Reinforcement Learning (RL) techniques. This approach, called Case Based Heuristically Accelerated Reinforcement Learning (CB-HARL), builds upon an emerging technique, Heuristically Accelerated Reinforcement Learning (HARL), in which RL methods are accelerated by making use of heuristic information. CB-HARL is a subset of RL that makes use of a heuristic function derived from a case base, in a Case Based Reasoning manner. An algorithm that incorporates CBR techniques into Heuristically Accelerated Q-Learning is also proposed. Empirical evaluations were conducted in a simulator for the RoboCup Four-Legged Soccer Competition, and the results show that agents using CB-HARL learn faster than agents using either RL or HARL methods.
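The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea: a case retrieved from a case base suggests an action, and a heuristic bonus biases an otherwise ordinary epsilon-greedy Q-learning choice toward it. Everything here is an assumption made for illustration and is not taken from the paper: the one-dimensional integer states, the distance-based retrieval with a `threshold`, the form of the bonus (just enough to make the suggested action greedy, plus a small `eta`), the weight `xi`, and all function names.

```python
import random
from collections import defaultdict

ACTIONS = ["left", "right"]  # hypothetical action set

def retrieve_case(case_base, state, threshold=1.0):
    """Return the action of the nearest stored (state, action) case,
    or None when the case base is empty or no case is close enough."""
    best = min(case_base, key=lambda c: abs(c[0] - state), default=None)
    if best is None or abs(best[0] - state) > threshold:
        return None
    return best[1]

def heuristic(q, state, suggested, eta=1.0):
    """Assumed heuristic H(s, a): zero everywhere except the suggested
    action, which gets a bonus just large enough to make it greedy."""
    h = {a: 0.0 for a in ACTIONS}
    if suggested is not None:
        best_q = max(q[(state, a)] for a in ACTIONS)
        h[suggested] = best_q - q[(state, suggested)] + eta
    return h

def choose_action(q, state, case_base, epsilon=0.1, xi=1.0, rng=random):
    """Epsilon-greedy choice over Q(s, a) + xi * H(s, a)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    h = heuristic(q, state, retrieve_case(case_base, state))
    return max(ACTIONS, key=lambda a: q[(state, a)] + xi * h[a])

def q_update(q, s, a, r, s_next, alpha=0.5, gamma=0.9):
    """Standard Q-learning update; the heuristic does not alter it."""
    target = r + gamma * max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (target - q[(s, a)])
```

In this sketch the heuristic only changes which action is tried, while the value update stays that of plain Q-learning, so a poor case can misdirect exploration for a while without being written directly into the learned values.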

This work has been partially funded by the FI grant and the BE grant from the AGAUR, the 2005-SGR-00093 project, supported by the Generalitat de Catalunya, the MID-CBR project grant TIN 2006-15140-C03-01 and FEDER funds. Reinaldo Bianchi is supported by CNPq grant 201591/2007-3 and FAPESP grant 2009/01610-1.

The original publication is available at www.springerlink.com

Peer reviewed

Related Organizations

Research Institute for Artificial Intelligence, Spain
Spanish National Research Council, Spain
Western Sydney University, Australia
Centro Universitário da FEI, Brazil

Keywords

Multiagent learning, Case-based heuristically accelerated reinforcement learning, Case-based reasoning, CBR, Reinforcement learning, 004

Impact by BIP!

selected citations: 20. These citations are derived from selected sources. This is an alternative to the "influence" indicator.
popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.