Hierarchical Reinforcement Learning with OMQ

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Jul 2006Publisher:IEEEJournal:2006 5th IEEE International Conference on Cognitive Informatics

Authors: Jing Shen; Haibo Liu; Guochang Gu;

doi: 10.1109/coginf.2006.365550

Hierarchical Reinforcement Learning with OMQ

- Summary
- Metrics

Abstract

A novel method of hierarchical reinforcement learning, named OMQ, by integrating Options into MAXQ is presented. In OMQ, the MAXQ is used as basic framework to design hierarchies experientially and learn online, and the Option is used to construct hierarchies automatically. The performance of OMQ is demonstrated in taxi domain and compared with Option and MAXQ. The simulation results show that the OMQ is more practical than Option and MAXQ in partial known environment.

Related Organizations

Harbin Engineering University
China (People's Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	3
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average