descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Oct 2008Embargo end date: 01 Jan 2008Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), volume 38, pages 1,207-1,220 (issn: 1083-4419,

Authors: Han-Xiong Li; Daoyi Dong; Chunlin Chen; Tzyh-Jong Tarn;

doi: 10.1109/tsmcb.2008.925743 , 10.48550/arxiv.0810.3828

pmid: 18784007

arXiv: http://arxiv.org/abs/0810.3828

Quantum Reinforcement Learning

- Summary
- Subjects
- Metrics

Abstract

The key approaches for machine learning, especially learning in unknown probabilistic environments are new representations and computation mechanisms. In this paper, a novel quantum reinforcement learning (QRL) method is proposed by combining quantum theory and reinforcement learning (RL). Inspired by the state superposition principle and quantum parallelism, a framework of value updating algorithm is introduced. The state (action) in traditional RL is identified as the eigen state (eigen action) in QRL. The state (action) set can be represented with a quantum superposition state and the eigen state (eigen action) can be obtained by randomly observing the simulated quantum state according to the collapse postulate of quantum measurement. The probability of the eigen action is determined by the probability amplitude, which is parallelly updated according to rewards. Some related characteristics of QRL such as convergence, optimality and balancing between exploration and exploitation are also analyzed, which shows that this approach makes a good tradeoff between exploration and exploitation using the probability amplitude and can speed up learning through the quantum parallelism. To evaluate the performance and practicability of QRL, several simulated experiments are given and the results demonstrate the effectiveness and superiority of QRL algorithm for some complex problems. The present work is also an effective exploration on the application of quantum computation to artificial intelligence.

13 pages, 7 figures, Latex

Related Organizations

Central South University
China (People's Republic of)
University of Washington
United States
University of Mary
United States
Chinese Academy of Sciences
China (People's Republic of)
City University of Hong Kong
China (People's Republic of)

View all View all

Keywords

FOS: Computer and information sciences, Quantum Physics, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, FOS: Physical sciences, Models, Biological, Pattern Recognition, Automated, Machine Learning (cs.LG), Artificial Intelligence (cs.AI), Artificial Intelligence, Biomimetics, Humans, Quantum Theory, Computer Simulation, Quantum Physics (quant-ph), Reinforcement, Psychology

Impact byBIP!

	citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	261
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%