Benchmarking Projective Simulation in Navigation Problems

Publication: Article, Preprint. Date: 01 Jan 2018. Embargo end date: 01 Jan 2018. Countries: Germany, Austria. Publisher: Institute of Electrical and Electronics Engineers (IEEE). Journal: IEEE Access, volume 6, pages 64639-64648 (eISSN: 2169-3536).

Authors: Alexey A. Melnikov; Adi Makmal; Hans J. Briegel

doi: 10.1109/access.2018.2876494, 10.48550/arxiv.1804.08607

arXiv: 1804.08607

Abstract

Projective simulation (PS) is a model for intelligent agents with a deliberation capacity that is based on episodic memory. The model has been shown to provide a flexible framework for constructing reinforcement-learning agents, and it allows for a quantum mechanical generalization, which leads to a speed-up in deliberation time. PS agents have been applied successfully to complex skill learning in robotics and to the design of state-of-the-art quantum experiments. In this paper, we study the performance of projective simulation in two benchmark navigation problems, namely the grid world and the mountain car problem. The performance of PS is compared to that of two standard tabular reinforcement learning approaches, Q-learning and SARSA. Our comparison demonstrates that the performance of PS is qualitatively and quantitatively similar to that of the standard learning approaches, while it is much easier to choose optimal model parameters in the case of projective simulation, with a computational effort reduced by one to two orders of magnitude. Our results show that the projective simulation model stands out for its simplicity in terms of the number of model parameters, which makes it simple to set up the learning agent in unknown task environments.
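As a rough illustration of the three learners named in the abstract, the sketch below places the textbook tabular Q-learning and SARSA updates next to a basic two-layer PS update with glow. It is not taken from the paper: the action set, the parameter names (`alpha`, `gamma`, `damping`, `glow_decay`), their default values, and the class name `BasicPSAgent` are assumptions chosen for illustration, and the order of glow decay and reward application may differ from the authors' implementation.

```python
from collections import defaultdict

ACTIONS = ("up", "down", "left", "right")  # assumed grid-world action set


def q_learning_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy TD update: bootstrap from the greedy action in s_next."""
    best_next = max(q[(s_next, b)] for b in ACTIONS)
    q[(s, a)] += alpha * (r + gamma * best_next - q[(s, a)])


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy TD update: bootstrap from the action actually taken next."""
    q[(s, a)] += alpha * (r + gamma * q[(s_next, a_next)] - q[(s, a)])


class BasicPSAgent:
    """Two-layer PS sketch (assumed form): percepts link to actions via h-values."""

    def __init__(self, damping=0.0, glow_decay=0.1):
        self.damping = damping              # forgetting rate of h-values
        self.glow_decay = glow_decay        # per-step decay of edge glow
        self.h = defaultdict(lambda: 1.0)   # h-values start at 1
        self.glow = defaultdict(float)      # glow starts at 0

    def action_probabilities(self, s):
        # Action probability is proportional to the h-value of the edge.
        total = sum(self.h[(s, a)] for a in ACTIONS)
        return {a: self.h[(s, a)] / total for a in ACTIONS}

    def record_step(self, s, a):
        # Fade the glow on all earlier edges, then mark the edge just used.
        for edge in self.glow:
            self.glow[edge] *= 1.0 - self.glow_decay
        self.glow[(s, a)] = 1.0

    def learn(self, reward):
        # Damp each h-value toward 1 and add reward in proportion to glow.
        for edge in set(self.h) | set(self.glow):
            h = self.h[edge]
            self.h[edge] = h - self.damping * (h - 1.0) + self.glow[edge] * reward
```

In this sketch the two TD learners each carry a learning rate and a discount factor (plus whatever exploration parameter the policy uses), while the PS agent carries a damping and a glow parameter, which is the kind of parameter count the abstract refers to.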

8 pages, 10 figures

Countries

Germany, Austria

Keywords

benchmarking tasks, FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, quantum mechanics, Machine Learning (stat.ML), navigation problems, TK1-9971, Machine Learning (cs.LG), projective simulation, Artificial Intelligence (cs.AI), random processes, Statistics - Machine Learning, Reinforcement learning, Electrical engineering. Electronics. Nuclear engineering, info:eu-repo/classification/ddc/530, learning, delayed rewards, Markov decision processes

Impact by BIP!

	selected citations	18
	popularity	Top 10%
	influence	Top 10%
	impulse	Top 10%