Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments

descriptionPublicationkeyboard_double_arrow_right Article 01 Jan 2021Publisher:Institute of Electrical and Electronics Engineers (IEEE)Journal:IEEE Access, volume 9, pages 138,775-138,790 (eissn: 2169-3536,

Copyright policy )

Authors: Anmol Gupta; Partha Pratim Roy; Varun Dutt;

doi: 10.1109/access.2021.3117855

Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments

- Summary
- Subjects
- Metrics

Abstract

Reinforcement learning is an unsupervised learning algorithm, where learning is based upon feedback from the environment. Prior research has proposed cognitive (e.g., Instance-based Learning or IBL) and statistical (Q-learning) reinforcement learning algorithms. However, an evaluation of these algorithms in a single dynamic environment has not been explored. In this paper, a comparison between the statistical Q-learning algorithm and the cognitive IBL algorithm is presented. A well-known environment, “Frozen Lake,” is used to train, generalize, and scale Q-learning and IBL algorithms. For generalizing, the Q-learning and IBL agents were trained on one version of the Frozen Lake and tested on a permuted version of the same environment. For scaling, the two algorithms were tested on a larger version of the Frozen Lake environment. Results revealed that the IBL algorithm used less training time and generalized better to different environment variants. The IBL algorithm was also able to show scalability by retaining its superior performance in the larger environment. These results indicate that the IBL algorithm could be proposed as an alternative to the standard reinforcement learning algorithms based on dynamic programming such as Q-learning. The inclusion of human factors (such as memory) in the IBL algorithm makes it suitable for robust learning in complex and dynamic environments.

Related Organizations

Indian Institute of Technology Roorkee
India
Indian Institute of Technology Dharwad
India
Indian Institute of Technology Mandi
India

Keywords

openAI, frozen lake, Reinforcement learning, Q-learning, instance-based learning, Electrical engineering. Electronics. Nuclear engineering, cognitive modeling, TK1-9971

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	8
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

8

Top 10%

gold

Fields of Science (4) View all

engineering and technology

electrical engineering, electronic engineering, information engineering

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

View all