A Hierarchical System of Learning Automata

Publication: Article, 01 Mar 1981. Publisher: Institute of Electrical and Electronics Engineers (IEEE). Journal: IEEE Transactions on Systems, Man, and Cybernetics, volume 11, pages 236-241 (ISSN: 0018-9472, eISSN: 2168-2909)

Authors: Thathachar, MAL; Ramakrishnan, KR;

doi: 10.1109/tsmc.1981.4308659

Abstract

A learning automaton operating in a random environment updates its action probabilities on the basis of the reactions of the environment, so that asymptotically it chooses the optimal action. When the number of actions is large, the automaton becomes slow because there are too many updatings to be made at each instant. A hierarchical system of such automata with assured ε-optimality is suggested to overcome that problem. The learning algorithm for the hierarchical system turns out to be a simple modification of the absolutely expedient algorithm known in the literature. The parameters of the algorithm at each level in the hierarchy depend only on the parameters and the action probabilities of the previous level. It follows that, to minimize the number of updatings per cycle, each automaton in the hierarchy need have only two or three actions.
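
The idea in the abstract can be illustrated with a minimal Python sketch, offered as an assumption-laden illustration rather than the authors' algorithm. It builds a binary tree of two-action automata in which only the automata on the selected path are updated each cycle, so a tree of depth d touches d automata instead of all 2^d leaf actions. The class name `TwoActionAutomaton`, the function `run_hierarchy`, the reward probabilities, and the linear reward-inaction update with a constant `step` at every level are all hypothetical choices made here. In particular, the abstract states that the paper's parameters depend on the previous level's parameters and action probabilities; that rule is not given in the abstract and is not reproduced in this sketch.

```python
import random


class TwoActionAutomaton:
    """Two-action automaton with a linear reward-inaction style update."""

    def __init__(self, step):
        self.p = [0.5, 0.5]  # action probabilities, start unbiased
        self.step = step     # learning parameter, assumed 0 < step < 1

    def choose(self, rng):
        return 0 if rng.random() < self.p[0] else 1

    def reward(self, action):
        # On reward, shift probability toward the chosen action.
        # On penalty this method is not called, so nothing changes.
        other = 1 - action
        self.p[action] += self.step * self.p[other]
        self.p[other] = 1.0 - self.p[action]


def run_hierarchy(reward_probs, depth, step=0.02, cycles=20000, seed=0):
    """Simulate a binary tree of automata; leaves index into reward_probs."""
    assert len(reward_probs) == 2 ** depth
    rng = random.Random(seed)
    automata = {}  # path from the root (tuple of 0/1) -> automaton
    for _ in range(cycles):
        path, visited, leaf = (), [], 0
        for _ in range(depth):
            if path not in automata:
                automata[path] = TwoActionAutomaton(step)
            node = automata[path]
            action = node.choose(rng)
            visited.append((node, action))
            path += (action,)
            leaf = 2 * leaf + action
        # The environment responds to the leaf action only.
        if rng.random() < reward_probs[leaf]:
            # Only the automata on the selected path are updated.
            for node, action in visited:
                node.reward(action)
    return automata


if __name__ == "__main__":
    # Hypothetical environment: leaf 2 has the highest reward probability.
    learned = run_hierarchy([0.2, 0.3, 0.9, 0.4], depth=2)
    for path, node in sorted(learned.items()):
        print(path, [round(x, 3) for x in node.p])
```

Each cycle performs one two-element update per level, which is the saving the abstract attributes to keeping every automaton at two or three actions. A constant step at all levels is used only for simplicity and carries no optimality guarantee.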

Country

India

Related Organizations

Indian Institute of Science Bangalore, India

Keywords

learning algorithm, random environment, Learning and adaptive systems in artificial intelligence, Electrical Engineering

Impact by BIP!

Selected citations: 49
Popularity: Average
Influence: Top 1%
Impulse: Average