Reinforcement Learning for Resource Allocation in LEO Satellite Networks

Publication: Article, 01 Jun 2007, United Kingdom
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Journal: IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), volume 37, pages 515-527 (ISSN: 1083-4419)

Authors: Usaha, W; Barria, J A

doi: 10.1109/tsmcb.2006.886173

pmid: 17550108

handle: 10044/1/775

Abstract

In this paper, we develop and assess online decision-making algorithms for call admission and routing in low Earth orbit (LEO) satellite networks. A recent paper has shown that, in a LEO satellite system, a semi-Markov decision process formulation of the call admission and routing problem can achieve better performance, in terms of an average revenue function, than existing routing methods. However, the conventional dynamic programming (DP) numerical solution becomes prohibitive as the problem size increases. In this paper, two solution methods based on reinforcement learning (RL) are proposed to circumvent the computational burden of DP. The first is an actor-critic method with temporal-difference (TD) learning. The second is a critic-only method, called optimistic TD learning. The algorithms enhance performance in terms of storage requirements, computational complexity, and computational time, as well as in terms of an overall long-term average revenue function that penalizes blocked calls. Numerical studies are carried out, and the results show that the RL framework can achieve up to 56% higher average revenue than existing routing methods used in LEO satellite networks, with reasonable storage and computational requirements.
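
To make the critic-only idea concrete, the following is a minimal sketch of tabular average-reward TD(0) applied to a toy single-link call admission problem. It is not the authors' algorithm: it uses discrete time steps rather than a semi-Markov formulation, it omits routing, and every name and number in it (`run_td_admission`, the two call classes, their revenues, the arrival and departure probabilities, the blocking penalty) is a hypothetical choice made for illustration only.

```python
import random


def run_td_admission(capacity=5, steps=20000, alpha=0.05, beta=0.01,
                     epsilon=0.1, seed=0):
    """Toy critic-only admission control with average-reward TD(0).

    All parameters and the traffic model are illustrative assumptions,
    not values taken from the paper.
    """
    rng = random.Random(seed)
    revenue = {0: 1.0, 1: 4.0}   # hypothetical revenue per admitted call class
    block_penalty = 1.0          # hypothetical penalty for a blocked call
    depart_prob = 0.1            # each busy channel frees up with this probability

    values = [0.0] * (capacity + 1)  # differential value per occupancy level
    rho = 0.0                        # running estimate of average revenue
    busy = 0
    total = 0.0

    for _ in range(steps):
        # Departures: each ongoing call ends independently.
        departures = sum(1 for _ in range(busy) if rng.random() < depart_prob)
        after_dep = busy - departures

        # At most one arrival per step: class 0 (p=0.6), class 1 (p=0.2), none (p=0.2).
        u = rng.random()
        call = 0 if u < 0.6 else (1 if u < 0.8 else None)

        reward = 0.0
        nxt = after_dep
        if call is not None:
            admit = False
            if after_dep < capacity:
                # Greedy decision from the critic: compare one-step returns.
                admit_ret = revenue[call] + values[after_dep + 1]
                reject_ret = -block_penalty + values[after_dep]
                admit = admit_ret >= reject_ret
                if rng.random() < epsilon:   # occasional exploration
                    admit = rng.random() < 0.5
            if admit:
                reward = revenue[call]
                nxt = after_dep + 1
            else:
                reward = -block_penalty      # blocked or rejected call

        # Average-reward TD(0) update of the critic and the revenue estimate.
        delta = reward - rho + values[nxt] - values[busy]
        values[busy] += alpha * delta
        rho += beta * delta

        total += reward
        busy = nxt

    return values, rho, total / steps
```

Calling `run_td_admission()` returns the learned value table, the running average-revenue estimate, and the empirical average revenue per step. The table has one entry per occupancy level, so its storage grows with link capacity rather than requiring a full DP solve over a transition model, which is the kind of saving the abstract alludes to.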

Country

United Kingdom

Related Organizations

Imperial College London
United Kingdom
Suranaree University of Technology
Thailand

Keywords

Computer Communication Networks; Artificial Intelligence; Signal Processing, Computer-Assisted; Spacecraft; Algorithms; Decision Support Techniques; Pattern Recognition, Automated; Resource Allocation

Impact by BIP!

- Selected citations: 27
- Popularity: Top 10%
- Influence: Top 10%
- Impulse: Top 10%

Access routes: Green, hybrid

Fields of Science

engineering and technology

electrical engineering, electronic engineering, information engineering

Related to Research communities

Knowmad Institut