UAV swarm path planning with reinforcement learning for field prospecting

Publication: Article, Preprint, 03 Mar 2022

Embargo end date: 01 Jan 2021

Country: Spain

Language: English

Publisher: Springer Science and Business Media LLC

Journal: Applied Intelligence, volume 52, pages 14,101-14,118 (issn: 0924-669X, eissn: 1573-7497)

Authors: Alejandro Puente-Castro; Daniel Rivero; Alejandro Pazos; Enrique Fernández-Blanco

doi: 10.1007/s10489-022-03254-4, 10.48550/arxiv.2106.02322

arXiv: 2106.02322

handle: 2183/29921


Abstract

There has been steady growth in the adoption of Unmanned Aerial Vehicle (UAV) swarms by operators due to their time and cost benefits. However, this kind of system faces an important problem, which is the calculation of many optimal paths for each UAV. Solving this problem would allow control of many UAVs without human intervention while saving battery between recharges and performing several tasks simultaneously. The main aim is to develop a Reinforcement Learning based system capable of calculating the optimal flight path for a UAV swarm. This method stands out for its ability to learn through trial and error, allowing the model to adjust itself. The aim of these paths is to achieve full coverage of an overflight area for tasks such as field prospection, regardless of map size and the number of UAVs in the swarm. It is not necessary to establish targets or to have any previous knowledge other than the given map. Experiments have been conducted to determine whether it is optimal to establish a single control for all UAVs in the swarm or a control for each UAV. The results show that it is better to use one control for all UAVs because of the shorter flight time. In addition, the flight time is greatly affected by the size of the map. The results give starting points for future research, such as finding the optimal map size for each situation.
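
To make the trial-and-error idea in the abstract concrete, the following is a minimal sketch of Q-Learning for covering a grid map. It is an illustration under stated assumptions, not the authors' system: it uses a lookup table instead of a neural network, a single agent instead of a swarm, and the reward values, hyperparameters, and every function name (`step`, `q_update`, `train`, `greedy_path`) are hypothetical choices made here.

```python
import random
from collections import defaultdict

# Assumed action set: up, down, left, right on a grid map.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]


def step(pos, visited, action, rows, cols):
    """Apply one move; return (new_pos, new_visited, reward, done)."""
    r, c = pos[0] + action[0], pos[1] + action[1]
    if not (0 <= r < rows and 0 <= c < cols):
        return pos, visited, -1.0, False      # tried to leave the map
    new_pos = (r, c)
    if new_pos in visited:
        return new_pos, visited, -0.5, False  # revisiting wastes flight time
    new_visited = visited | {new_pos}
    done = len(new_visited) == rows * cols    # full coverage reached
    return new_pos, new_visited, 1.0, done


def q_update(q, state, a, reward, next_state, done, alpha=0.5, gamma=0.9):
    """Standard Q-Learning update for one observed transition."""
    best_next = 0.0 if done else max(q[next_state])
    q[state][a] += alpha * (reward + gamma * best_next - q[state][a])


def train(rows, cols, episodes=3000, max_steps=50, epsilon=0.2, seed=0):
    """Learn a Q-table by epsilon-greedy trial and error."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * len(ACTIONS))
    for _ in range(episodes):
        pos, visited = (0, 0), frozenset({(0, 0)})
        for _ in range(max_steps):
            state = (pos, visited)
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))  # explore
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
            pos, visited, reward, done = step(
                pos, visited, ACTIONS[a], rows, cols)
            q_update(q, state, a, reward, (pos, visited), done)
            if done:
                break
    return q


def greedy_path(q, rows, cols, max_steps=50):
    """Follow the learned Q-table greedily; return (path, cells covered)."""
    pos, visited = (0, 0), frozenset({(0, 0)})
    path = [pos]
    for _ in range(max_steps):
        state = (pos, visited)
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        pos, visited, _, done = step(pos, visited, ACTIONS[a], rows, cols)
        path.append(pos)
        if done:
            break
    return path, len(visited)


if __name__ == "__main__":
    q_table = train(3, 3)
    path, covered = greedy_path(q_table, 3, 3)
    print(f"covered {covered} of 9 cells in {len(path) - 1} moves")
```

The state here is the pair of current cell and set of visited cells, so the table grows exponentially with map size. That growth is one reason a function approximator such as a neural network is a natural replacement for the table on larger maps.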

Country

Spain

Related Organizations

University of A Coruña
Spain
Complexo Hospitalario Universitario A Coruña
Spain

Keywords

Artificial neural network, FOS: Computer and information sciences, Reinforcement learning, Agriculture, Computer Science - Multiagent Systems, Q-Learning, UAV swarm, Path planning, Multiagent Systems (cs.MA)

Impact by BIP!

selected citations: 45. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
popularity: Top 1%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
influence: Top 10%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
impulse: Top 1%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.