
A methodology is presented in this paper for the stochastic optimal control of an unmanned aerial vehicle performing a perimeter patrol task. The optimal control problem is modeled as a Markov decision process, and an approximate policy iteration algorithm is applied, with Gaussian process regression introduced to approximate the cost-to-go function (value function); this improves the quality of the decisions made while retaining computational feasibility. The approximate dynamic programming (ADP) framework is developed to handle situations in which standard dynamic programming algorithms become computationally too demanding. As a nonparametric ADP algorithm, Gaussian process regression combines a prior model with a noise model and yields a sub-solution in a lower-dimensional space through a kernel-based method. Numerical results corroborating the effectiveness of the proposed methodology are also provided.
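The abstract does not give algorithmic details, so the following is a minimal, self-contained sketch (not the paper's implementation) of the general idea: approximate policy iteration in which the cost-to-go function is fitted by Gaussian process regression from sampled rollouts, applied to a toy perimeter-patrol Markov decision process. All names (`GPValueFunction`, `rollout_cost`), the ring-of-stations model, the transition and cost structure, and the kernel hyperparameters are illustrative assumptions, not details from the paper.

```python
import numpy as np

# --- Gaussian process regression for the cost-to-go (value) function ---
# Squared-exponential kernel on scalar state indices (ring topology ignored
# for simplicity); hyperparameters are illustrative choices.
def rbf_kernel(A, B, length_scale=2.0, signal_var=1.0):
    d2 = (A[:, None] - B[None, :]) ** 2
    return signal_var * np.exp(-0.5 * d2 / length_scale**2)

class GPValueFunction:
    """Nonparametric value approximator: GP prior plus Gaussian noise model."""
    def __init__(self, noise_var=1e-2):
        self.noise_var = noise_var

    def fit(self, X, y):
        self.X, self.y = np.asarray(X, float), np.asarray(y, float)
        K = rbf_kernel(self.X, self.X) + self.noise_var * np.eye(len(self.X))
        self.alpha = np.linalg.solve(K, self.y)   # alpha = K^{-1} y

    def predict(self, Xs):
        return rbf_kernel(np.asarray(Xs, float), self.X) @ self.alpha

# --- Toy perimeter-patrol MDP: N stations on a ring, alert at station 0 ---
rng = np.random.default_rng(0)
N, gamma = 20, 0.95                      # stations on the perimeter, discount

def stage_cost(s):                       # cost: ring distance to alert station 0
    return min(s, N - s)

def step(s, a):                          # stochastic move: intended action w.p. 0.8
    move = a if rng.random() < 0.8 else 0
    return (s + move) % N

def rollout_cost(s, policy, horizon=60):
    """Sampled discounted cost-to-go from state s under `policy`."""
    total, disc = 0.0, 1.0
    for _ in range(horizon):
        total += disc * stage_cost(s)
        s, disc = step(s, policy[s]), disc * gamma
    return total

# --- Approximate policy iteration with a GP-fitted value function ---
policy = np.zeros(N, dtype=int)          # initial policy: loiter everywhere
support = np.arange(0, N, 2)             # GP fitted on a subset of states only
for it in range(10):
    # Policy evaluation: Monte Carlo targets on support states, GP smooths/extends
    targets = [np.mean([rollout_cost(s, policy) for _ in range(20)])
               for s in support]
    V = GPValueFunction()
    V.fit(support, targets)
    # Policy improvement: one-step lookahead against the GP value estimate
    vals = V.predict(np.arange(N))
    new_policy = np.array([
        min((-1, 0, 1),
            key=lambda a: stage_cost(s)
                          + gamma * (0.8 * vals[(s + a) % N] + 0.2 * vals[s]))
        for s in range(N)
    ])
    if np.array_equal(new_policy, policy):
        break                            # policy stable: stop iterating
    policy = new_policy

print("greedy patrol policy:", policy)
```

Fitting the GP only on the `support` subset of states is meant to mirror the abstract's point that the kernel-based regression provides a sub-solution from lower-dimensional (sampled) data; in any real application the kernel, its hyperparameters, and the sampling scheme would need problem-specific tuning.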
