Bounds for approximate dynamic programming based on string optimization and curvature

descriptionPublicationkeyboard_double_arrow_right Article , Conference object 01 Dec 2014Publisher:IEEEJournal:53rd IEEE Conference on Decision and Control

Authors: Yajing Liu; Edwin K. P. Chong; Ali Pezeshki; William Moran 0001;

doi: 10.1109/cdc.2014.7040433

Bounds for approximate dynamic programming based on string optimization and curvature

- Summary
- Metrics

Abstract

In this paper, we will develop a systematic approach to deriving guaranteed bounds for approximate dynamic programming (ADP) schemes in optimal control problems. Our approach is inspired by our recent results on bounding the performance of greedy strategies in optimization of string functions over a finite horizon. The approach is to derive a string-optimization problem, for which the optimal strategy is the optimal control solution and the greedy strategy is the ADP solution. Using this approach, we show that any ADP solution achieves a performance that is at least a factor of β of the performance of the optimal control solution, characterized by Bellman's optimality principle. The factor β depends on the specific ADP scheme, as we will explicitly characterize. To illustrate the applicability of our bounding technique, we present examples of ADP schemes, including the popular rollout method.

Related Organizations

Colorado State University
United States
RMIT University
Australia

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

1

Average

Fields of Science

engineering and technology

other engineering and technologies

Fields of Science

engineering and technology

other engineering and technologies

Upload OA version

Are you the author of this publication? Upload your Open Access version to Zenodo!

It’s fast and easy, just two clicks!

uploadUpload now