
The aim of this paper is to address optimality of stochastic control strategies via dynamic programming subject to total variational distance uncertainty on the conditional distribution of the controlled process. Utilizing concepts from signed measures, the maximization of a linear functional on the space of probability measures on abstract spaces is investigated, among those probability measures which are within a total variational distance from a nominal probability measure. The maximizing probability measure is found in closed form. These results are then applied to solve minimax stochastic control with deterministic control strategies, under a Markovian assumption on the conditional distributions of the controlled process. The results include: 1) Optimization subject to total variational distance constraints, 2) new dynamic programming recursions, which involve the oscillator seminorm of the value function.
Optimization, Closed form, Optimality, Control strategies, Conditional distribution, Recursions, Linear functional, Probability measures, Dynamic programming, Value functions, Seminorms, Markovian, Stochastic control systems, Controlled process, Stochastic control, Abstract space, Process control, Minimax, Variational distance, Signed measure, Probability
Optimization, Closed form, Optimality, Control strategies, Conditional distribution, Recursions, Linear functional, Probability measures, Dynamic programming, Value functions, Seminorms, Markovian, Stochastic control systems, Controlled process, Stochastic control, Abstract space, Process control, Minimax, Variational distance, Signed measure, Probability
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
