<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
doi: 10.1109/9.159584
handle: 2328/26296
The paper proposes a unified approach to the asymptotic analysis of a Markov decision process (MDP) with an \(\varepsilon\)-additive perturbation irrespective of whether the perturbation is regular or singular. It shows that an optimal solution to the perturbed MDP can be approximated by an optimal solution of the limit Markov control problem for sufficiently small perturbations. Investigating the discounted case it is shown that an optimal solution to the perturbed MDP can be approximated by an optimal solution of the original MDP for sufficiently small perturbations. The same conclusion can be derived for the unichain, the communicating and the discounted cases.
asymptotic analysis, Markov and semi-Markov decision processes, perturbation, Lyapunov and other classical stabilities (Lagrange, Poisson, \(L^p, l^p\), etc.) in control theory, Markov decision process, Mathematics, Markov Decision Process
asymptotic analysis, Markov and semi-Markov decision processes, perturbation, Lyapunov and other classical stabilities (Lagrange, Poisson, \(L^p, l^p\), etc.) in control theory, Markov decision process, Mathematics, Markov Decision Process
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 43 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |