descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Aug 2025Embargo end date: 01 Jan 2022 English Publisher:Institute for Operations Research and the Management Sciences (INFORMS)Journal:Mathematics of Operations Research, volume 50, pages 1,707-1,733 (issn: 0364-765X, eissn: 1526-5471,

Authors: Ziteng Cheng; Sebastian Jaimungal;

doi: 10.1287/moor.2023.0211 , 10.48550/arxiv.2203.09612

arXiv: 2203.09612

Risk-Averse Markov Decision Processes Through a Distributional Lens

- Summary
- Subjects
- Metrics

Abstract

By adopting a distributional viewpoint on law-invariant convex risk measures, we construct dynamic risk measures (DRMs) at the distributional level. We then apply these DRMs to investigate Markov decision processes, incorporating latent costs, random actions, and weakly continuous transition kernels. Furthermore, the proposed DRMs allow risk aversion to change dynamically. Under mild assumptions, we derive a dynamic programming principle and show the existence of an optimal policy in both finite and infinite time horizons. Moreover, we provide a sufficient condition for the optimality of deterministic actions. For illustration, we conclude the paper with examples from optimal liquidation with limit order books and autonomous driving. Funding: This work was supported by Natural Sciences and Engineering Research Council of Canada [Grants RGPAS-2018-522715 and RGPIN-2018-05705].

Related Organizations

Keywords

Mathematical Finance (q-fin.MF), FOS: Economics and business, Quantitative Finance - Mathematical Finance, Optimization and Control (math.OC), Risk Management (q-fin.RM), FOS: Mathematics, Mathematics - Optimization and Control, Quantitative Finance - Risk Management

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	1
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average