Risk-Sensitive Markov Decision Processes

descriptionPublicationkeyboard_double_arrow_right Article 01 Mar 1972 English Publisher:Institute for Operations Research and the Management Sciences (INFORMS)Journal:Management Science, volume 18, pages 356-369 (issn: 0025-1909, eissn: 1526-5501,

Copyright policy )

Authors: Ronald A. Howard; James E. Matheson;

doi: 10.1287/mnsc.18.7.356

Risk-Sensitive Markov Decision Processes

- Summary
- Subjects
- Metrics

Abstract

This paper considers the maximization of certain equivalent reward generated by a Markov decision process with constant risk sensitivity. First, value iteration is used to optimize possibly time-varying processes of finite duration. Then a policy iteration procedure is developed to find the stationary policy with highest certain equivalent gain for the infinite duration case. A simple example demonstrates both procedures.

Related Organizations

SRI International
United States
Stanford University
United States

Keywords

Markov and semi-Markov decision processes, Decision theory

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	310
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 1%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 0.1%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average