Discount-isotone policies for Markov decision processes

White, D. J.

doi:10.1007/BF01720029

Theoretical Papers
Published: March 1988

Volume 10, pages 13–22 (1988)
Operations-Research-Spektrum

Summary

This paper considers infinite horizon discounted Markov decision processes and conditions under which discount-isotone optimal policies exist. Given partial orders over the state and action spaces, a set of discount-isotone optimal policies is a set of optimal policies, one for each discount factor in a given set, such that, for each state, the optimal actions are partially ordered in such a manner as to match the ordering of the discount factors. It is easier to solve problems with small discount factors and the induced partial ordering facilitates the solutions for higher discount factor levels.

Summary (translated from the German)

For infinite-horizon discounted Markov decision processes, conditions are given under which so-called "discount-isotone" optimal policies exist. A discount-isotone family of optimal policies exists when the state and action spaces are partially ordered and, for a set of discount factors, there is one optimal policy per discount factor such that, in every state, the optimal actions depend isotonically on the discount factor. It can be advantageous to first solve problems with a small discount factor and then use the isotonicity properties to solve for larger discount factors.
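The computational idea in the summary can be illustrated with a small sketch. It is not the paper's algorithm: it assumes a finite MDP, actions that are totally ordered by their index (a special case of a partial order), and that the discount-isotone property actually holds for the instance, which is exactly what the paper's conditions are meant to guarantee. The function names, the array layout, and the two-state example are hypothetical. The problem is solved for the smallest discount factor first, and each optimal policy then serves as a lower bound on the actions searched at the next discount factor.

```python
import numpy as np

def value_iteration(P, r, beta, min_action=None, tol=1e-10, max_iter=100_000):
    """Plain value iteration for a finite discounted MDP (illustrative only).

    P[a, s, t] : probability of moving from state s to state t under action a
    r[s, a]    : immediate reward in state s under action a
    min_action : per-state lowest action index to search (assumed bound)
    """
    n_states, n_actions = r.shape
    if min_action is None:
        min_action = np.zeros(n_states, dtype=int)
    # allowed[s, a] is True when action a is at or above the bound for state s
    allowed = np.arange(n_actions)[None, :] >= np.asarray(min_action)[:, None]
    v = np.zeros(n_states)
    for _ in range(max_iter):
        # q[s, a] = r[s, a] + beta * sum_t P[a, s, t] * v[t]
        q = r + beta * np.einsum("ast,t->sa", P, v)
        q = np.where(allowed, q, -np.inf)
        v_new = q.max(axis=1)
        converged = np.max(np.abs(v_new - v)) < tol
        v = v_new
        if converged:
            break
    return v, q.argmax(axis=1)

def solve_increasing_discounts(P, r, betas):
    """Solve for each discount factor in increasing order, restricting the
    action search with the previous policy. Valid only if the optimal
    actions are isotone in the discount factor (assumed here)."""
    bound, policies = None, {}
    for beta in sorted(betas):
        _, policy = value_iteration(P, r, beta, min_action=bound)
        policies[beta] = policy
        bound = policy
    return policies

# Hypothetical example: state 1 pays 4 per period, state 0 pays nothing.
# Action a is an investment level with cost[a]; a higher level makes the
# paying state more likely next period.
p_good = np.array([0.1, 0.5, 0.8])
cost = np.array([0.0, 1.0, 2.0])
P = np.empty((3, 2, 2))
P[:, :, 1] = p_good[:, None]
P[:, :, 0] = 1.0 - p_good[:, None]
r = np.array([0.0, 4.0])[:, None] - cost[None, :]

policies = solve_increasing_discounts(P, r, [0.3, 0.7, 0.95])
for beta, policy in policies.items():
    print(beta, policy.tolist())
```

In this example a more patient decision maker invests more, so the restricted search loses nothing. On an instance where the isotone property fails, the bound could exclude an optimal action, so comparing against an unrestricted solve is a sensible check.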

Author information

Authors and Affiliations

Department of Systems Engineering, University of Virginia, Charlottesville, VA 22901, USA
D. J. White

About this article

Cite this article

White, D.J. Discount-isotone policies for Markov decision processes. OR Spektrum 10, 13–22 (1988). https://doi.org/10.1007/BF01720029

Received: 13 February 1987
Accepted: 22 September 1987
Issue Date: March 1988
DOI: https://doi.org/10.1007/BF01720029
