A superharmonic approach to solving infinite horizon partially observable Markov decision problems

White, D. J.

doi:10.1007/BF01415066

A superharmonic approach to solving infinite horizon partially observable Markov decision problems

Articles
Published: February 1995

Volume 41, pages 71–88, (1995)
Cite this article

Zeitschrift für Operations Research Aims and scope Submit manuscript

D. J. White¹

26 Accesses
2 Citations
Explore all metrics

Abstract

In this paper we use an approach which uses a superharmonic property of a sequence of functions generated by an algorithm to show that these functions converge in a non-increasing manner to the optimal value function for our problem, and bounds are given for the loss of optimality if the computational process is terminated at any iteration. The basic procedure is to add an additional linear term at each iteration, selected by solving a particular optimisation problem, for which primal and dual linear programming formulations are given.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

Article 17 January 2019

Random Gradient-Free Minimization of Convex Functions

Article 30 November 2015

Relaxed Inertial Method for Solving Split Monotone Variational Inclusion Problem with Multiple Output Sets Without Co-coerciveness and Lipschitz Continuity

Article 15 April 2024

References

Kallenberg LCM (1983) Linear programming and finite markovian control problems. Mathematisch Centrum Tracts 148, Mathematisch Centrum Amsterdam
Google Scholar
Monahan GE (1982) A survey of partially observable markov decision processes: Theory, models, and algorithm. Management Science 28: 1–16
Google Scholar
Porteus EL (1971) Some bounds for discounted sequential decision processes. Management Science 18: 7–11
Google Scholar
Rockafellar RT (1970) Convex analysis. Princeton University Press New Jersey
Google Scholar
Stoer J, Witzgall C (1970) Convexity and optimisation in finite dimensions I. Springer-Verlag Berlin
Google Scholar
White CC (1991) A survey of solution techniques for the partially observed markov decision process. Ann OR 32: 215–230
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Economic and Social Studies, Department of Decision Theory, University of Manchester, M 13 9PL, Manchester, Great Britain
D. J. White

Authors

D. J. White
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

White, D.J. A superharmonic approach to solving infinite horizon partially observable Markov decision problems. ZOR - Methods and Models of Operations Research 41, 71–88 (1995). https://doi.org/10.1007/BF01415066

Download citation

Received: 15 May 1991
Revised: 15 January 1994
Issue Date: February 1995
DOI: https://doi.org/10.1007/BF01415066

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A superharmonic approach to solving infinite horizon partially observable Markov decision problems

Abstract

Access this article

Similar content being viewed by others

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

Random Gradient-Free Minimization of Convex Functions

Relaxed Inertial Method for Solving Split Monotone Variational Inclusion Problem with Multiple Output Sets Without Co-coerciveness and Lipschitz Continuity

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Key words

Navigation

A superharmonic approach to solving infinite horizon partially observable Markov decision problems

Abstract

Access this article

Similar content being viewed by others

Existence and Uniqueness of Quasi-stationary Distributions for Symmetric Markov Processes with Tightness Property

Random Gradient-Free Minimization of Convex Functions

Relaxed Inertial Method for Solving Split Monotone Variational Inclusion Problem with Multiple Output Sets Without Co-coerciveness and Lipschitz Continuity

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation