ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Hits per page

hit 1 - 1 | 1 hit

Select All Export

Electronic Resource

On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes (1991)

Fernández-Gaucherand, Emmanuel ; Arapostathis, Aristotle ; Marcus, Steven I.

Springer

Annals of operations research 29 (1991), S. 439-469

add to mindlist on the mindlist

Details

ISSN: 1572-9338

Keywords: Optimal control ; Markov chains ; partial observability ; average cost ; optimality equation ; structured optimal policies

Source: Springer Online Journal Archives 1860-2000

Topics: Mathematics , Economics

Notes: Abstract We consider partially observable Markov decision processes with finite or countably infinite (core) state and observation spaces and finite action set. Following a standard approach, an equivalent completely observed problem is formulated, with the same finite action set but with anuncountable state space, namely the space of probability distributions on the original core state space. By developing a suitable theoretical framework, it is shown that some characteristics induced in the original problem due to the countability of the spaces involved are reflected onto the equivalent problem. Sufficient conditions are then derived for solutions to the average cost optimality equation to exist. We illustrate these results in the context of machine replacement problems. Structural properties for average cost optimal policies are obtained for a two state replacement problem; these are similar to results available for discount optimal policies. The set of assumptions used compares favorably to others currently available.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF02283610

	Location	Call Number	Expected	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

hit 1 - 1 | 1 hit