ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

1

Electronic Resource

Adaptive control of constrained Markov chains: Criteria and policies (1991)

Altman, Eitan ; Shwartz, Adam

Springer

Annals of operations research 28 (1991), S. 101-134

add to mindlist on the mindlist

Details

ISSN: 1572-9338

Source: Springer Online Journal Archives 1860-2000

Topics: Mathematics , Economics

Notes: Abstract We consider the constrained optimization of a finite-state, finite action Markov chain. In the adaptive problem, the transition probabilities are assumed to be unknown, and no prior distribution on their values is given. We consider constrained optimization problems in terms of several cost criteria which are asymptotic in nature. For these criteria we show that it is possible to achieve the same optimal cost as in the non-adaptive case. We first formulate a constrained optimization problem under each of the cost criteria and establish the existence of optimal stationary policies. Since the adaptive problem is inherently non-stationary, we suggest a class ofAsymptotically Stationary (AS) policies, and show that, under each of the cost criteria, the costs of an AS policy depend only on its limiting behavior. This property implies that there exist optimal AS policies. A method for generating adaptive policies is then suggested, which leads to strongly consistent estimators for the unknown transition probabilities. A way to guarantee that these policies are also optimal is to couple them with the adaptive algorithm of [3]. This leads to optimal policies for each of the adaptive constrained optimization problems under discussion.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF02055577

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

2

Electronic Resource

Sensitivity of constrained Markov decision processes (1991)

Altman, Eitan ; Shwartz, Adam

Springer

Annals of operations research 32 (1991), S. 1-22

add to mindlist on the mindlist

Details

ISSN: 1572-9338

Source: Springer Online Journal Archives 1860-2000

Topics: Mathematics , Economics

Notes: Abstract We consider the optimization of finite-state, finite-action Markov decision processes under constraints. Costs and constraints are of the discounted or average type, and possibly finite-horizon. We investigate the sensitivity of the optimal cost and optimal policy to changes in various parameters. We relate several optimization problems to a generic linear program, through which we investigate sensitivity issues. We establish conditions for the continuity of the optimal value in the discount factor. In particular, the optimal value and optimal policy for the expected average cost are obtained as limits of the dicounted case, as the discount factor goes to one. This generalizes a well-known result for the unconstrained case. We also establish the continuity in the discount factor for certain non-stationary policies. We then discuss the sensitivity of optimal policies and optimal values to small changes in the transition matrix and in the instantaneous cost functions. The importance of the last two results is related to the performance of adaptive policies for constrained MDP under various cost criteria [3,5]. Finally, we establish the convergence of the optimal value for the discounted constrained finite horizon problem to the optimal value of the corresponding infinite horizon problem.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF02204825

Permalink