ISSN:
1572-9338
Keyword(s):
Bandit process; Gittins' index; Markov decision process; stopping time; strategy evaluation
Source:
Springer Online Journal Archives 1860-2000
Subject:
Mathematics, Economics
Notes:
Abstract: Glazebrook [1] has given an account of improved procedures for strategy evaluation for resource allocation in a stochastic environment. These methods are extended in the paper in such a way that they can be applied to problems which, for example, have precedence constraints and/or an arrivals process of new jobs. Theoretical results, backed up by numerical studies, show that quasi-myopic heuristics often perform well.
Material type:
Digital media
URL:
http://dx.doi.org/10.1007/BF02204822