An experimental analysis of the bandit problem

Economic Theory

Summary.

We investigate, in an experimental setting, the behavior of single decision makers who, at discrete time intervals over an “infinite” horizon, may choose one action from a set of possible actions, where this set is constant over time, i.e., a bandit problem. Two bandit environments are examined: one in which the predicted behavior should always be myopic (the two-armed bandit) and one in which the predicted behavior should never be myopic (the one-armed bandit). We also investigate the comparative static predictions as the underlying parameters of the bandit environments are changed. The aggregate results show that behavior in the two bandit environments is quantitatively different and in the direction of the theoretical predictions.
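As a rough illustration of what "myopic" behavior means here, the sketch below picks, in each period, the action with the highest current expected payoff and ignores the informational value of experimenting. It is not the experimental design of the paper: the Bernoulli payoffs, the uniform Beta(1, 1) prior, and the names `BetaArm` and `myopic_choice` are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class BetaArm:
    """Belief about one arm's success probability (hypothetical helper).

    Assumes Bernoulli payoffs and a uniform Beta(1, 1) prior; neither is
    taken from the paper.
    """
    successes: int = 0
    failures: int = 0

    def mean(self) -> float:
        # Posterior mean of Beta(successes + 1, failures + 1).
        return (self.successes + 1) / (self.successes + self.failures + 2)

    def update(self, reward: int) -> None:
        # Record one observed payoff (1 = success, 0 = failure).
        if reward:
            self.successes += 1
        else:
            self.failures += 1


def myopic_choice(arms: Sequence[BetaArm]) -> int:
    """Return the index of the arm with the highest current expected payoff.

    Ties are broken in favor of the lowest index.
    """
    return max(range(len(arms)), key=lambda i: arms[i].mean())
```

For instance, with beliefs `[BetaArm(3, 1), BetaArm(1, 1)]` the posterior means are 4/6 and 2/4, so the myopic rule selects the first arm. A non-myopic decision maker might instead choose an arm with a lower current mean in order to learn about it.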

Additional information

Received: October 27, 1994; revised version: February 27, 1996

About this article

Cite this article

Banks, J., Olson, M. & Porter, D. An experimental analysis of the bandit problem. Economic Theory 10, 55–77 (1997). https://doi.org/10.1007/s001990050146

  • DOI: https://doi.org/10.1007/s001990050146
