Dynamic allocation policies for the finite horizon one armed bandit problem

Citation:

Burnetas, A.N. & Katehakis, M.N., 1998. Dynamic allocation policies for the finite horizon one armed bandit problem. Stochastic Analysis and Applications, 16, pp.811-824.