TY - GEN
T1 - Algorithmic models of human decision making in Gaussian multi-armed bandit problems
AU - Reverdy, Paul
AU - Srivastava, Vaibhav
AU - Leonard, Naomi E.
N1 - Publisher Copyright:
© 2014 EUCA.
PY - 2014/7/22
Y1 - 2014/7/22
N2 - We consider a heuristic Bayesian algorithm as a model of human decision making in multi-armed bandit problems with Gaussian rewards. We derive a novel upper bound on the Gaussian inverse cumulative distribution function and use it to show that the algorithm achieves logarithmic regret. We extend the algorithm to allow for stochastic decision making using Boltzmann action selection with a dynamic temperature parameter and provide a feedback rule for tuning the temperature parameter such that the stochastic algorithm achieves logarithmic regret. The stochastic algorithm encodes many of the observed features of human decision making.
AB - We consider a heuristic Bayesian algorithm as a model of human decision making in multi-armed bandit problems with Gaussian rewards. We derive a novel upper bound on the Gaussian inverse cumulative distribution function and use it to show that the algorithm achieves logarithmic regret. We extend the algorithm to allow for stochastic decision making using Boltzmann action selection with a dynamic temperature parameter and provide a feedback rule for tuning the temperature parameter such that the stochastic algorithm achieves logarithmic regret. The stochastic algorithm encodes many of the observed features of human decision making.
UR - http://www.scopus.com/inward/record.url?scp=84911479192&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84911479192&partnerID=8YFLogxK
U2 - 10.1109/ECC.2014.6862580
DO - 10.1109/ECC.2014.6862580
M3 - Conference contribution
AN - SCOPUS:84911479192
T3 - 2014 European Control Conference, ECC 2014
SP - 2210
EP - 2215
BT - 2014 European Control Conference, ECC 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th European Control Conference, ECC 2014
Y2 - 24 June 2014 through 27 June 2014
ER -