TY - GEN
T1 - Gaussian multi-armed bandit problems with multiple objectives
AU - Reverdy, Paul
N1 - Publisher Copyright:
© 2016 American Automatic Control Council (AACC).
PY - 2016/7/28
Y1 - 2016/7/28
N2 - Motivated by the goal of formally integrating human designers into computational systems for engineering design optimization, I study decision making under uncertainty with multiple objectives in the context of the multi-armed bandit problem. A key aspect of multi-objective optimization is the need for scalarization, i.e., a way to combine the various objectives into a single well-defined scalar objective function. I study the case where the multi-objective rewards are Gaussian distributed and the scalarization is linear and develop an algorithm that achieves optimal performance, i.e., converges to selecting the best arm at the highest possible rate.
AB - Motivated by the goal of formally integrating human designers into computational systems for engineering design optimization, I study decision making under uncertainty with multiple objectives in the context of the multi-armed bandit problem. A key aspect of multi-objective optimization is the need for scalarization, i.e., a way to combine the various objectives into a single well-defined scalar objective function. I study the case where the multi-objective rewards are Gaussian distributed and the scalarization is linear and develop an algorithm that achieves optimal performance, i.e., converges to selecting the best arm at the highest possible rate.
UR - http://www.scopus.com/inward/record.url?scp=84992065860&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84992065860&partnerID=8YFLogxK
U2 - 10.1109/ACC.2016.7526494
DO - 10.1109/ACC.2016.7526494
M3 - Conference contribution
AN - SCOPUS:84992065860
T3 - Proceedings of the American Control Conference
SP - 5263
EP - 5269
BT - 2016 American Control Conference, ACC 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 American Control Conference, ACC 2016
Y2 - 6 July 2016 through 8 July 2016
ER -