Gaussian multi-armed bandit problems with multiple objectives

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

Motivated by the goal of formally integrating human designers into computational systems for engineering design optimization, I study decision making under uncertainty with multiple objectives in the context of the multi-armed bandit problem. A key aspect of multi-objective optimization is the need for scalarization, i.e., a way to combine the various objectives into a single well-defined scalar objective function. I study the case where the multi-objective rewards are Gaussian distributed and the scalarization is linear and develop an algorithm that achieves optimal performance, i.e., converges to selecting the best arm at the highest possible rate.

Original languageEnglish (US)
Title of host publication2016 American Control Conference, ACC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5263-5269
Number of pages7
ISBN (Electronic)9781467386821
DOIs
StatePublished - Jul 28 2016
Externally publishedYes
Event2016 American Control Conference, ACC 2016 - Boston, United States
Duration: Jul 6 2016Jul 8 2016

Publication series

NameProceedings of the American Control Conference
Volume2016-July
ISSN (Print)0743-1619

Other

Other2016 American Control Conference, ACC 2016
Country/TerritoryUnited States
CityBoston
Period7/6/167/8/16

ASJC Scopus subject areas

  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Gaussian multi-armed bandit problems with multiple objectives'. Together they form a unique fingerprint.

Cite this