TY - GEN
T1 - Warm-starting contextual bandits
T2 - 36th International Conference on Machine Learning, ICML 2019
AU - Zhang, Chicheng
AU - Agarwal, Alekh
AU - Daumé, Hal
AU - Langford, John
AU - Negahban, Sahand N.
N1 - Publisher Copyright:
© 36th International Conference on Machine Learning, ICML 2019. All rights reserved.
PY - 2019
Y1 - 2019
N2 - We investigate the feasibility of learning from a mix of both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be different between these two data sources. Theoretically, we state and prove no-regret algorithms for learning that is robust to misaligned cost distributions between the two sources. Empirically, we evaluate some of these algorithms on a large selection of datasets, showing that our approach is both feasible, and helpful in practice.
AB - We investigate the feasibility of learning from a mix of both fully-labeled supervised data and contextual bandit data. We specifically consider settings in which the underlying learning signal may be different between these two data sources. Theoretically, we state and prove no-regret algorithms for learning that is robust to misaligned cost distributions between the two sources. Empirically, we evaluate some of these algorithms on a large selection of datasets, showing that our approach is both feasible, and helpful in practice.
UR - http://www.scopus.com/inward/record.url?scp=85078052153&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078052153&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85078052153
T3 - 36th International Conference on Machine Learning, ICML 2019
SP - 12723
EP - 12732
BT - 36th International Conference on Machine Learning, ICML 2019
PB - International Machine Learning Society (IMLS)
Y2 - 9 June 2019 through 15 June 2019
ER -