Warm-starting contextual bandits: Robustly combining supervised and bandit feedback

Chicheng Zhang, Alekh Agarwal, Hal Daumé, John Langford, Sahand N. Negahban

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Fingerprint

Dive into the research topics of 'Warm-starting contextual bandits: Robustly combining supervised and bandit feedback'. Together they form a unique fingerprint.

Engineering & Materials Science

Social Sciences