Skip to main navigation Skip to search Skip to main content

Efficient contextual bandits with continuous actions

  • Maryam Majzoubi
  • , Chicheng Zhang
  • , Rajan Chari
  • , Akshay Krishnamurthy
  • , John Langford
  • , Aleksandrs Slivkins

Research output: Contribution to journalConference articlepeer-review

Abstract

We create a computationally tractable algorithm for contextual bandits with continuous actions having unknown structure. Our reduction-style algorithm composes with most supervised learning representations. We prove that it works in a general sense and verify the new functionality with large-scale experiments.

Original languageEnglish (US)
JournalAdvances in Neural Information Processing Systems
Volume2020-December
StatePublished - 2020
Event34th Conference on Neural Information Processing Systems, NeurIPS 2020 - Virtual, Online
Duration: Dec 6 2020Dec 12 2020

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Efficient contextual bandits with continuous actions'. Together they form a unique fingerprint.

Cite this