Abstract
We give a computationally-efficient PAC active learning algorithm for d-dimensional homogeneous halfspaces that can tolerate Massart noise (Massart and Nédélec, 2006) and Tsybakov noise (Tsybakov, 2004). Specialized to the η-Massart noise setting, our algorithm achieves an information-theoretically near-optimal label complexity of (Equation presented) under a wide range of unlabeled data distributions (specifically, the family of “structured distributions” defined in Diakonikolas et al. (2020a)). Under the more challenging Tsybakov noise condition, we identify two subfamilies of noise conditions, under which our efficient algorithm provides label complexity guarantees strictly lower than passive learning algorithms.
Original language | English (US) |
---|---|
Pages (from-to) | 4526-4527 |
Number of pages | 2 |
Journal | Proceedings of Machine Learning Research |
Volume | 134 |
State | Published - 2021 |
Event | 34th Conference on Learning Theory, COLT 2021 - Boulder, United States Duration: Aug 15 2021 → Aug 19 2021 |
Keywords
- Active learning
- halfspaces
- noise-tolerant learning
ASJC Scopus subject areas
- Artificial Intelligence
- Software
- Control and Systems Engineering
- Statistics and Probability