Title: Maximin Active Learning with Data-Dependent Norms
Overparameterized machine learning models are often fit perfectly to training data, yet remarkably generalize well to new data. However, learning good models can require an enormous amount of labeled training data. This challenge motivates the study of active learning algorithms that sequentially and adaptively request labels for "informative" examples from a large pool of unlabeled data. A maximin criterion was recently proposed for active learning specifically in the overparameterized and interpolating regime. Roughly speaking, the maximin criterion selects the example that is most difficult to interpolate, as measured by an appropriate norm on the interpolating function. Data-dependent norms perform best empirically, exhibiting intriguing adaptivity to cluster structure within the data. The main contribution of this paper is to mathematically characterize this behavior. Our main results show that the maximin criterion based on data-dependent norms provably discovers clusters and also automatically generates labeled coverings of the dataset.
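To make the criterion concrete, here is a minimal sketch for binary classification. It uses a plain RBF-kernel RKHS norm as a stand-in for the paper's data-dependent norms, and all function names and parameter choices are ours, not the paper's; it relies only on the standard identity that the minimum-norm kernel interpolant of (X, y) has squared norm y^T K^{-1} y.

```python
import numpy as np

# Illustrative sketch; kernel choice and names are ours. The paper's
# data-dependent norms (built from the unlabeled pool) would replace
# the fixed RBF norm used here.

def rbf_kernel(A, B, gamma=1.0):
    """Gaussian kernel matrix between row-sets A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def interp_norm_sq(X, y, gamma=1.0, eps=1e-8):
    """Squared RKHS norm of the minimum-norm interpolant of (X, y):
    ||f||^2 = y^T K^{-1} y, with K = k(X, X)."""
    K = rbf_kernel(X, X, gamma) + eps * np.eye(len(X))
    return float(y @ np.linalg.solve(K, y))

def maximin_query(X_lab, y_lab, X_pool, gamma=1.0):
    """Pick the pool point whose easier labeling (over +-1) is still
    hardest to interpolate: argmax_x min_y ||f_{x,y}||^2."""
    scores = []
    for x in X_pool:
        X_aug = np.vstack([X_lab, x])
        norms = [interp_norm_sq(X_aug, np.append(y_lab, s), gamma)
                 for s in (-1.0, +1.0)]
        scores.append(min(norms))   # norm under the "easier" label
    return int(np.argmax(scores))   # hardest example in the maximin sense
```

Because the norm grows sharply when a queried point cannot be smoothly interpolated from existing labels, repeated queries of this form tend to land in unexplored regions, which is the cluster-discovery behavior the paper characterizes.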
Award ID(s): 1740707
PAR ID: 10183608
Author(s) / Creator(s):
Date Published:
Journal Name: Annual Allerton Conference on Communication Control and Computing
Volume: 1
Issue: 1
ISSN: 2474-0195
Page Range / eLocation ID: 871-878
Format(s): Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. Generating labeled training datasets has become a major bottleneck in Machine Learning (ML) pipelines. Active ML aims to address this issue by designing learning algorithms that automatically and adaptively select the most informative examples for labeling so that human time is not wasted labeling irrelevant, redundant, or trivial examples. This paper proposes a new approach to active ML with nonparametric or overparameterized models such as kernel methods and neural networks. In the context of binary classification, the new approach is shown to possess a variety of desirable properties that allow active learning algorithms to automatically and efficiently identify decision boundaries and data clusters. 
  2. We study overparameterization in generative adversarial networks (GANs) that can interpolate the training data. We show that overparameterization can improve generalization performance and accelerate the training process. We study the generalization error as a function of latent space dimension and identify two main behaviors, depending on the learning setting. First, we show that overparameterized generative models that learn distributions by minimizing a metric or f-divergence do not exhibit double descent in generalization errors; specifically, all the interpolating solutions achieve the same generalization error. Second, we develop a new pseudo-supervised learning approach for GANs in which training uses pairs of fabricated (noise) inputs in conjunction with real output samples. Our pseudo-supervised setting exhibits double descent (and in some cases, triple descent) of generalization errors. We combine pseudo-supervision with overparameterization (i.e., an overly large latent space dimension) to accelerate training while achieving generalization performance better than, or close to, that obtained without pseudo-supervision. While our analysis focuses mostly on linear GANs, we also apply the resulting insights to improve the generalization of nonlinear, multilayer GANs.
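A minimal rendering of the pseudo-supervised setup described in item 2, under our own simplifying assumptions: a linear generator, fixed fabricated noise inputs paired one-to-one with real output samples, and an ordinary least-squares fit in place of adversarial training.

```python
import numpy as np

# Sketch of pseudo-supervision as described above; the linear generator,
# dimensions, and least-squares objective are our illustrative choices.
rng = np.random.default_rng(0)
n, d_latent, d_out = 100, 20, 50

X_real = rng.normal(size=(n, d_out))    # real output samples
Z = rng.normal(size=(n, d_latent))      # fabricated noise inputs, fixed per sample

# Linear generator G(z) = z W, fit by regressing the paired real samples
# on their fabricated inputs rather than by adversarial training.
W, *_ = np.linalg.lstsq(Z, X_real, rcond=None)

X_fake = rng.normal(size=(n, d_latent)) @ W   # draws from the trained generator
```

Note that overparameterization here corresponds to making d_latent large relative to n, which is the regime where the abstract reports double (or triple) descent.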
  3. A surprising phenomenon in modern machine learning is the ability of a highly overparameterized model to generalize well (small error on the test data) even when it is trained to memorize the training data (zero error on the training data). This has led to an arms race towards increasingly overparameterized models (cf. deep learning). In this paper, we study an underexplored hidden cost of overparameterization: the fact that overparameterized models may be more vulnerable to privacy attacks, in particular the membership inference attack that predicts the (potentially sensitive) examples used to train a model. We significantly extend the relatively few empirical results on this problem by theoretically proving, for an overparameterized linear regression model in the Gaussian data setting, that membership inference vulnerability increases with the number of parameters. Moreover, a range of empirical studies indicates that more complex, nonlinear models exhibit the same behavior. Finally, we extend our analysis towards ridge-regularized linear regression and show in the Gaussian data setting that increased regularization also increases membership inference vulnerability in the overparameterized regime.
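For intuition on why interpolation helps the attacker, the sketch below runs a simple loss-thresholding membership-inference baseline against a minimum-norm overparameterized linear regression; the setup and threshold are our illustrative choices, not the specific attack analyzed in the paper.

```python
import numpy as np

# Illustrative membership-inference baseline; all parameters are ours.
rng = np.random.default_rng(1)
n, d = 50, 200                            # overparameterized: d > n

w_true = rng.normal(size=d)
X_train = rng.normal(size=(n, d))
y_train = X_train @ w_true + 0.1 * rng.normal(size=n)

# Minimum-norm interpolating least-squares fit (pinv yields it when d > n).
w_hat = np.linalg.pinv(X_train) @ y_train

def mi_attack(x, y, tau=1e-3):
    """Guess 'member' when the squared residual falls below tau.
    Interpolation drives training residuals to ~0, exposing members."""
    return (y - x @ w_hat) ** 2 < tau

X_out = rng.normal(size=(n, d))           # non-members, same distribution
y_out = X_out @ w_true + 0.1 * rng.normal(size=n)
print(mi_attack(X_train, y_train).mean(), mi_attack(X_out, y_out).mean())
```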
  4. In this paper, we study kernel ridge-less regression, including the case of interpolating solutions. We prove that maximizing the leave-one-out (CV_loo) stability minimizes the expected error. Further, we also prove that the minimum norm solution, to which gradient algorithms are known to converge, is the most stable solution. More precisely, we show that the minimum norm interpolating solution minimizes a bound on CV_loo stability, which in turn is controlled by the smallest singular value, hence the condition number, of the empirical kernel matrix. These quantities can be characterized in the asymptotic regime where both the dimension (d) and cardinality (n) of the data go to infinity (with n/d fixed as n → ∞). Our results suggest that the property of CV_loo stability of the learning algorithm with respect to perturbations of the training set may provide a more general framework than the classical theory of Empirical Risk Minimization (ERM). While ERM was developed to deal with the classical regime in which the architecture of the learning network is fixed and n → ∞, the modern regime focuses on interpolating regressors and overparameterized models, when both d and n go to infinity. Since the stability framework is known to be equivalent to the classical theory in the classical regime, our results here suggest that it may be interesting to extend it beyond kernel regression to other overparameterized algorithms such as deep networks.
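The quantities in item 4 are easy to compute numerically. Below is a small self-contained sketch (our construction, with an arbitrary Gaussian kernel and random labels) that forms the empirical kernel matrix, solves for the minimum-norm interpolant, and reports its norm together with the smallest singular value and condition number that control the stability bound.

```python
import numpy as np

# Illustrative computation; kernel bandwidth and data are our choices.
rng = np.random.default_rng(2)
n, d = 40, 10
X = rng.normal(size=(n, d))
y = rng.choice([-1.0, 1.0], size=n)

# Empirical kernel matrix for a Gaussian kernel.
sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
K = np.exp(-sq / (2 * d))

# Minimum-norm interpolant f(x) = sum_i a_i k(x, x_i), with K a = y.
a = np.linalg.solve(K, y)

# Its RKHS norm, and the conditioning that controls CV_loo stability.
norm = np.sqrt(y @ a)                  # ||f||^2 = y^T K^{-1} y
s = np.linalg.svd(K, compute_uv=False)
print(norm, s[-1], s[0] / s[-1])       # norm, smallest singular value, condition number
```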
  5. Annotating medical images for the purposes of training computer vision models is an extremely laborious task that takes time and resources away from expert clinicians. Active learning (AL) is a machine learning paradigm that mitigates this problem by deliberately proposing data points that should be labeled in order to maximize model performance. We propose a novel AL algorithm for segmentation, ALGES, that utilizes gradient embeddings to effectively select laparoscopic images to be labeled by some external oracle while reducing annotation effort. Given any unlabeled image, our algorithm treats predicted segmentations as truth and computes gradients with respect to the model parameters of the last layer in a segmentation network. The norms of these per-pixel gradient vectors correspond to the magnitude of the induced change in model parameters and contain rich information about the model's predictive uncertainty. Our algorithm then computes gradient embeddings in two ways, and we employ a center-finding algorithm with these embeddings to procure representative and diverse batches in each round of AL. An advantage of our approach is its extensibility to any model architecture and differentiable loss scheme for semantic segmentation. We apply our approach to a public dataset of laparoscopic cholecystectomy images and show that it outperforms current AL algorithms in selecting the most informative data points for improving the segmentation model. Our code is available at https://github.com/josaklil-ai/surg-active-learning.
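The abstract in item 5 does not spell out its two embedding constructions, so the sketch below shows only the generic last-layer gradient-embedding idea it builds on, plus a greedy k-center selection for diverse batches; all names, shapes, and the pixel-averaging step are our assumptions, not the ALGES implementation.

```python
import numpy as np

# Generic sketch of gradient embeddings for segmentation AL; details are ours.

def image_gradient_embedding(feats, probs):
    """Per-image gradient embedding for a net whose last layer is linear
    (e.g. a 1x1 conv). feats: (P, F) penultimate features for P pixels;
    probs: (P, C) softmax outputs. Taking each pixel's argmax class as a
    pseudo-label, the cross-entropy gradient w.r.t. the last-layer weights
    is (probs - onehot) outer features; we average over pixels."""
    pseudo = probs.argmax(axis=1)
    err = probs.copy()
    err[np.arange(len(pseudo)), pseudo] -= 1.0    # p - onehot(pseudo-label)
    g = err[:, :, None] * feats[:, None, :]       # (P, C, F) per-pixel gradients
    return g.reshape(len(feats), -1).mean(axis=0)

def kcenter_batch(E, k):
    """Greedy k-center over image embeddings E (N, D): a simple
    center-finding routine for representative, diverse batches."""
    picked = [int(np.linalg.norm(E, axis=1).argmax())]   # arbitrary seed
    dist = np.linalg.norm(E - E[picked[0]], axis=1)
    for _ in range(k - 1):
        picked.append(int(dist.argmax()))                # farthest point
        dist = np.minimum(dist, np.linalg.norm(E - E[picked[-1]], axis=1))
    return picked
```

Embeddings with large norms flag images whose pseudo-labels would move the last-layer parameters the most, and the center-finding step keeps each labeled batch from collapsing onto near-duplicate frames.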