Similar: Submodular information measures based active learning in realistic scenarios

Kothawade, Suraj; Beck, Nathan; Killamsetty, Krishnateja; Iyer, Rishabh

Citation Details

Active learning has proven to be useful for minimizing labeling costs by selecting the most informative samples. However, existing active learning methods do not work well in realistic scenarios such as imbalance or rare classes, out-of-distribution data in the unlabeled set, and redundancy. In this work, we propose SIMILAR (Submodular Information Measures based actIve LeARning), a unified active learning framework using recently proposed submodular information measures (SIM) as acquisition functions. We argue that SIMILAR not only works in standard active learning, but also easily extends to the realistic settings considered above and acts as a one-stop solution for active learning that is scalable to large real-world datasets. Empirically, we show that SIMILAR significantly outperforms existing active learning algorithms by as much as ~5% - 18% in the case of rare classes and ~5% - 10% in the case of out-of-distribution data on several image classification tasks like CIFAR-10, MNIST, and ImageNet. SIMILAR is available as a part of the DISTIL toolkit: "this https URL". more »

Award ID(s):: 2106937

PAR ID:: 10336074

Author(s) / Creator(s):: Kothawade, Suraj; Beck, Nathan; Killamsetty, Krishnateja; Iyer, Rishabh

Date Published:: 2021-12-01

Journal Name:: Advances in neural information processing systems

Volume:: 34

ISSN:: 1049-5258

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this