Title: Cost effective active search
We study a special paradigm of active learning, called cost effective active search, where the goal is to find a given number of positive points from a large unlabeled pool with minimum labeling cost. Most existing methods solve this problem heuristically, and few theoretical results have been established. We adopt a principled Bayesian approach for the first time. We first derive the Bayesian optimal policy and establish a strong hardness result: the optimal policy is hard to approximate, with the best-possible approximation ratio lower bounded by Ω(n^0.16). We then propose an efficient and nonmyopic policy using the negative Poisson binomial distribution, along with simple and fast approximations for computing its expectation, which plays an essential role in the proposed policy. We conduct comprehensive experiments in domains such as drug and materials discovery, and demonstrate that our proposed search procedure is superior to the widely used greedy baseline.
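The abstract leaves implicit how the expectation of the negative Poisson binomial distribution, i.e., the number of labels needed to collect a target number of positives when point i is positive independently with probability p_i, can be evaluated. Below is a minimal sketch of one exact computation for a fixed query order, truncated at the pool size; the function name expected_queries, the fixed ordering, and the truncation are assumptions of this sketch, and the fast approximations proposed in the paper are not reproduced here.

```python
import numpy as np

def expected_queries(probs, t):
    """Expected number of sequential label queries, truncated at the pool
    size, needed to observe t positives when point i is positive
    independently with probability probs[i] and points are queried in
    the given order.

    Uses E[min(T, n)] = sum_{k=0}^{n-1} P(T > k), where P(T > k) is the
    probability of fewer than t positives among the first k points,
    tracked with a Poisson-binomial dynamic program.
    """
    assert t >= 1
    n = len(probs)
    f = np.zeros(t)       # f[j] = P(exactly j positives seen so far), j < t
    f[0] = 1.0
    expectation = 0.0
    for k in range(n):
        expectation += f.sum()                    # adds P(T > k)
        p = probs[k]
        f[1:] = f[1:] * (1.0 - p) + f[:-1] * p    # keep or gain one positive
        f[0] *= (1.0 - p)                         # still zero positives
    return expectation

# Expected labels to find 2 positives from a pool queried in this order.
print(expected_queries([0.9, 0.2, 0.7, 0.1], t=2))
```

This dynamic program costs O(nt) time per candidate ordering, which is presumably why the paper develops cheaper approximations to the expectation for use inside a nonmyopic lookahead.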
Award ID(s):
1845434
PAR ID:
10140928
Author(s) / Creator(s):
; ;
Date Published:
Journal Name:
Advances in Neural Information Processing Systems
Volume:
32
Page Range / eLocation ID:
4880 - 4889
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Active search is a setting in adaptive experimental design where we aim to uncover members of rare, valuable class(es) subject to a budget constraint. An important consideration in this problem is diversity among the discovered targets – in many applications, diverse discoveries offer more insight and may be preferable in downstream tasks. However, most existing active search policies either assume that all targets belong to a common positive class or encourage diversity via simple heuristics. We present a novel formulation of active search with multiple target classes, characterized by a utility function chosen from a flexible family whose members encourage diversity among discoveries via a diminishing returns mechanism. We then study this problem under the Bayesian lens and prove a hardness result for approximating the optimal policy for arbitrary positive, increasing, and concave utility functions. Finally, we design an efficient, nonmyopic approximation to the optimal policy for this class of utilities and demonstrate its superior empirical performance in a variety of experimental settings, including drug discovery. A sketch of one such diminishing-returns utility is given after this list.
  2. Summary: Malaria is an infectious disease affecting a large population across the world, and interventions need to be efficiently applied to reduce the burden of malaria. We develop a framework to help policy-makers decide how to allocate limited resources in real time for malaria control. We formalize a resource-allocation policy as a sequence of decision rules, one per intervention time point, that map up-to-date disease-related information to a resource allocation. An optimal policy must control the spread of the disease while being interpretable and viewed as equitable by stakeholders. We construct an interpretable class of resource allocation policies that can accommodate allocation of resources residing in a continuous domain, and we combine a hierarchical Bayesian spatiotemporal model for disease transmission with a policy-search algorithm to estimate an optimal policy within the pre-specified class. The estimated optimal policy under the proposed framework improves the cumulative long-term outcome compared with naive approaches, both in simulation experiments and in an application to malaria interventions in the Democratic Republic of the Congo.
  3. Effective coordination of design teams must account for the influence of costs incurred while searching for the best design solutions. This article introduces a cost-aware multi-agent system (MAS), a theoretical model to (1) explain how individuals in a team should search, assuming that they are all rational utility-maximizing decision-makers, and (2) study the impact of cost on the search performance of both individual agents and the system. First, we develop a new multi-agent Bayesian optimization framework accounting for information exchange among agents to support their decisions on where to sample in search. Second, we employ a reinforcement learning approach based on the multi-agent deep deterministic policy gradient for training the MAS to identify where agents cannot sample due to design constraints. Third, we propose a new cost-aware stopping criterion that lets each agent determine when the costs of further search outweigh its potential gains. Our results indicate that cost has a more significant impact on MAS communication in complex design problems than in simple ones. For example, when searching in complex design spaces, some agents may initially see low performance gains and stop prematurely due to negative payoffs, even if they could perform better in later stages of the search. Therefore, global-local communication becomes more critical in such situations for the entire system to converge. The proposed model can serve as a benchmark for empirical studies to quantitatively gauge how humans would rationally make design decisions in a team. A sketch of one such cost-aware stopping rule is given after this list.
  4. This paper presents an inverse reinforcement learning (IRL) framework for Bayesian stopping time problems. By observing the actions of a Bayesian decision maker, we provide a necessary and sufficient condition to identify whether these actions are consistent with optimizing a cost function. In a Bayesian (partially observed) setting, the inverse learner can at best identify optimality with respect to the observed strategies. Our IRL algorithm identifies optimality and then constructs set-valued estimates of the cost function. To achieve this IRL objective, we use novel ideas from Bayesian revealed preferences stemming from microeconomics. We illustrate the proposed IRL scheme using two important examples of stopping time problems, namely, sequential hypothesis testing and Bayesian search. As a real-world example, we use a YouTube dataset comprising metadata from 190,000 videos to illustrate how the proposed IRL method predicts user engagement on online multimedia platforms with high accuracy. Finally, for finite datasets, we propose an IRL detection algorithm and give finite-sample bounds on its error probabilities.
  5. Crowdsourcing is an effective and efficient paradigm for obtaining labels for an unlabeled corpus by employing crowd workers. This work considers the budget allocation problem in a generalized setting on a graph of instances to be labeled, where edges encode instance dependencies. Specifically, given a graph and a labeling budget, we propose an optimal policy to allocate the budget among the instances so as to maximize the overall labeling accuracy. We formulate the problem as a Bayesian Markov Decision Process (MDP), where we define our task as an optimization problem that maximizes the overall label accuracy under budget constraints. We then propose a novel stage-wise reward function that considers the effect of worker labels on the whole graph at each time step. This reward function is used to find an optimal policy for the optimization problem. Theoretically, we show that our proposed policies are consistent when the budget is infinite. We conduct extensive experiments on five real-world graph datasets and demonstrate that the proposed policies achieve higher label accuracy under budget constraints.
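The first record above formulates multi-class active search with a utility chosen from a family of positive, increasing, concave functions that reward diversity through diminishing returns. The sketch below is a minimal illustrative instance of that idea, not the family used in that paper: the square-root per-class utility and the name diversity_utility are assumptions of this sketch.

```python
import math
from collections import Counter

def diversity_utility(target_classes, g=math.sqrt):
    """Diminishing-returns utility of a set of discovered targets:
    the sum over classes of a concave function g of the per-class count.

    A concave g makes the first hit in a new class worth more than yet
    another hit in an already well-covered class, rewarding diversity.
    """
    counts = Counter(target_classes)
    return sum(g(c) for c in counts.values())

# Four discoveries spread over four classes vs. four from a single class.
print(diversity_utility(["A", "B", "C", "D"]))  # 4 * sqrt(1) = 4.0
print(diversity_utility(["A", "A", "A", "A"]))  # sqrt(4)     = 2.0
```

Any strictly concave, increasing g with g(0) = 0 behaves the same way; the point is only to show how a diminishing-returns mechanism scores a diverse set of discoveries above an equally large but concentrated one.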
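The third record above has each agent stop searching once sampling costs outweigh potential gains, but the abstract does not state the exact rule. The sketch below shows one generic way such a cost-aware stopping criterion could look in a Bayesian-optimization-style search: stop when even the most promising candidate's expected improvement no longer exceeds the per-sample cost. The Gaussian expected-improvement form, the function names, and the threshold comparison are assumptions of this sketch, not the criterion from that paper.

```python
import math

def expected_improvement(mu, sigma, best_so_far):
    """Expected improvement of a candidate with posterior mean mu and
    standard deviation sigma over the incumbent best objective value."""
    if sigma <= 0.0:
        return max(mu - best_so_far, 0.0)
    z = (mu - best_so_far) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return (mu - best_so_far) * cdf + sigma * pdf

def should_stop(candidates, best_so_far, sample_cost):
    """Cost-aware stopping rule: stop when even the most promising
    candidate's expected gain does not pay for one more sample.
    candidates is a list of (posterior_mean, posterior_std) pairs."""
    best_gain = max(expected_improvement(m, s, best_so_far) for m, s in candidates)
    return best_gain <= sample_cost

# Incumbent value 1.0, each additional sample costs 0.05.
candidates = [(0.8, 0.3), (1.05, 0.2), (0.9, 0.5)]
print(should_stop(candidates, best_so_far=1.0, sample_cost=0.05))  # False: keep searching
```

Raising sample_cost, or shrinking the posterior standard deviations as the surrogate model becomes confident, eventually flips the rule to True, which mirrors the trade-off between search cost and expected gain that the abstract describes.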