Coding for Crowdsourced Classification with XOR Queries

Pang, James Chin-Jen; Mahdavifar, Hessam; Pradhan, S. Sandeep

doi:10.1109/ITW44776.2019.8989252

Citation Details

Coding for Crowdsourced Classification with XOR Queries

This paper models the crowdsourced labeling/classification problem as a sparsely encoded source coding problem, where each query answer, regarded as a code bit, is the XOR of a small number of labels, as source information bits. In this paper we leverage the connections between this problem and well-studied codes with sparse representations for the channel coding problem to provide querying schemes with almost optimal number of queries, each of which involving only a constant number of labels. We also extend this scenario to the case where some workers can be unresponsive. For this case, we propose querying schemes where each query involves only $$\log n$$ items, where $$n$$ is the total number of items to be labeled. Furthermore, we consider classification of two correlated labeling systems and provide two-stage querying schemes with almost optimal number of queries each involving a constant number of labels. more »

Award ID(s):: 1763348

PAR ID:: 10177927

Author(s) / Creator(s):: Pang, James Chin-Jen; Mahdavifar, Hessam; Pradhan, S. Sandeep

Date Published:: 2019-08-01

Journal Name:: 2019 IEEE Information Theory Workshop (ITW)

Page Range / eLocation ID:: 1 to 5

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/ITW44776.2019.8989252

More Like this