Title: Testing conventional wisdom (of the crowd)
Abstract: Do common assumptions about the way that crowd workers make mistakes in microtask (labeling) applications manifest in real crowdsourcing data? Prior work only addresses this question indirectly. Instead, it primarily focuses on designing new label aggregation algorithms, seeming to imply that better performance justifies any additional assumptions. However, empirical evidence in past instances has raised significant challenges to common assumptions. We continue this line of work, using crowdsourcing data itself as directly as possible to interrogate several basic assumptions about workers and tasks. We find strong evidence that the assumption that workers respond correctly to each task with a constant probability, which is common in theoretical work, is implausible in real data. We also illustrate how heterogeneity among tasks and workers can take different forms, which have different implications for the design and evaluation of label aggregation algorithms.
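To make the constant-accuracy ("one coin") assumption concrete, the sketch below is a hypothetical simulation, not the paper's own analysis: if every worker w answers every task correctly with a fixed probability p_w, then the number of workers who answer any given task correctly is the same Poisson-binomial draw for every task, so the variance of per-task correct counts across tasks should be close to sum_w p_w(1 - p_w). Shared task difficulty inflates that variance.

import numpy as np

rng = np.random.default_rng(0)
n_workers, n_tasks = 30, 2000
worker_acc = rng.uniform(0.6, 0.9, size=n_workers)   # hypothetical per-worker accuracies

def per_task_correct_counts(task_effect):
    # Accuracy of worker w on task t: base accuracy plus a task-level shift, clipped to [0, 1].
    acc = np.clip(worker_acc[:, None] + task_effect[None, :], 0.0, 1.0)
    correct = rng.random((n_workers, n_tasks)) < acc
    return correct.sum(axis=0)

constant_world = per_task_correct_counts(np.zeros(n_tasks))               # assumption holds
task_effects   = per_task_correct_counts(rng.normal(0.0, 0.15, n_tasks))  # tasks vary in difficulty

expected_var = np.sum(worker_acc * (1 - worker_acc))  # Poisson-binomial variance if accuracy is constant
print(f"variance implied by constant accuracy : {expected_var:.2f}")
print(f"observed variance, no task effects    : {constant_world.var():.2f}")
print(f"observed variance, with task effects  : {task_effects.var():.2f}")

In real data the per-worker accuracies would have to be estimated rather than known, but the same overdispersion logic applies: per-task agreement that varies more than the Poisson-binomial benchmark is evidence against the constant-probability model.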
Award ID(s):
2208662
NSF-PAR ID:
10526185
Author(s) / Creator(s):
Editor(s):
Evans, Robin J; Shpitser, Ilya
Publisher / Repository:
Proceedings of Machine Learning Research; Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence
Date Published:
Volume:
216
Page Range / eLocation ID:
237--248
Format(s):
Medium: X
Location:
https://proceedings.mlr.press/v216/burrell23a.html
Sponsoring Org:
National Science Foundation
More Like this
  1.
    The artificial intelligence (AI) industry has created new jobs that are essential to the real world deployment of intelligent systems. Part of the job focuses on labeling data for machine learning models or having workers complete tasks that AI alone cannot do. These workers are usually known as ‘crowd workers’—they are part of a large distributed crowd that is jointly (but separately) working on the tasks although they are often invisible to end-users, leading to workers often being paid below minimum wage and having limited career growth. In this chapter, we draw upon the field of human–computer interaction to provide research methods for studying and empowering crowd workers. We present our Computational Worker Leagues which enable workers to work towards their desired professional goals and also supply quantitative information about crowdsourcing markets. This chapter demonstrates the benefits of this approach and highlights important factors to consider when researching the experiences of crowd workers. 
  2.
    Allocation strategies improve the efficiency of crowdsourcing by decreasing the work needed to complete individual tasks accurately. However, these algorithms introduce bias by preferentially allocating workers onto easy tasks, leading to sets of completed tasks that are no longer representative of all tasks. This bias challenges inference of problem-wide properties such as typical task difficulty or crowd properties such as worker completion times, important information that goes beyond the crowd responses themselves. Here we study inference about problem properties when using an allocation algorithm to improve crowd efficiency. We introduce Decision-Explicit Probability Sampling (DEPS), a novel method to perform inference of problem properties while accounting for the potential bias introduced by an allocation strategy. Experiments on real and synthetic crowdsourcing data show that DEPS outperforms baseline inference methods while still leveraging the efficiency gains of the allocation method. The ability to perform accurate inference of general properties when using non-representative data allows crowdsourcers to extract more knowledge out of a given crowdsourced dataset.

     
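    As a purely illustrative companion to the abstract above (this is generic inverse-probability weighting, not the paper's DEPS procedure), the snippet below shows how preferentially allocating workers to easy tasks biases a naive estimate of typical task difficulty, and how reweighting by the inclusion probabilities removes the bias when those probabilities are known.

    import numpy as np

    rng = np.random.default_rng(0)
    difficulty = rng.beta(2, 5, size=10_000)            # hypothetical per-task difficulty in [0, 1]

    # Hypothetical allocation rule: easier tasks are more likely to be completed.
    p_complete = np.clip(1.0 - difficulty, 0.05, 0.95)
    completed = rng.random(difficulty.size) < p_complete

    naive = difficulty[completed].mean()                # biased toward easy tasks
    weights = 1.0 / p_complete[completed]               # inverse inclusion probabilities
    weighted = np.sum(weights * difficulty[completed]) / np.sum(weights)  # bias-corrected estimate

    print(f"true mean difficulty      : {difficulty.mean():.3f}")
    print(f"naive mean over completed : {naive:.3f}")
    print(f"weighted (IPW) estimate   : {weighted:.3f}")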
  3.
    Crowdsourcing markets provide workers with a centralized place to find paid work. What may not be obvious at first glance is that, in addition to the work they do for pay, crowd workers also have to shoulder a variety of unpaid invisible labor in these markets, which ultimately reduces workers' hourly wages. Invisible labor includes finding good tasks, messaging requesters, or managing payments. However, we currently know little about how much time crowd workers actually spend on invisible labor or how much it costs them economically. To ensure a fair and equitable future for crowd work, we need to be certain that workers are being paid fairly for ALL of the work they do. In this paper, we conduct a field study to quantify the invisible labor in crowd work. We build a plugin to record the amount of time that 100 workers on Amazon Mechanical Turk dedicate to invisible labor while completing 40,903 tasks. If we ignore the time workers spent on invisible labor, workers' median hourly wage was $3.76. But, we estimated that crowd workers in our study spent 33% of their time daily on invisible labor, dropping their median hourly wage to $2.83. We found that the invisible labor differentially impacts workers depending on their skill level and workers' demographics. The invisible labor category that took the most time and that was also the most common revolved around workers having to manage their payments. The second most time-consuming invisible labor category involved hyper-vigilance, where workers vigilantly watched over requesters' profiles for newly posted work or vigilantly searched for labor. We hope that through our paper, the invisible labor in crowdsourcing becomes more visible, and our results help to reveal the larger implications of the continuing invisibility of labor in crowdsourcing. 
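    For readers who want to see how the two wage figures relate, here is a back-of-the-envelope calculation (an assumed reading of the abstract's numbers, not the authors' code): if unpaid invisible labor adds roughly a third of a worker's paid time on top of it, the same paid earnings are spread over a longer working day, pulling the median hourly wage from $3.76 down to about $2.83.

    paid_hourly_wage = 3.76          # median hourly wage counting paid task time only (from the abstract)
    invisible_per_paid_hour = 0.33   # assumed: roughly 20 extra unpaid minutes for each paid hour

    effective_wage = paid_hourly_wage / (1 + invisible_per_paid_hour)
    print(f"effective hourly wage: ${effective_wage:.2f}")   # prints about $2.83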
  4.
    Crowdsourcing is widely used to create data for common natural language understanding tasks. Despite the importance of these datasets for measuring and refining model understanding of language, there has been little focus on the crowdsourcing methods used for collecting the datasets. In this paper, we compare the efficacy of interventions that have been proposed in prior work as ways of improving data quality. We use multiple-choice question answering as a testbed and run a randomized trial by assigning crowdworkers to write questions under one of four different data collection protocols. We find that asking workers to write explanations for their examples is an ineffective stand-alone strategy for boosting NLU example difficulty. However, we find that training crowdworkers, and then using an iterative process of collecting data, sending feedback, and qualifying workers based on expert judgments is an effective means of collecting challenging data. But using crowdsourced, instead of expert judgments, to qualify workers and send feedback does not prove to be effective. We observe that the data from the iterative protocol with expert assessments is more challenging by several measures. Notably, the human--model gap on the unanimous agreement portion of this data is, on average, twice as large as the gap for the baseline protocol data. 