NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Testing conventional wisdom (of the crowd)

Burrell, Noah Burrell; Schoenebeck, Grant (July 2024, Proceedings of Machine Learning Research; Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence)
Evans, Robin J; Shpitser, Ilya (Ed.)
Do common assumptions about the way that crowd workers make mistakes in microtask (labeling) applications manifest in real crowdsourcing data? Prior work only addresses this question indirectly. Instead, it primarily focuses on designing new label aggregation algorithms, seeming to imply that better performance justifies any additional assumptions. However, empirical evidence in past instances has raised significant challenges to common assumptions. We continue this line of work, using crowdsourcing data itself as directly as possible to interrogate several basic assumptions about workers and tasks. We find strong evidence that the assumption that workers respond correctly to each task with a constant probability, which is common in theoretical work, is implausible in real data. We also illustrate how heterogeneity among tasks and workers can take different forms, which have different implications for the design and evaluation of label aggregation algorithms.
more » « less
Full Text Available
Strong Equilibria in Bayesian Games with Bounded Group Size

https://doi.org/10.1145/3696410.3714585

Han, Qishen; Schoenebeck, Grant; Tao, Biaoshuai; Xia, Lirong (April 2025, ACM)

Free, publicly-accessible full text available April 22, 2026
Eliciting Informative Text Evaluations with Large Language Models

https://doi.org/10.1145/3670865.3673532

Lu, Yuxuan; Xu, Shengwei; Zhang, Yichi; Kong, Yuqing; Schoenebeck, Grant (July 2024, ACM)

Full Text Available
Filter Bubble or Homogenization? Disentangling the Long-Term Effects of Recommendations on User Consumption Patterns

https://doi.org/10.1145/3589334.3645497

Anwar, Md Sanzeed; Schoenebeck, Grant; Dhillon, Paramveer S (May 2024, ACM)

Full Text Available
Spot Check Equivalence: An Interpretable Metric for Information Elicitation Mechanisms

https://doi.org/10.1145/3589334.3645679

Xu, Shengwei; Zhang, Yichi; Resnick, Paul; Schoenebeck, Grant (May 2024, ACM)

Full Text Available
Eliciting Honest Information from Authors Using Sequential Review

https://doi.org/10.1609/aaai.v38i9.28860

Zhang, Yichi; Schoenebeck, Grant; Su, Weijie (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

In the setting of conference peer review, the conference aims to accept high-quality papers and reject low-quality papers based on noisy review scores. A recent work proposes the isotonic mechanism, which can elicit the ranking of paper qualities from an author with multiple submissions to help improve the conference's decisions. However, the isotonic mechanism relies on the assumption that the author's utility is both an increasing and a convex function with respect to the review score, which is often violated in realistic settings (e.g.~when authors aim to maximize the number of accepted papers). In this paper, we propose a sequential review mechanism that can truthfully elicit the ranking information from authors while only assuming the agent's utility is increasing with respect to the true quality of her accepted papers. The key idea is to review the papers of an author in a sequence based on the provided ranking and conditioning the review of the next paper on the review scores of the previous papers. Advantages of the sequential review mechanism include: 1) eliciting truthful ranking information in a more realistic setting than prior work; 2) reducing the reviewing workload and increasing the average quality of papers being reviewed; 3) incentivizing authors to write fewer papers of higher quality.
more » « less
Full Text Available
Exit Ripple Effects: Understanding the Disruption of Socialization Networks Following Employee Departures

https://doi.org/10.1145/3589334.3645634

Gamba, David; Yu, Yulin; Yuan, Yuan; Schoenebeck, Grant; Romero, Daniel M (May 2024, ACM)

Full Text Available
Testing conventional wisdom (of the crowd)

Burrell, Noah Burrell; Schoenebeck, Grant (July 2023, Proceedings of Machine Learning Research; Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence)
Evans, Robin J; Shpitser, Ilya (Ed.)
Do common assumptions about the way that crowd workers make mistakes in microtask (labeling) applications manifest in real crowdsourcing data? Prior work only addresses this question indirectly. Instead, it primarily focuses on designing new label aggregation algorithms, seeming to imply that better performance justifies any additional assumptions. However, empirical evidence in past instances has raised significant challenges to common assumptions. We continue this line of work, using crowdsourcing data itself as directly as possible to interrogate several basic assumptions about workers and tasks. We find strong evidence that the assumption that workers respond correctly to each task with a constant probability, which is common in theoretical work, is implausible in real data. We also illustrate how heterogeneity among tasks and workers can take different forms, which have different implications for the design and evaluation of label aggregation algorithms.
more » « less
Full Text Available
Measurement Integrity in Peer Prediction: A Peer Assessment Case Study

https://doi.org/10.1145/3580507.3597744

Burrell, Noah; Schoenebeck, Grant (July 2023, ACM)

We propose measurement integrity, a property related to ex post reward fairness, as a novel desideratum for peer prediction mechanisms in many natural applications. Like robustness against strategic reporting, the property that has been the primary focus of the peer prediction literature, measurement integrity is an important consideration for understanding the practical performance of peer prediction mechanisms. We perform computational experiments, both with an agent-based model and with real data, to empirically evaluate peer prediction mechanisms according to both of these important properties. Our evaluations simulate the application of peer prediction mechanisms to peer assessment---a setting in which ex post fairness concerns are particularly salient. We find that peer prediction mechanisms, as proposed in the literature, largely fail to demonstrate significant measurement integrity in our experiments. We also find that theoretical properties concerning robustness against strategic reporting are somewhat noisy predictors of empirical performance. Further, there is an apparent trade-off between our two dimensions of analysis. The best-performing mechanisms in terms of measurement integrity are highly susceptible to strategic reporting. Ultimately, however, we show that supplementing mechanisms with realistic parametric statistical models can, in some cases, improve performance along both dimensions of our analysis and result in mechanisms that strike the best balance between them.
more » « less
Full Text Available
High-Effort Crowds: Limited Liability via Tournaments

https://doi.org/10.1145/3543507.3583334

Zhang, Yichi; Schoenebeck, Grant (April 2023, ACM)

We consider the crowdsourcing setting where, in response to the assigned tasks, agents strategically decide both how much effort to exert (from a continuum) and whether to manipulate their reports. The goal is to design payment mechanisms that (1) satisfy limited liability (all payments are non-negative), (2) reduce the principal’s cost of budget, (3) incentivize effort and (4) incentivize truthful responses. In our framework, the payment mechanism composes a performance measurement, which noisily evaluates agents’ effort based on their reports, and a payment function, which converts the scores output by the performance measurement to payments. Previous literature suggests applying a peer prediction mechanism combined with a linear payment function. This method can achieve either (1), (3) and (4), or (2), (3) and (4) in the binary effort setting. In this paper, we suggest using a rank-order payment function (tournament). Assuming Gaussian noise, we analytically optimize the rank-order payment function, and identify a sufficient statistic, sensitivity, which serves as a metric for optimizing the performance measurements. This helps us obtain (1), (2) and (3) simultaneously. Additionally, we show that adding noise to agents’ scores can preserve the truthfulness of the performance measurements under the non-linear tournament, which gives us all four objectives. Our real-data estimated agent-based model experiments show that our method can greatly reduce the payment of effort elicitation while preserving the truthfulness of the performance measurement. In addition, we empirically evaluate several commonly used performance measurements in terms of their sensitivities and strategic robustness.
more » « less
Full Text Available

« Prev Next »

Search for: All records