NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning to Ignore Adversarial Attacks

https://doi.org/10.18653/v1/2023.eacl-main.216

Zhang, Yiming; Zhou, Yangqiaoyu; Carton, Samuel; Tan, Chenhao (September 2023, Association for Computational Linguistics)

Full Text Available
What to Learn, and How: Toward Effective Learning from Rationales

https://doi.org/10.18653/v1/2022.findings-acl.86

Carton, Samuel; Kanoria, Surya; Tan, Chenhao (January 2022, Findings of the Association for Computational Linguistics: ACL 2022)

Full Text Available
Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation

https://doi.org/10.1145/3491102.3501999

Lai, Vivian; Carton, Samuel; Bhatnagar, Rajat; Liao, Q. Vera; Zhang, Yunfeng; Tan, Chenhao (April 2022, Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems)

Full Text Available
Explainable Prediction of Text Complexity: The Missing Preliminaries for Text Simplification

https://doi.org/10.18653/v1/2021.acl-long.88

Garbacea, Cristina; Guo, Mengtian; Carton, Samuel; Mei, Qiaozhu (January 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing)

Full Text Available
Feature-Based Explanations Don't Help People Detect Misclassifications of Online Toxicity

Carton, Samuel; Mei, Qiaozhu; Resnick, Paul (May 2020, Proceedings of the Fourteenth International AAAI Conference on Web and Social Media)
null (Ed.)
We present an experimental assessment of the impact of feature attribution-style explanations on human performance in predicting the consensus toxicity of social media posts with advice from an unreliable machine learning model. By doing so we add to a small but growing body of literature inspecting the utility of interpretable machine learning in terms of human outcomes. We also evaluate interpretable machine learning for the first time in the important domain of online toxicity, where fully-automated methods have faced criticism as being inadequate as a measure of toxic behavior. We find that, contrary to expectations, explanations have no significant impact on accuracy or agreement with model predictions, through they do change the distribution of subject error somewhat while reducing the cognitive burden of the task for subjects. Our results contribute to the recognition of an intriguing expectation gap in the field of interpretable machine learning between the general excitement the field has engendered and the ambiguous results of recent experimental work, including this study.
more » « less
Full Text Available
Feature-Based Explanations Don't Help People Detect Misclassifications of Online Toxicity

Carton, Samuel; Mei, Qiaozhu; Resnick, Paul (January 2020, Proceedings of the International AAAI Conference on Weblogs and Social Media)
null (Ed.)
Full Text Available
Evaluating and Characterizing Human Rationales

https://doi.org/10.18653/v1/2020.emnlp-main.747

Carton, Samuel; Rathore, Anirudh; Tan, Chenhao (January 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP))
null (Ed.)
Full Text Available
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation

Garbacea, Cristina; Carton, Samuel; Yan, Shiyan; Mei, Qiaozhu (January 2019, Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing)

We conduct a large-scale, systematic study to evaluate the existing evaluation methods for natural language generation in the context of generating online product reviews. We compare human-based evaluators with a variety of automated evaluation procedures, including discriminative evaluators that measure how well machine-generated text can be distinguished from human-written text, as well as word overlap metrics that assess how similar the generated text compares to human-written references. We determine to what extent these different evaluators agree on the ranking of a dozen of state-of-the-art generators for online product reviews. We find that human evaluators do not correlate well with discriminative evaluators, leaving a bigger question of whether adversarial accuracy is the correct objective for natural language generation. In general, distinguishing machine-generated text is challenging even for human evaluators, and human decisions correlate better with lexical overlaps. We find lexical diversity an intriguing metric that is indicative of the assessments of different evaluators. A post-experiment survey of participants provides insights into how to evaluate and improve the quality of natural language generation systems.
more » « less
Full Text Available
Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts

https://doi.org/10.18653/v1/D18-1386

Carton, Samuel; Mei, Qiaozhu; Resnick, Paul (October 2018, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing)

We introduce an adversarial method for producing high-recall explanations of neural text classifier decisions. Building on an existing architecture for extractive explanations via hard attention, we add an adversarial layer which scans the residual of the attention for remaining predictive signal. Motivated by the important domain of detecting personal attacks in social media comments, we additionally demonstrate the importance of manually setting a semantically appropriate “default” behavior for the model by explicitly manipulating its bias term. We develop a validation set of human-annotated personal attacks to evaluate the impact of these changes.
more » « less
Full Text Available
Extractive Adversarial Networks: High-Recall Explanations for Identifying Personal Attacks in Social Media Posts

https://doi.org/https://www.aclweb.org/anthology/D18-1386

Carton, Samuel; Mei, Qiaozhu; Resnick, Paul (January 2018, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing)

Full Text Available

Search for: All records