This content will become publicly available on April 18, 2026

Title: The Insight-Inference Loop: Efficient Text Classification via Natural Language Inference and Threshold-Tuning
Modern computational text classification methods have brought social scientists tantalizingly close to the goal of unlocking vast insights buried in text data—from centuries of historical documents to streams of social media posts. Yet three barriers still stand in the way: the tedious labor of manual text annotation, the technical complexity that keeps these tools out of reach for many researchers, and, perhaps most critically, the challenge of bridging the gap between sophisticated algorithms and the deep theoretical understanding social scientists have already developed about human interactions, social structures, and institutions. To overcome these limitations, we propose an approach to large-scale text analysis that requires substantially less human-labeled data, demands no machine learning expertise, and efficiently integrates the social scientist into critical steps of the workflow. This approach, which allows the detection of statements in text, relies on large language models pre-trained for natural language inference and a "few-shot" threshold-tuning algorithm rooted in active learning principles. We describe and showcase our approach by analyzing tweets collected during the 2020 U.S. presidential election campaign, and benchmark it against various computational approaches across three datasets.
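The few-shot threshold-tuning idea can be illustrated with a minimal sketch (this is not the authors' implementation): given entailment probabilities that an NLI model would assign to (text, hypothesis) pairs, choose the decision cutoff that maximizes F1 on a small set of human-labeled examples. The scores and labels below are invented for illustration.

```python
# Hypothetical sketch of few-shot threshold tuning: the scores stand in
# for entailment probabilities from a pre-trained NLI model, and the
# labels are a handful of human judgments of whether the statement of
# interest is actually present in each text.

def f1(tp, fp, fn):
    """F1 score from counts; 0 when undefined."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def tune_threshold(scores, labels):
    """Return the candidate cutoff with the best F1 on the labeled set."""
    best_t, best_f1 = 0.5, -1.0
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and not y)
        fn = sum(1 for s, y in zip(scores, labels) if s < t and y)
        score = f1(tp, fp, fn)
        if score > best_f1:
            best_t, best_f1 = t, score
    return best_t

# A few labeled (entailment probability, statement present?) pairs:
scores = [0.95, 0.80, 0.62, 0.40, 0.33, 0.10]
labels = [True, True, True, False, False, False]
print(tune_threshold(scores, labels))  # 0.62 separates the classes cleanly
```

In practice the labeled examples would be selected iteratively, in the spirit of active learning, rather than fixed up front.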
Award ID(s):
2243822
PAR ID:
10646787
Author(s) / Creator(s):
 ;  ;  ;  ;  
Publisher / Repository:
Sage
Date Published:
Journal Name:
Sociological Methods & Research
ISSN:
0049-1241
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Abstract Qualitative coding, or content analysis, is more than just labeling text: it is a reflexive interpretive practice that shapes research questions, refines theoretical insights, and illuminates subtle social dynamics. As large language models (LLMs) become increasingly adept at nuanced language tasks, questions arise about whether—and how—they can assist in large-scale coding without eroding the interpretive depth that distinguishes qualitative analysis from traditional machine learning and other quantitative approaches to natural language processing. In this paper, we present a hybrid approach that preserves hermeneutic value while incorporating LLMs to scale the application of codes to large data sets that are impractical for manual coding. Our workflow retains the traditional cycle of codebook development and refinement, adding an iterative step to adapt definitions for machine comprehension, before ultimately replacing manual with automated text categorization. We demonstrate how to rewrite code descriptions for LLM-interpretation, as well as how structured prompts and prompting the model to explain its coding decisions (chain-of-thought) can substantially improve fidelity. Empirically, our case study of socio-historical codes highlights the promise of frontier AI language models to reliably interpret paragraph-long passages representative of a humanistic study. Throughout, we emphasize ethical and practical considerations, preserving space for critical reflection, and the ongoing need for human researchers’ interpretive leadership. These strategies can guide both traditional and computational scholars aiming to harness automation effectively and responsibly—maintaining the creative, reflexive rigor of qualitative coding while capitalizing on the efficiency afforded by LLMs. 
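The structured-prompt idea described above can be sketched as follows; the codebook entries, field names, and wording here are illustrative assumptions, not taken from the paper. The prompt renders machine-adapted code definitions and asks the model to explain its reasoning (chain-of-thought) before committing to a single code.

```python
# Hypothetical sketch: format a machine-adapted codebook and a passage
# into a structured coding prompt that elicits an explanation before
# the final label. Codebook contents are invented for illustration.

def build_coding_prompt(codebook, passage):
    """Render codebook definitions and a passage into one prompt string."""
    lines = ["You are applying a qualitative codebook to a passage.",
             "Codes:"]
    for code, definition in codebook.items():
        lines.append(f"- {code}: {definition}")
    lines += [
        "Passage:",
        passage,
        "First explain, step by step, which definitions the passage",
        "does or does not satisfy; then answer with exactly one code.",
    ]
    return "\n".join(lines)

codebook = {
    "SOLIDARITY": "Expressions of mutual aid or collective identity.",
    "GRIEVANCE": "Complaints about unfair treatment by institutions.",
}
prompt = build_coding_prompt(codebook, "They stood together at the gates.")
print(prompt)
```

The same prompt template can be re-run as the codebook is refined, preserving the iterative cycle the abstract describes.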
  2. Upon encountering this publication, one might ask the obvious question, "Why do we need another deep learning and natural language processing book?" Several excellent ones have been published, covering both theoretical and practical aspects of deep learning and its application to language processing. However, from our experience teaching courses on natural language processing, we argue that, despite their excellent quality, most of these books do not target their most likely readers. The intended reader of this book is one who is skilled in a domain other than machine learning and natural language processing and whose work relies, at least partially, on the automated analysis of large amounts of data, especially textual data. Such experts may include social scientists, political scientists, biomedical scientists, and even computer scientists and computational linguists with limited exposure to machine learning. Existing deep learning and natural language processing books generally fall into two camps. The first camp focuses on the theoretical foundations of deep learning. This is certainly useful to the aforementioned readers, as one should understand the theoretical aspects of a tool before using it. However, these books tend to assume the typical background of a machine learning researcher and, as a consequence, we have often seen students who do not have this background rapidly get lost in such material. To mitigate this issue, the second type of book that exists today focuses on the machine learning practitioner; that is, on how to use deep learning software, with minimal attention paid to the theoretical aspects. We argue that focusing on practical aspects is similarly necessary but not sufficient. Considering that deep learning frameworks and libraries have gotten fairly complex, the chance of misusing them due to theoretical misunderstandings is high. We have commonly seen this problem in our courses, too. 
This book, therefore, aims to bridge the theoretical and practical aspects of deep learning for natural language processing. We cover the necessary theoretical background and assume minimal machine learning background from the reader. Our aim is that anyone who took introductory linear algebra and calculus courses will be able to follow the theoretical material. To address practical aspects, this book includes pseudo code for the simpler algorithms discussed and actual Python code for the more complicated architectures. The code should be understandable by anyone who has taken a Python programming course. After reading this book, we expect that the reader will have the necessary foundation to immediately begin building real-world, practical natural language processing systems, and to expand their knowledge by reading research publications on these topics. https://doi.org/10.1017/9781009026222 
  3. In the past decade, a number of sophisticated AI-powered systems and tools have been developed and released to the scientific community and the public. These technical developments have occurred against a backdrop of political and social upheaval that is both magnifying and magnified by public health and macroeconomic crises. These technical and socio-political changes offer multiple lenses to contextualize (or distort) scientific reflexivity. Further, to computational social scientists who study computer-mediated human behavior, they have implications on what we study and how we study it. How should the ICWSM community engage with this changing world? Which disruptions should we embrace, and which ones should we resist? Whom do we ally with, and for what purpose? In this workshop co-located with ICWSM, we invited experience-based perspectives on these questions with the intent of drafting a collective research agenda for the computational social science community. We did so via the facilitation of collaborative position papers and the discussion of imminent challenges we face in the context of, for example, proprietary large language models, an increasingly unwieldy peer review process, and growing issues in data collection and access. This document presents a summary of the contributions and discussions in the workshop. 
  4. Recently, there have been significant advances and wide-scale use of generative AI in natural language generation. Models such as OpenAI’s GPT3 and Meta’s LLaMA are widely used in chatbots, to summarize documents, and to generate creative content. These advances raise concerns about abuses of these models, especially in social media settings, such as large-scale generation of disinformation, manipulation campaigns that use AI-generated content, and personalized scams. We used stylometry (the analysis of style in natural language text) to analyze the style of AI-generated text. Specifically, we applied an existing authorship verification (AV) model that can predict if two documents are written by the same author on texts generated by GPT2, GPT3, ChatGPT and LLaMA. Our AV model was trained only on human-written text and was effectively used in social media settings to analyze cases of abuse. We generated texts by providing the language models with fanfiction snippets and prompting them to complete the rest of it in the same writing style as the original snippet. We then applied the AV model across the texts generated by the language models and the human written texts to analyze the similarity of the writing styles between these texts. We found that texts generated with GPT2 had the highest similarity to the human texts. Texts generated by GPT3 and ChatGPT were very different from the human snippet, and were similar to each other. LLaMA-generated texts had some similarity to the original snippet but also had similarities with other LLaMA-generated texts and with texts from other models. We then conducted a feature analysis to identify the features that drive these similarity scores. This analysis helped us answer questions such as which features distinguish the language style of language models and humans, which features are different across different models, and how these linguistic features change over different language model versions. 
The dataset and the source code used in this analysis have been made public to allow for further analysis of new language models. 
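The stylometric core of this kind of analysis can be sketched in miniature. The paper uses a trained authorship-verification model; the toy version below, with invented feature choices and texts, only illustrates the underlying idea of representing each text as a vector of style features and comparing the vectors.

```python
# Illustrative sketch only (not the paper's AV model): represent each
# text by two crude style features -- average word length and the rate
# of common function words -- and compare texts by cosine similarity.

import math

FUNCTION_WORDS = {"the", "of", "and", "a", "to", "in", "is", "it"}

def style_vector(text):
    """Return (average word length, function-word rate) for a text."""
    words = text.lower().split()
    avg_len = sum(len(w) for w in words) / len(words)
    fw_rate = sum(1 for w in words if w in FUNCTION_WORDS) / len(words)
    return (avg_len, fw_rate)

def cosine(u, v):
    """Cosine similarity between two 2-D feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.hypot(*u) * math.hypot(*v))

a = style_vector("the cat sat on the mat and it purred")
b = style_vector("a dog ran to the park in the morning")
print(cosine(a, b))  # near 1.0: two similarly plain styles
```

Real stylometric systems use far richer features (character n-grams, syntax, punctuation habits), but the comparison machinery is analogous.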
  5. Abstract Large language models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and political ideology, then LLMs could augment the computational social science (CSS) pipeline in important ways. This work provides a road map for using LLMs as CSS tools. Towards this end, we contribute a set of prompting best practices and an extensive evaluation pipeline to measure the zero-shot performance of 13 language models on 25 representative English CSS benchmarks. On taxonomic labeling tasks (classification), LLMs fail to outperform the best fine-tuned models but still achieve fair levels of agreement with humans. On free-form coding tasks (generation), LLMs produce explanations that often exceed the quality of crowdworkers’ gold references. We conclude that the performance of today’s LLMs can augment the CSS research pipeline in two ways: (1) serving as zero-shot data annotators on human annotation teams, and (2) bootstrapping challenging creative generation tasks (e.g., explaining the underlying attributes of a text). In summary, LLMs are poised to meaningfully participate in social science analysis in partnership with humans. 
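One common way to quantify "agreement with humans" when an LLM joins an annotation team is Cohen's kappa between the model's labels and a human coder's labels. A minimal sketch, with invented labels (this is not the paper's evaluation code):

```python
# Hedged sketch: chance-corrected agreement between two annotators,
# here a human coder and an LLM acting as a zero-shot annotator.
# The label sequences are invented for illustration.

def cohens_kappa(a, b):
    """Cohen's kappa between two equal-length label sequences."""
    n = len(a)
    categories = set(a) | set(b)
    observed = sum(1 for x, y in zip(a, b) if x == y) / n
    expected = sum(
        (a.count(c) / n) * (b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)

human = ["pos", "pos", "neg", "neg", "pos", "neg"]
llm = ["pos", "pos", "neg", "pos", "pos", "neg"]
print(round(cohens_kappa(human, llm), 3))  # 0.667
```

Kappa corrects raw percent agreement for the agreement expected by chance, which matters when label distributions are skewed.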