Does Active Learning Reduce Human Coding?: A Systematic Comparison of a Neural Network with nCoder

Choi, J.; Ruis, A. R.; Cai, Z.; Eagan, B. R.; Shaffer, D. W.

Citation Details

In quantitative ethnography (QE) studies, which often involve large datasets that cannot be entirely hand-coded by human raters, researchers have used supervised machine learning approaches to develop automated classifiers. However, QE researchers are rightly concerned with the amount of human coding that may be required to develop classifiers that achieve the high levels of accuracy that QE studies typically require. In this study, we compare a neural network, a powerful traditional supervised learning approach, with nCoder, an active learning technique commonly used in QE studies, to determine which technique requires less human coding to produce a sufficiently accurate classifier. To do this, we constructed multiple training sets from a large dataset used in prior QE studies and designed a Monte Carlo simulation to test the performance of the two techniques systematically. Our results show that nCoder can achieve high predictive accuracy with significantly less human-coded data than a neural network.

Award ID(s): 2100320

NSF-PAR ID: 10354410

Author(s) / Creator(s): Choi, J.; Ruis, A. R.; Cai, Z.; Eagan, B. R.; Shaffer, D. W.

Editor(s): Barany, A.; Damsa, C.

Date Published: 2022-01-01

Journal Name: Advances in Quantitative Ethnography: Fourth International Conference, International Conference on Quantitative Ethnography 2022

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Conference Paper: The DOI is not currently available.
