Title: Annotating low-confidence questions improves classifier performance
This paper compares methods to select data for annotation in order to improve a classifier used in a question-answering dialogue system. With a classifier trained on 1,500 questions, adding 300 training questions on which the classifier is least confident results in consistently improved performance, whereas adding 300 arbitrarily selected training questions does not yield consistent improvement, and sometimes even degrades performance. The paper uses a new method for comparative evaluation of classifiers for dialogue, which scores each classifier based on the number of appropriate responses retrieved.
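The abstract does not include an implementation, but the selection step it describes (picking the questions on which the current classifier is least confident) is straightforward to sketch. The snippet below is a minimal illustration assuming a scikit-learn-style classifier that exposes predict_proba; the vectorizer, unlabeled pool, and variable names are hypothetical, not the authors' code.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

def select_least_confident(classifier, vectorizer, unlabeled_questions, k=300):
    """Return the k questions on which the classifier is least confident."""
    X = vectorizer.transform(unlabeled_questions)
    probs = classifier.predict_proba(X)       # (n_questions, n_classes)
    confidence = probs.max(axis=1)            # probability of the top class
    lowest = np.argsort(confidence)[:k]       # indices of the k least confident
    return [unlabeled_questions[i] for i in lowest]

# Hypothetical usage: fit on the initial 1,500 annotated questions, then pick
# 300 low-confidence questions from an unannotated pool to send for annotation.
# vectorizer = TfidfVectorizer().fit(seed_questions)
# classifier = LogisticRegression(max_iter=1000).fit(
#     vectorizer.transform(seed_questions), seed_labels)
# to_annotate = select_least_confident(classifier, vectorizer, unlabeled_pool, k=300)
```

In a full loop of the kind the abstract describes, the selected questions would be annotated, added to the training set, and the classifier retrained.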
Award ID(s): 1852583
NSF-PAR ID: 10313591
Journal Name: Proceedings of the 25th Workshop on the Semantics and Pragmatics of Dialogue - Poster Abstracts
Sponsoring Org: National Science Foundation
More Like this
  1. Computer-based job interview training, including virtual reality (VR) simulations, has gained popularity in recent years as a way to support autistic individuals, who face significant challenges and barriers in finding and maintaining employment. Although popular, these training systems often fail to capture the complexity and dynamism of an employment interview: dialogue management for the virtual conversational agent either relies on choosing from a menu of prespecified answers, or dialogue processing is based on keyword extraction from the interviewee's transcribed speech, which ties it to the interview script. We address this limitation through automated dialogue act classification via transfer learning, which allows intent to be recognized from user speech independently of the interview domain. We also redress the lack of training data for a domain-general job interview dialogue act classifier by providing an original dataset of responses to interview questions from 22 autistic participants within a virtual job interview platform. Participants' responses to a customized interview script were transcribed to text and annotated according to a custom 13-class dialogue act scheme. The best classifier was a fine-tuned bidirectional encoder representations from transformers (BERT) model, with an F1 score of 87%.
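No training code accompanies this abstract; the sketch below shows one common way to fine-tune BERT for a 13-class dialogue act scheme using the Hugging Face Transformers Trainer API. The checkpoint, hyperparameters, and dataset variables are illustrative assumptions, not the authors' setup.

```python
# Sketch of fine-tuning BERT for 13-way dialogue act classification with the
# Hugging Face Transformers Trainer API; hyperparameters are illustrative only.
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

NUM_DIALOGUE_ACTS = 13  # size of the custom dialogue act scheme described above

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=NUM_DIALOGUE_ACTS)

def tokenize(batch):
    # Truncate/pad transcribed interview responses to a fixed length for batching.
    return tokenizer(batch["text"], truncation=True,
                     padding="max_length", max_length=128)

# `train_ds` and `eval_ds` are assumed to be datasets.Dataset objects with
# "text" and "label" columns standing in for the annotated interview responses.
# train_ds = train_ds.map(tokenize, batched=True)
# eval_ds = eval_ds.map(tokenize, batched=True)

args = TrainingArguments(output_dir="dialogue-act-bert", num_train_epochs=3,
                         per_device_train_batch_size=16, learning_rate=2e-5)
# trainer = Trainer(model=model, args=args, train_dataset=train_ds, eval_dataset=eval_ds)
# trainer.train()
```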
  2. Exploration of a design space is the first step in identifying sets of high-performing solutions to complex engineering problems. For this purpose, Bayesian network classifiers (BNCs) have been shown to be effective for mapping regions of interest in the design space, even when those regions of interest exhibit complex topologies. However, identifying sets of desirable solutions can be difficult with a BNC when attempting to map a space where high-performance designs are spread sparsely among a disproportionately large number of low-performance designs, resulting in an imbalanced classifier. In this paper, a method is presented that utilizes probabilities of class membership for known training points, combined with interpolation between those points, to generate synthetic high-performance points in a design space. By adding synthetic design points into the BNC training set, a designer can rebalance an imbalanced classifier and improve classification accuracy throughout the space. For demonstration, this approach is applied to an acoustics metamaterial design problem with a sparse design space characterized by a combination of discrete and continuous design variables. Paper No: DETC2018-85274 
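The paper's exact procedure is not reproduced here; the snippet below is only a generic sketch of the idea of interpolating between known high-performance (minority-class) design points, using their class-membership probabilities as interpolation weights, to generate synthetic points for rebalancing the training set. The function name, weighting scheme, and variables are assumptions.

```python
import numpy as np

def synthesize_minority_points(X_high, p_high, n_new=100, seed=None):
    """Interpolate between pairs of known high-performance design points.

    X_high : (n, d) array of high-performance (minority-class) designs.
    p_high : (n,)  array of class-membership probabilities for those designs,
             used to bias each interpolation toward the more confident endpoint.
    """
    X_high = np.asarray(X_high, dtype=float)
    p_high = np.asarray(p_high, dtype=float)
    rng = np.random.default_rng(seed)
    i = rng.integers(0, len(X_high), size=n_new)
    j = rng.integers(0, len(X_high), size=n_new)
    # Interpolation weight favors the endpoint with higher membership probability.
    w = p_high[i] / (p_high[i] + p_high[j])
    X_synth = w[:, None] * X_high[i] + (1.0 - w)[:, None] * X_high[j]
    # For mixed discrete/continuous design spaces, discrete coordinates would
    # additionally need to be snapped back to valid values before training.
    return X_synth

# Hypothetical usage: label the synthetic points as high-performance and append
# them to the Bayesian network classifier's training set to rebalance it.
# X_train = np.vstack([X_train, synthesize_minority_points(X_high, p_high, 200)])
# y_train = np.concatenate([y_train, np.ones(200, dtype=int)])
```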
  3. Ground truth depth information is necessary for many computer vision tasks. Collecting this information is challenging, especially for outdoor scenes. In this work, we propose utilizing single-view depth prediction neural networks pre-trained on synthetic scenes to generate relative depth, which we call pseudo-depth. This approach is a less expensive option, as the pre-trained neural network obtains accurate depth information from synthetic scenes, which does not require any expensive sensor equipment and takes less time. We measure the usefulness of pseudo-depth from pre-trained neural networks by training indoor/outdoor binary classifiers with and without it. We also compare the difference in accuracy between using pseudo-depth and ground truth depth. We experimentally show that adding pseudo-depth to training achieves a 4.4% performance boost over the non-depth baseline model on DIODE, a large standard test dataset, retaining 63.8% of the performance boost achieved from training a classifier on RGB and ground truth depth. It also boosts performance by 1.3% on another dataset, SUN397, for which ground truth depth is not available. Our result shows that it is possible to take information obtained from a model pre-trained on synthetic scenes and successfully apply it beyond the synthetic domain to real-world data.
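As a rough illustration of how pseudo-depth can be fed to such a classifier, the sketch below concatenates a predicted relative-depth map with the RGB image as a fourth input channel of a small PyTorch binary classifier. The depth model and the network architecture here are placeholders, not the models used in the paper.

```python
import torch
import torch.nn as nn

class RGBDClassifier(nn.Module):
    """Binary indoor/outdoor classifier over RGB plus a pseudo-depth channel."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(4, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(64, 2)  # indoor vs. outdoor

    def forward(self, rgb, pseudo_depth):
        # pseudo_depth: relative depth predicted by a network pre-trained on
        # synthetic scenes, concatenated with RGB as a fourth input channel.
        x = torch.cat([rgb, pseudo_depth], dim=1)          # (B, 4, H, W)
        return self.head(self.features(x).flatten(1))      # (B, 2) logits

# Hypothetical usage with a frozen, pre-trained single-view depth model:
# with torch.no_grad():
#     pseudo_depth = depth_model(rgb)                      # (B, 1, H, W)
# logits = RGBDClassifier()(rgb, pseudo_depth)
```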
  4. Communication between humans and mobile agents is becoming increasingly important as such agents are widely deployed in our daily lives. Vision-and-Dialogue Navigation is one of the tasks that evaluate an agent's ability to interact with humans for assistance and to navigate based on natural language responses. In this paper, we explore the Navigation from Dialogue History (NDH) task, which is based on the Cooperative Vision-and-Dialogue Navigation (CVDN) dataset, and present a state-of-the-art model built upon Vision-Language transformers. However, despite achieving competitive performance, we find that the agent in the NDH task is not evaluated appropriately by the primary metric, Goal Progress. By analyzing the performance mismatch between Goal Progress and other metrics (e.g., normalized Dynamic Time Warping) from our state-of-the-art model, we show that NDH's sub-path based task setup (i.e., navigating a partial trajectory based on its corresponding subset of the full dialogue) does not provide the agent with enough supervision signal towards the goal region. Therefore, we propose a new task setup called NDH-Full, which takes the full dialogue and the whole navigation path as one instance. We present a strong baseline model and show initial results on this new task. We further describe several approaches that we tried in order to improve model performance (based on curriculum learning, pre-training, and data augmentation), suggesting potentially useful training methods for this new NDH-Full task.
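The analysis above hinges on the gap between Goal Progress and path-fidelity metrics such as normalized Dynamic Time Warping. As a generic illustration (not the benchmark's official evaluation code), Goal Progress is typically computed as the reduction in distance to the goal between the start and end of the agent's path:

```python
import numpy as np

def goal_progress(start, end, goal, dist):
    """Goal Progress: how much closer the agent ends to the goal than it started.

    `dist` is a distance function over the environment (e.g., geodesic distance
    on the navigation graph); this is a generic illustration, not the CVDN/NDH
    benchmark's official evaluation script.
    """
    return dist(start, goal) - dist(end, goal)

# Hypothetical usage with straight-line distance standing in for graph distance.
euclidean = lambda a, b: float(np.linalg.norm(np.asarray(a) - np.asarray(b)))
print(goal_progress(start=(0.0, 0.0), end=(3.0, 4.0), goal=(6.0, 8.0), dist=euclidean))
# -> 5.0 (the agent finished 5 units closer to the goal than it started)
```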
  5. Mitrovic, A.; Bosch, N. (Eds.)
    Regular expression (regex) coding has advantages for text analysis. Humans are often able to quickly construct intelligible coding rules with high precision; that is, researchers can identify words and word patterns that correctly classify examples of a particular concept. It is also often easy to identify false positives and improve the regex classifier so that positive items are accurately captured. However, ensuring that a regex list is complete is a bigger challenge, because the concepts to be identified in data are often sparsely distributed, which makes it difficult to identify examples of false negatives. For this reason, regex-based classifiers suffer from low recall; that is, they often miss items that should be classified as positive. In this paper, we provide a neural network solution to this problem by identifying a negative reversion set, in which false negative items occur much more frequently than in the data set as a whole. Thus, the regex classifier can be improved more quickly by adding missing regexes based on the false negatives found in the negative reversion set. This study used an existing data set collected from a simulation-based learning environment for which researchers had previously defined six codes and developed classifiers with validated regex lists. We randomly constructed incomplete (partial) regex lists and used neural network models to identify negative reversion sets in which the frequency of false negatives increased from a range of 3%-8% in the full data set to a range of 12%-52% in the negative reversion set. Based on this finding, we propose an interactive coding mechanism in which human-developed regex classifiers provide input for training machine learning algorithms, and the machine learning algorithms "smartly" select highly suspected false negative items so that humans can develop regex classifiers more quickly.
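As a minimal illustration of the interaction described above, the sketch below labels items with a regex list and then uses a separately trained probabilistic classifier to surface regex-negative items that the model still scores as likely positive, i.e., a candidate negative reversion set for human review. The regexes, model interface, and threshold are placeholder assumptions, not the study's artifacts.

```python
import re

def negative_reversion_set(texts, regexes, model, threshold=0.5):
    """Regex-negative items that a trained classifier still scores as likely positive.

    `model` is assumed to expose predict_proba over raw texts (e.g., a
    scikit-learn Pipeline of TfidfVectorizer + LogisticRegression); the
    regexes, model, and threshold are placeholders.
    """
    patterns = [re.compile(r, re.IGNORECASE) for r in regexes]
    # Items the (possibly incomplete) regex list labels as negative.
    regex_negative = [t for t in texts if not any(p.search(t) for p in patterns)]
    probs = model.predict_proba(regex_negative)[:, 1]  # P(positive) per item
    # False negatives of the regex list should be concentrated here, so these
    # items are handed to a human coder to identify missing regexes.
    return [t for t, p in zip(regex_negative, probs) if p >= threshold]
```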