Agents with aberrant behavior are commonplace in today’s networks: fake profiles on social media, malicious websites on the internet, and fake news sources that are prolific in spreading misinformation. The distinguishing characteristic of networks with aberrant agents is that normal agents rarely link to aberrant ones. Based on this behavior, we propose a directed Markov Random Field (MRF) formulation for detecting aberrant agents. The formulation balances two objectives: having as few links as possible from normal to aberrant agents, and deviating minimally from prior information (if given). The MRF formulation is solved optimally and efficiently. We compare the optimal solution of the MRF formulation to existing algorithms, including PageRank, TrustRank, and AntiTrustRank. To assess the performance of these algorithms, we present a variant of the modularity clustering metric that overcomes the known shortcomings of modularity in directed graphs. We show that this new metric has desirable properties and prove that optimizing it is NP-hard. In an empirical experiment with twenty-three different datasets, we demonstrate that the MRF method outperforms the other detection algorithms.
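To make the balance between the two objectives concrete, a schematic of the kind of energy such a formulation might minimize is sketched below. The notation (trade-off weights λ and c_i, prior labels x̂_i) is an assumption for illustration, not the paper's exact formulation.

```latex
% Schematic MRF-style energy over labels x_i \in {normal, aberrant}.
% \lambda and c_i are assumed weights; \hat{x}_i is the prior label (if given).
\min_{x}\;
  \lambda \sum_{(i,j) \in E}
    \mathbf{1}\!\left[x_i = \mathrm{normal}\right]\,
    \mathbf{1}\!\left[x_j = \mathrm{aberrant}\right]
  \;+\;
  \sum_{i \in V} c_i\, \mathbf{1}\!\left[x_i \neq \hat{x}_i\right]
```

The first term charges for every link from a normal agent to an aberrant one; the second charges for overriding the prior label of agent i.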
A Comparison of Stemming Techniques in Tracing
We examine the effects of stemming on the tracing of software engineering artifacts. We compare two common stemming algorithms to each other, as well as to a baseline of no stemming, on eight tracing datasets. We run the experiment using the TraceLab experimental framework to facilitate repeatability and knowledge sharing within the tracing community. We compare the algorithms on precision at recall levels of [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0], as well as on mean average precision. The experiment indicated that neither the Porter stemmer nor the Krovetz stemmer outperformed the other on all datasets tested.
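As a reference for the metrics used in the comparison, the sketch below computes interpolated precision at fixed recall levels and average precision for a ranked list of candidate trace links. This is a generic IR-metric sketch, not the TraceLab components used in the experiment; the input names `ranked` and `relevant` are hypothetical.

```python
# Generic sketch of the two evaluation metrics named above.
# `ranked` is a list of candidate trace links sorted by score (descending);
# `relevant` is the set of true links. Both are hypothetical inputs.

def precision_at_recall(ranked, relevant, levels=(0.1, 0.2, 0.3, 0.4, 0.5,
                                                  0.6, 0.7, 0.8, 0.9, 1.0)):
    """Interpolated precision at each fixed recall level."""
    hits, points = 0, []
    for i, link in enumerate(ranked, start=1):
        if link in relevant:
            hits += 1
            points.append((hits / len(relevant), hits / i))  # (recall, precision)
    # Standard interpolation: best precision achieved at or beyond each level.
    return {r: max((p for rec, p in points if rec >= r), default=0.0)
            for r in levels}

def average_precision(ranked, relevant):
    """Mean of the precision values at the rank of each relevant link."""
    hits, total = 0, 0.0
    for i, link in enumerate(ranked, start=1):
        if link in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0
```

Mean average precision is then the mean of `average_precision` over all queries in a dataset.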
- Award ID(s): 1642134
- PAR ID: 10094739
- Date Published:
- Journal Name: Proceedings of the 10th International Workshop on Software and System Traceability (SST'19) at the International Conference on Software Engineering
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
- Wang, N.; Rebolledo-Mendez, G.; Matsuda, N.; Santos, O.C.; Dimitrova, V. (Eds.) Students use learning analytics systems to make day-to-day learning decisions, but may not understand their potential flaws. This work delves into student understanding of an example learning analytics algorithm, Bayesian Knowledge Tracing (BKT), using Cognitive Task Analysis (CTA) to identify the knowledge components (KCs) that comprise expert student understanding. We built an interactive explanation to target these KCs and performed a controlled experiment examining how varying the transparency of BKT's limitations impacts understanding and trust. Our results show that, counterintuitively, providing some information on the algorithm's limitations is not always better than providing no information. The success of the methods from our BKT study suggests avenues for using CTA to systematically build evidence-based explanations that increase end-user understanding of other complex AI algorithms, in learning analytics as well as in other domains.
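  For readers unfamiliar with BKT, the sketch below shows its standard per-observation update: the posterior probability of mastery given a correct or incorrect response, followed by the learning transition. The parameter values are illustrative assumptions, not those used in the study.

```python
# Minimal Bayesian Knowledge Tracing update; parameter values are invented.

def bkt_update(p_know, correct, slip=0.1, guess=0.2, learn=0.15):
    """Posterior that the skill is known after one observation,
    then apply the learning transition."""
    if correct:
        cond = p_know * (1 - slip) / (p_know * (1 - slip) + (1 - p_know) * guess)
    else:
        cond = p_know * slip / (p_know * slip + (1 - p_know) * (1 - guess))
    return cond + (1 - cond) * learn  # chance the skill is learned this step

p = 0.3  # assumed prior probability of mastery
for obs in [True, True, False, True]:
    p = bkt_update(p, obs)
    print(round(p, 3))
```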
- The emerging ray-tracing (RT) cores on GPUs have recently been repurposed by researchers for non-ray-tracing tasks. In this paper, we explore the benefits and effectiveness of executing graph algorithms on RT cores. We redesign breadth-first search and triangle counting on the new hardware as representative graph algorithms. Our implementations focus on converting graph operations into bounding volume hierarchy (BVH) construction and ray generation, the computational paradigms specific to ray tracing. We evaluate our RT-based methods on a wide range of real-world datasets. The results do not show an advantage of the RT-based methods over CUDA-based methods. We extend the experiments to a set-intersection workload on synthesized datasets, where the RT-based method shows superior performance when the skew ratio is high. By carefully comparing RT-based and CUDA-based binary search, we discover that RT cores are more efficient at searching for elements, but this comes with a constant, non-trivial overhead in the execution pipeline. Furthermore, for large datasets, the overhead of BVH construction is substantially higher than that of sorting on CUDA cores. Our case studies unveil several rules for adapting graph algorithms to ray-tracing cores that may benefit the future evolution of the emerging hardware toward general-computing tasks.
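  A CPU-side analogue may help convey the mapping the paper describes: one input set plays the role of the geometry (here, a sorted array standing in for BVH construction) and each element of the other set plays the role of a ray probing it. This is an assumption-level illustration of the idea only, not GPU or RT-core code.

```python
# CPU analogue of the RT-core set-intersection mapping described above.
# Building the sorted array stands in for BVH construction; each point
# query stands in for casting a ray and testing for a hit.
import bisect

def set_intersection(a, b):
    geometry = sorted(a)                      # stand-in for BVH construction
    hits = []
    for x in b:                               # one "ray" per query element
        i = bisect.bisect_left(geometry, x)
        if i < len(geometry) and geometry[i] == x:
            hits.append(x)
    return hits

print(set_intersection([3, 1, 4, 1, 5], [5, 2, 3]))  # -> [5, 3]
```

The skew-ratio finding fits this picture: when one set is much larger than the other, the one-time "construction" cost is amortized over many cheap probes.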
- Background: Many public health departments use record linkage between surveillance data and external data sources to inform public health interventions. However, little guidance is available to inform these activities, and many health departments rely on deterministic algorithms that may miss many true matches. In the context of public health action, these missed matches lead to missed opportunities to deliver interventions and may exacerbate existing health inequities.
  Objective: This study aimed to compare the performance of record linkage algorithms commonly used in public health practice.
  Methods: We compared five deterministic (exact, Stenger, Ocampo 1, Ocampo 2, and Bosh) and two probabilistic record linkage algorithms (fastLink and beta record linkage [BRL]) using simulations and a real-world scenario. We simulated pairs of datasets with varying numbers of errors per record and varying numbers of matching records between the two datasets (ie, overlap). We matched the datasets using each algorithm and calculated their recall (ie, sensitivity; the proportion of true matches identified by the algorithm) and precision (ie, positive predictive value; the proportion of matches identified by the algorithm that were true matches). We estimated the average computation time by performing a match with each algorithm 20 times while varying the size of the datasets being matched. In the real-world scenario, HIV and sexually transmitted disease surveillance data from King County, Washington, were matched to identify people living with HIV who had a syphilis diagnosis in 2017. We calculated the recall and precision of each algorithm compared with a composite standard based on the agreement in matching decisions across all the algorithms and manual review.
  Results: In simulations, BRL and fastLink maintained high recall at nearly all data quality levels while remaining comparable with deterministic algorithms in terms of precision. Deterministic algorithms typically failed to identify matches in scenarios with low data quality. All the deterministic algorithms had a shorter average computation time than the probabilistic algorithms. BRL had the slowest overall computation time (14 min when both datasets contained 2000 records). In the real-world scenario, BRL had the best trade-off between recall (309/309, 100.0%) and precision (309/312, 99.0%).
  Conclusions: Probabilistic record linkage algorithms maximize the number of true matches identified, reducing gaps in the coverage of interventions and maximizing the reach of public health action.
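  The recall gap between deterministic and probabilistic linkage can be illustrated with a toy Fellegi-Sunter-style agreement score. The field weights, threshold, and records below are invented for illustration; they are not how fastLink or BRL are implemented.

```python
# Toy contrast: an exact deterministic rule vs. a probabilistic
# (Fellegi-Sunter-style) score. All weights and data are made up.
import math

def deterministic_match(r1, r2):
    return r1["name"] == r2["name"] and r1["dob"] == r2["dob"]

# log(m/u) agreement weight per field, where m = P(agree | match) and
# u = P(agree | non-match); the m and u values here are assumptions.
WEIGHTS = {"name": math.log(0.9 / 0.01), "dob": math.log(0.95 / 0.001)}

def probabilistic_match(r1, r2, threshold=4.0):
    score = sum(w for f, w in WEIGHTS.items() if r1[f] == r2[f])
    return score >= threshold

a = {"name": "jon smith",  "dob": "1980-01-02"}
b = {"name": "john smith", "dob": "1980-01-02"}   # typo in the name field
print(deterministic_match(a, b))   # False: the exact rule misses the pair
print(probabilistic_match(a, b))   # True: dob agreement alone scores ~6.9
```

This is the mechanism behind the study's finding: a single data-entry error defeats an exact rule, while a weighted score can still clear the match threshold.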
- Objectives: This study introduces MetaBIDx, a computational method designed to enhance species prediction in metagenomic environments. The method addresses the challenge of accurately identifying species in complex microbiomes, a challenge driven by the large number of generated reads and the ever-expanding number of bacterial genomes. Bacterial identification is essential for diagnosing disease and tracing outbreaks associated with microbial infections.
  Methods: MetaBIDx utilizes a modified Bloom filter for efficient indexing of reference genomes and incorporates a novel strategy for reducing false positives by clustering species based on the genomic coverage implied by the identified reads. The approach was evaluated and compared with several well-established tools across various datasets. Precision, recall, and F1-score were used to quantify the accuracy of species prediction.
  Results: MetaBIDx demonstrated superior performance compared with other tools, especially in terms of precision and F1-score. Clustering based on approximate coverages significantly improved precision in species identification, effectively minimizing false positives. We further demonstrated that other methods can also benefit from our approach of removing false positives by clustering species based on approximate coverages.
  Conclusion: With a novel approach to reducing false positives and the effective use of a modified Bloom filter to index species, MetaBIDx represents an advancement in metagenomic analysis. The findings suggest that the proposed approach could also benefit other metagenomic tools, indicating its potential for broader application in the field. The study lays the groundwork for future improvements in computational efficiency and the expansion of microbial databases.
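  For context, a plain Bloom filter works as sketched below: k hash functions set k bits per inserted item, so membership queries can yield false positives but never false negatives. MetaBIDx uses a modified variant for indexing reference genomes, so this generic version is only an assumption-level stand-in.

```python
# Generic Bloom filter sketch; not MetaBIDx's modified variant.
import hashlib

class BloomFilter:
    def __init__(self, size=10_000, hashes=3):
        self.size, self.hashes = size, hashes
        self.bits = bytearray(size)  # all bits start cleared

    def _positions(self, item):
        # Derive `hashes` independent positions from salted SHA-256 digests.
        for i in range(self.hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item):
        # May report false positives, never false negatives.
        return all(self.bits[pos] for pos in self._positions(item))

bf = BloomFilter()
bf.add("ACGTACGT")        # e.g., index a k-mer from a reference genome
print("ACGTACGT" in bf)   # True
print("TTTTAAAA" in bf)   # almost certainly False
```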