Title: Identifying Redundancies in Fork-based Development
Fork-based development is popular and easy to use, but it becomes difficult to maintain an overview of the whole community as the number of forks increases. This may lead to redundant development, where multiple developers solve the same problem in parallel without being aware of each other. Redundant development wastes effort for both maintainers and developers. In this paper, we designed an approach to identify redundant code changes in forks as early as possible by extracting clues indicating similarities between code changes and building a machine learning model to predict redundancies. We evaluated its effectiveness from both the maintainer's and the developer's perspectives. The results show that we achieve 57-83% precision for detecting duplicate code changes from the maintainer's perspective, and that we could save developers an average of 1.9-3.0 commits of effort. We also show that our approach significantly outperforms the existing state of the art.
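As a rough illustration of this pipeline, the sketch below pairs two hypothetical similarity clues (overlapping files and overlapping change tokens) with an off-the-shelf classifier. The feature set, input layout, and model choice here are illustrative assumptions, not the paper's actual design.

```python
# Illustrative sketch: score how similar two code changes are,
# then train a classifier on labeled duplicate/non-duplicate pairs.
from sklearn.ensemble import RandomForestClassifier

def change_features(change_a, change_b):
    """Compute simple similarity clues between two code changes.

    Each change is assumed (hypothetically) to be a dict with keys
    'files' (set of touched file paths) and 'tokens' (set of tokens
    from added/deleted lines).
    """
    def jaccard(x, y):
        return len(x & y) / len(x | y) if x | y else 0.0
    return [
        jaccard(change_a["files"], change_b["files"]),    # overlapping files
        jaccard(change_a["tokens"], change_b["tokens"]),  # overlapping code tokens
    ]

def train_redundancy_model(pairs, labels):
    """pairs: (change_a, change_b) tuples; labels: 1 = redundant pair."""
    X = [change_features(a, b) for a, b in pairs]
    return RandomForestClassifier(n_estimators=100).fit(X, labels)
```

A trained model of this shape could then be applied to every new pull request against the open changes in other forks, flagging likely duplicates for the maintainer to triage early.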
Award ID(s):
1813598
NSF-PAR ID:
10109925
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER)
Page Range / eLocation ID:
230 to 241
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Cryptocurrencies are a significant development in recent years, featuring in global news, the financial sector, and academic research. They also hold a significant presence in open source development, comprising some of the most popular repositories on GitHub. Their openly developed software artifacts thus present a unique and exclusive avenue to quantitatively observe human activity, effort, and software growth for cryptocurrencies. Our data set marks the first concentrated effort toward high-fidelity panel data of cryptocurrency development for a wide range of metrics. The data set is foremost a quantitative measure of developer activity for budding open source cryptocurrency development. We collect metrics like daily commits, contributors, lines of code changed, stars, forks, and subscribers. We also include financial data for each cryptocurrency: the daily price and market capitalization. The data set includes data for 236 cryptocurrencies over 380 days (roughly January 2018 to January 2019). We discuss particularly interesting research opportunities for this combination of data, and release new tooling to enable continued data collection as the development and application of cryptocurrencies mature.
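A minimal sketch of working with such panel data follows, joining the development metrics with the financial series; the file and column names here are hypothetical, not the data set's actual schema.

```python
# Join daily development activity with daily market data per coin.
import pandas as pd

activity = pd.read_csv("dev_activity.csv",   # daily commits, contributors, ...
                       parse_dates=["date"])
prices = pd.read_csv("market_data.csv",      # daily price, market cap
                     parse_dates=["date"])

# One row per (coin, day): development activity alongside market data.
panel = activity.merge(prices, on=["coin", "date"], how="inner")

# Example question: do commit counts track price within a coin?
print(panel.groupby("coin")[["commits", "price"]].corr())
```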
  2. When developers make changes to their code, they typically run regression tests to detect if their recent changes (re)introduce any bugs. However, many tests are flaky, and their outcomes can change non-deterministically, failing without apparent cause. Flaky tests are a significant nuisance in the development process, since they make it more difficult for developers to trust the outcome of their tests. The traditional approach to identify flaky tests is to rerun them multiple times: if a test is observed both passing and failing on the same code, it is definitely flaky. We conducted a very large empirical study looking for flaky tests by rerunning the test suites of 24 projects 10,000 times each, and found that even with this many reruns, some flaky tests were still not detected. We propose FlakeFlagger, a novel approach that collects a set of features describing the behavior of each test, and then predicts tests that are likely to be flaky based on similar behavioral features. We found that FlakeFlagger correctly labeled at least as many tests as flaky as a state-of-the-art flaky test classifier, but that FlakeFlagger reported far fewer false positives (an increase in precision from just 11% to 60%). This lower false positive rate translates directly to saved time for researchers and developers who use the classification result to guide more expensive flaky test detection processes. By investigating the information gain of each feature, we conclude that test execution time, overall test coverage, coverage of recently changed lines and usage of third party libraries are effective predictors of test flakiness. We did not find any keywords or tokens in the source code of tests that were effective in predicting test flakiness, and did not find the presence of test smells to be effective in predicting test flakiness.
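As a rough sketch of the feature-informativeness analysis described above, the snippet below scores a few behavioral features by mutual information (an information-gain measure) against the flaky label; the feature names and toy data are illustrative, not FlakeFlagger's full feature set.

```python
# Rank behavioral test features by how informative they are of flakiness.
import pandas as pd
from sklearn.feature_selection import mutual_info_classif

df = pd.DataFrame({
    "exec_time_ms":    [120, 4500, 80, 3900, 150, 5100, 95, 4200],  # execution time
    "coverage_pct":    [5.0, 60.0, 4.0, 55.0, 6.5, 58.0, 3.0, 62.0],  # overall coverage
    "uses_3rd_party":  [0, 1, 0, 1, 0, 1, 0, 1],  # third-party library usage
    "is_flaky":        [0, 1, 0, 1, 0, 1, 0, 1],  # label derived from reruns
})

X, y = df.drop(columns="is_flaky"), df["is_flaky"]
gain = mutual_info_classif(X, y, random_state=0)
for name, g in zip(X.columns, gain):
    print(f"{name}: {g:.3f}")
```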

    This archive contains the dataset of flaky tests that we collected, along with the features that we collected from each test.

    Contents:
    Project_Info.csv: list of the projects and the revisions studied.
    build-logs-<project-slug>.tgz: an archive of all of the Maven build logs from each of the 10,000 runs of that project's test suite.
    failing-test-reports-<project-slug>.tgz: an archive of all of the Surefire XML reports for each failing test of each build of each project.
    test_results.csv: summary of the number of passing and failing runs for each test in each project. "Run ID" is a key into the <project-slug>.tgz archive also in this artifact, referring to the run on which we observed the test fail.
    test_features.csv: summary of the features of each test, as produced by the feature detectors described in the paper.
    flakeflagger-code.zip: all scripts used to generate and process these results. These scripts are also located at https://github.com/AlshammariA/FlakeFlagger
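As a small usage sketch, the snippet below derives the rerun-based flaky labels from test_results.csv; the column names are assumptions about the summary file's layout, not its documented schema.

```python
# Flag tests observed both passing and failing on the same code.
import pandas as pd

results = pd.read_csv("test_results.csv")

# A test is definitely flaky if, across the 10,000 reruns of the same
# code, it was observed both passing and failing.
flaky = results[(results["num_passing"] > 0) & (results["num_failing"] > 0)]
print(f"{len(flaky)} of {len(results)} tests observed flaky")
```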
  3. Navigating webpages with screen readers is a challenge even with recent improvements in screen reader technologies and the increased adoption of web standards for accessibility, namely ARIA. ARIA landmarks, an important aspect of ARIA, let screen reader users access different sections of a webpage quickly by enabling them to skip over blocks of irrelevant or redundant content. However, these landmarks are sporadically and inconsistently used by web developers, and are entirely absent from numerous web pages. Therefore, we propose SaIL, a scalable approach that automatically detects the important sections of a web page and then injects ARIA landmarks into the corresponding HTML markup to facilitate quick access to these sections. The central concept underlying SaIL is visual saliency, which is determined using a state-of-the-art deep learning model that was trained on gaze-tracking data collected from sighted users in the context of web browsing. We present the findings of a pilot study that demonstrated the potential of SaIL in reducing both the time and effort spent in navigating webpages with screen readers.
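An illustrative sketch of the injection step alone (not SaIL's implementation) follows: given sections already ranked by an upstream saliency model, it adds ARIA landmark roles to the matching elements. The selector input and labels are hypothetical.

```python
# Inject ARIA landmarks into the sections a saliency model flagged.
from bs4 import BeautifulSoup

def inject_landmarks(html, salient_selectors):
    """salient_selectors: CSS selectors of important sections,
    assumed to come from a separate saliency model."""
    soup = BeautifulSoup(html, "html.parser")
    for i, selector in enumerate(salient_selectors, start=1):
        for node in soup.select(selector):
            node["role"] = "region"                      # generic ARIA landmark
            node["aria-label"] = f"Important section {i}"  # announced by screen readers
    return str(soup)

html = '<div id="news"><h2>Top stories</h2></div>'
print(inject_landmarks(html, ["#news"]))
```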
  4. We are now over four decades into digitally managing the names of Earth's species. As the number of federating (i.e., software that brings together previously disparate projects under a common infrastructure, for example TaxonWorks) and aggregating (e.g., International Plant Name Index, Catalog of Life (CoL)) efforts increases, there remains an unmet need both for migrating old data forward and for producing new, precise, and comprehensive nomenclatural catalogs. Given this context, we provide an overview of how TaxonWorks seeks to contribute to this effort, and where it might evolve in the future.

    In TaxonWorks, when we talk about governed names and relationships, we mean it in the sense of existing international codes of nomenclature (e.g., the International Code of Zoological Nomenclature (ICZN)). More technically, nomenclature is defined as a set of objective assertions that describe the relationships between the names given to biological taxa and the rules that determine how those names are governed. It is critical to note that this is not the same thing as the relationship between a name and a biological entity; rather, nomenclature in TaxonWorks represents the details of the (governed) relationships between names. Rather than thinking of nomenclature as changing (a verb commonly used to express frustration with biological nomenclature), it is useful to think of nomenclature as a set of data points that grows over time. For example, when synonymy happens, we do not erase the past, but rather record a new context for the name(s) in question. The biological concept changes, but the nomenclature (names) simply keeps adding up.

    Behind the scenes, nomenclature in TaxonWorks is represented by a set of nodes and edges, i.e., a mathematical graph, or network (e.g., Fig. 1). Most names (i.e., nodes in the network) are what TaxonWorks calls "protonyms": monomial epithets that are used to construct, for example, binomial names (not to be confused with "protonym" sensu the ICZN). Protonyms are linked to other protonyms via relationships defined in NOMEN, an ontology that encodes the governed rules of nomenclature. Within the system, all data, nodes and edges, can be cited, i.e., linked to a source and therefore anchored in time and tied to authorship, and annotated with a variety of annotation types (e.g., notes, confidence levels, tags). The actual building of the graphs is greatly simplified by multiple user interfaces that allow scientists to review (e.g., Fig. 2), create, filter, and add to (again, not "change") the nomenclatural history.

    As in any complex knowledge-representation model, outlying scenarios, or edge cases, emerge, making certain human tasks more complex than others. TaxonWorks is no exception; it has limitations in terms of what and how some things can be represented. While many complex representations are hidden by simplified user interfaces, some, for example the handling of the ICZN's Family-group names, batch-loading of invalid relationships, and comparative syncing against external resources, need more work to simplify the processes presently required to meet catalogers' needs.

    The depth at which TaxonWorks can capture nomenclature is only really valuable if it can be used by others. This is facilitated by the application programming interface (API) that serves its data (https://api.taxonworks.org), by text-file exports, and by exports to standards like the emerging Catalog of Life Data Package. With reference to real-world problems, we illustrate different ways in which the API can be used: for example, integrated into spreadsheets, driven by command-line scripts, and used to generate public-facing websites.

    Behind all this effort are an increasing number of people recording help videos, developing documentation, and troubleshooting software and technical issues. Major contributions have come from developers at many skill levels, from high school students to senior software engineers, illustrating that TaxonWorks leads in enabling both technical and domain-based contributions. The health and growth of this community is a key factor in TaxonWorks' potential long-term impact in the effort to unify the names of Earth's species.
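A toy sketch of the nodes-and-edges idea described above follows; the relationship labels are loosely modeled on NOMEN-style assertions and are not TaxonWorks' actual schema.

```python
# Represent nomenclature as a growing, citable graph of protonyms.
import networkx as nx

nomenclature = nx.DiGraph()

# Protonyms are nodes; each assertion carries its source, so every
# data point stays anchored in time and tied to authorship.
nomenclature.add_node("Aus bus", source="Smith 1900")
nomenclature.add_node("Aus cus", source="Jones 1950")

# Synonymy does not erase the past: it is recorded as a new, cited
# edge between the two names, and the set of data points grows.
nomenclature.add_edge("Aus cus", "Aus bus",
                      relationship="junior_synonym_of",  # hypothetical label
                      source="Brown 2010")

for subject, obj, data in nomenclature.edges(data=True):
    print(f"{subject} --{data['relationship']}--> {obj} ({data['source']})")
```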
  5. Delinquent branches and loads remain key performance limiters in some applications. One approach to mitigate them is pre-execution. Broadly, there are two classes of pre-execution: one class repeatedly forks small helper threads, each targeting an individual dynamic instance of a delinquent branch or load; the other class begins with two redundant threads in a leader-follower arrangement, and speculatively reduces the leading thread. The objective of this paper is to design a new pre-execution microarchitecture that meets four criteria: (i) retains the simpler coordination of a leader-follower microarchitecture, (ii) is fully automated with just hardware, (iii) targets both branches and loads, and (iv) is effective. We review prior pre-execution proposals and show that none of them meet all four criteria. We develop Slipstream 2.0 to meet all four criteria. The key innovation in the space of leader-follower architectures is to remove the forward control-flow slices of delinquent branches and loads from the leading thread. This innovation overcomes key limitations in the only other hardware-only leader-follower prior works: Slipstream and Dual Core Execution (DCE). Slipstream removes backward slices of confident branches to pre-execute unconfident branches, which is ineffective in phases dominated by unconfident branches, precisely when branch pre-execution is most needed. DCE is very effective at tolerating cache-missed loads, unless their dependent branches are mispredicted. Removing forward control-flow slices of delinquent branches and delinquent loads enables two firsts, respectively: (1) leader-follower-style branch pre-execution without relying on confident instruction removal, and (2) tolerance of cache-missed loads that feed mispredicted branches. For SPEC 2006/2017 SimPoints wherein Slipstream 2.0 is auto-enabled, it achieves geomean speedups of 67% over the baseline (one core), 60% over Slipstream, and 12% over DCE.
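To make the slice-removal idea concrete in software terms, here is a deliberately toy sketch (the real mechanism is implemented in hardware): given a dynamic trace annotated with control dependences, it collects the forward control-flow slice of a delinquent branch and drops it from the leading thread.

```python
# Toy model: remove the forward control-flow slice of a delinquent
# branch from the leading thread's trace (illustrative only).
from collections import deque

# Each instruction: (id, id of the instruction it is control-dependent on, or None)
trace = [
    (0, None),  # delinquent branch
    (1, 0),     # control-dependent on the branch
    (2, 1),     # transitively control-dependent
    (3, None),  # independent work the leading thread keeps
]

def forward_control_slice(trace, root):
    """Collect root plus everything transitively control-dependent on it."""
    dependents = {}
    for ins, dep in trace:
        dependents.setdefault(dep, []).append(ins)
    slice_, work = {root}, deque([root])
    while work:
        for child in dependents.get(work.popleft(), []):
            slice_.add(child)
            work.append(child)
    return slice_

removed = forward_control_slice(trace, root=0)
leading_thread = [ins for ins, _ in trace if ins not in removed]
print(leading_thread)  # -> [3]; the trailing (follower) thread still executes everything
```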