Navigating the Landscape of Reproducible Research: A Predictive Modeling Approach

Akella, Akhil Pandey; Choudhury, Sagnik Ray; Koop, David; Alhoori, Hamed

doi:10.1145/3627673.3679831

Citation Details

Navigating the Landscape of Reproducible Research: A Predictive Modeling Approach

The reproducibility of scientific articles is central to the advancement of science. Despite this importance, evaluating reproducibility remains challenging due to the scarcity of ground truth data. Predictive models can address this limitation by streamlining the tedious evaluation process. Typically, a paper’s reproducibility is inferred based on the availability of artifacts such as code, data, or supplemental information, often without extensive empirical investigation. To address these issues, we utilized artifacts of papers as fundamental units to develop a novel, dual-spectrum framework that focuses on author-centric and external-agent perspectives. We used the author-centric spectrum, followed by the external-agent spectrum, to guide a structured, model-based approach to quantify and assess reproducibility. We explored the interdependencies between different factors influencing reproducibility and found that linguistic features such as readability and lexical diversity are strongly correlated with papers achieving the highest statuses on both spectrums. Our work provides a model-driven pathway for evaluating the reproducibility of scientific research. more »

Award ID(s):: 2022443

PAR ID:: 10552268

Author(s) / Creator(s):: Akella, Akhil Pandey; Choudhury, Sagnik Ray; Koop, David; Alhoori, Hamed

Publisher / Repository:: ACM CIKM

Date Published:: 2024-10-21

ISBN:: 9798400704369

Page Range / eLocation ID:: 24 to 33

Subject(s) / Keyword(s):: Reproducibility, Scientific Data, Science of Science

Format(s):: Medium: X

Location:: Boise ID USA

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3627673.3679831

More Like this