

Title: The unsolvable problem or the unheard answer?: a dataset of 24,669 open-source software conference talks
Talks at practitioner-focused open-source software conferences are a valuable source of information for software engineering researchers. They provide a pulse of the community and serve as rich source material for grey literature analysis. We curated a dataset of 24,669 talks from 87 open-source conferences held between 2010 and 2021. We stored all relevant metadata from these conferences and provide scripts to collect the transcripts. We believe this data is useful for answering many kinds of questions, such as: What are the important and highly discussed topics within practitioner communities? How do practitioners interact? And how do they present themselves to the public? We demonstrate the usefulness of this data by reporting our findings from two small studies: a topic model analysis providing an overview of open-source community dynamics since 2011, and a qualitative analysis of a smaller community-oriented sample within our dataset to better understand why contributors leave open-source software.
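To give a concrete flavor of the first study mentioned above, here is a minimal, hedged sketch of a topic-model pass over talk transcripts using scikit-learn; the toy transcripts, parameter choices, and library choice are our assumptions, not the paper's actual pipeline.

    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.decomposition import LatentDirichletAllocation

    # Toy stand-ins for transcripts collected with the dataset's scripts.
    transcripts = [
        "welcome everyone today we discuss kubernetes operators and cluster upgrades",
        "maintaining an open source project governance burnout and contributor onboarding",
    ]

    # Bag-of-words counts, then LDA to surface recurring community topics.
    vectorizer = CountVectorizer(stop_words="english", max_features=5000)
    counts = vectorizer.fit_transform(transcripts)
    lda = LatentDirichletAllocation(n_components=10, random_state=0).fit(counts)

    # Top words per topic give a quick overview of what communities discuss.
    terms = vectorizer.get_feature_names_out()
    for k, weights in enumerate(lda.components_):
        top = [terms[i] for i in weights.argsort()[-8:][::-1]]
        print(f"topic {k}: {', '.join(top)}")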
Award ID(s):
2150217
NSF-PAR ID:
10392544
Author(s) / Creator(s):
; ; ;
Date Published:
Journal Name:
MSR '22: Proceedings of the 19th International Conference on Mining Software Repositories
Page Range / eLocation ID:
348 to 352
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like This
  1. Obeid, Iyad; Picone, Joseph; Selesnick, Ivan (Eds.)
    The Neural Engineering Data Consortium (NEDC) is developing a large open-source database of high-resolution digital pathology images known as the Temple University Digital Pathology Corpus (TUDP) [1]. Our long-term goal is to release one million images. We expect to release the first 100,000-image corpus by December 2020. The data is being acquired at the Department of Pathology at Temple University Hospital (TUH) using a Leica Biosystems Aperio AT2 scanner [2] and consists entirely of clinical pathology images. More information about the data and the project can be found in Shawki et al. [3]. We currently have a National Science Foundation (NSF) planning grant [4] to explore how the community can best leverage this resource. One goal of this poster presentation is to stimulate community-wide discussion about this project and determine how this valuable resource can best meet the needs of the public.

The computing infrastructure required to support this database is extensive [5] and includes two HIPAA-secure computer networks, dual petabyte file servers, and Aperio's eSlide Manager (eSM) software [6]. We have currently digitized over 50,000 slides from 2,846 patients and 2,942 clinical cases, with an average of 12.4 slides per patient, 10.5 slides per case, and one report per case. The data is organized by tissue type as shown below:

Filenames:
tudp/v1.0.0/svs/gastro/000001/00123456/2015_03_05/0s15_12345/0s15_12345_0a001_00123456_lvl0001_s000.svs
tudp/v1.0.0/svs/gastro/000001/00123456/2015_03_05/0s15_12345/0s15_12345_00123456.docx

Explanation:
- tudp: root directory of the corpus
- v1.0.0: version number of the release
- svs: the image data type
- gastro: the type of tissue
- 000001: six-digit sequence number used to control directory complexity
- 00123456: eight-digit patient MRN
- 2015_03_05: the date the specimen was captured
- 0s15_12345: the clinical case name
- 0s15_12345_0a001_00123456_lvl0001_s000.svs: the image filename, consisting of a repeat of the case name, a site code (e.g., 0a001), the type and depth of the cut (e.g., lvl0001), and a token number (e.g., s000)
- 0s15_12345_00123456.docx: the filename of the corresponding case report

(A short Python sketch unpacking these fields appears below.)

We currently recognize fifteen tissue types in the first installment of the corpus. The raw image data is stored in Aperio's ".svs" format, which is a multi-layered compressed JPEG format [3,7]. Pathology reports containing a summary of how a pathologist interpreted the slide are also provided in a flat text file format. A more complete summary of the demographics of this pilot corpus will be presented at the conference.

Another goal of this poster presentation is to share our experiences with the larger community, since many of these details have not been adequately documented in scientific publications. There are quite a few obstacles in collecting this data that have slowed the process and need to be discussed publicly. Our backlog of slides dates back to 1997, meaning many must be sifted through and discarded due to peeling or cracking. Additionally, a slide can get stuck during scanning, stalling a scan session for hours and resulting in a significant loss of productivity. Over the past two years, we have accumulated significant experience with how to scan a diverse inventory of slides using the Aperio AT2 high-volume scanner, and we have been working closely with the vendor to resolve many problems associated with using this scanner for research purposes.
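To make the directory convention above concrete, the following minimal Python sketch (our illustration, not an NEDC tool; the field labels are our own) unpacks the fields of a TUDP image path:

    import os

    def parse_tudp_image_path(path):
        """Unpack a TUDP image path of the form described above:
        tudp/<version>/svs/<tissue>/<seq>/<mrn>/<date>/<case>/<filename>.svs
        Returns a dict of the fields; raises ValueError if the layout differs.
        """
        parts = path.strip("/").split("/")
        if len(parts) != 9 or parts[0] != "tudp" or parts[2] != "svs":
            raise ValueError("not a TUDP image path: %s" % path)
        _, version, _, tissue, seq, mrn, date, case, fname = parts
        stem, ext = os.path.splitext(fname)
        if ext != ".svs":
            raise ValueError("expected an image (.svs) path")
        # e.g., 0s15_12345_0a001_00123456_lvl0001_s000: the case name
        # spans the first two underscore-separated tokens.
        tokens = stem.split("_")
        return {
            "version": version, "tissue": tissue, "sequence": seq,
            "patient_mrn": mrn, "scan_date": date, "case": case,
            "site_code": tokens[2], "cut_level": tokens[4], "token": tokens[5],
        }

    print(parse_tudp_image_path(
        "tudp/v1.0.0/svs/gastro/000001/00123456/2015_03_05/"
        "0s15_12345/0s15_12345_0a001_00123456_lvl0001_s000.svs"))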
This scanning project began in January of 2018, when the scanner was first installed. The scanning process was slow at first, since there was a learning curve in how the scanner worked and how to obtain samples from the hospital. From the start date until May of 2019, roughly 20,000 slides were scanned. In the six months from May to November, we tripled that number and now hold roughly 60,000 slides in our database. This dramatic increase in productivity was due to additional undergraduate staff members and an emphasis on efficient workflow.

The Aperio AT2 scans 400 slides a day, requiring at least eight hours of scan time, and the efficiency of these scans can vary greatly. When our team first started, approximately 5% of slides failed the scanning process due to focal point errors. We have been able to reduce that to 1% through a variety of means: (1) best practices regarding daily and monthly recalibrations, (2) tweaking software settings such as the tissue finder parameters, and (3) experience with how to clean and prep slides so they scan properly. Nevertheless, this is not a completely automated process, making it very difficult to reach our production targets. With a staff of three undergraduate workers spending a total of 30 hours per week, we find it difficult to scan more than 2,000 slides per week using a single scanner (400 slides per night x 5 nights per week). The main limitation in achieving this level of production is the lack of a completely automated scanning process; it takes a couple of hours to sort, clean, and load slides. We have streamlined all other aspects of the workflow required to database the scanned slides so that there are no additional bottlenecks.

To bridge the gap between hospital operations and research, we are using Aperio's eSM software. Our goal is to provide pathologists access to high-quality digital images of their patients' slides. eSM is a secure website that holds the images with their metadata labels, the patient report, and the path to where the image is located on our file server. Although eSM includes significant infrastructure for importing slides into the database using barcodes, TUH does not currently support barcode use. Therefore, we manage the data using a mixture of Python scripts and the manual import functions available in eSM. The database and associated tools are based on proprietary formats developed by Aperio, making this another important point of community-wide discussion on how best to disseminate such information.

Our near-term goal for the TUDP Corpus is to release 100,000 slides by December 2020. We hope to continue data collection over the next decade until we reach one million slides. We are creating two pilot corpora using the first 50,000 slides we have collected. The first corpus consists of 500 slides with a marker stain and another 500 without it. This set was designed to let people debug their basic deep learning processing flow on these high-resolution images. We discuss our preliminary experiments on this corpus and the challenges in processing these high-resolution images using deep learning in [3]. We are able to achieve a mean sensitivity of 99.0% for slides with pen marks, and 98.9% for slides without marks, using a multistage deep learning algorithm. While this dataset was very useful in initial debugging, we are in the midst of creating a new, more challenging pilot corpus using actual tissue samples annotated by experts. The task will be to detect ductal carcinoma in situ (DCIS) or invasive breast cancer tissue.
There will be approximately 1,000 images per class in this corpus. Based on the number of features annotated, we can train on a two-class problem of DCIS versus benign, or increase the difficulty by expanding the classes to include DCIS, benign, stroma, pink tissue, non-neoplastic, etc. Those interested in the corpus or in participating in community-wide discussions should join our listserv, nedc_tuh_dpath@googlegroups.com, to be kept informed of the latest developments in this project. You can learn more from our project website: https://www.isip.piconepress.com/projects/nsf_dpath.
  2. Obeid, I.; Selesnick, I. (Eds.)
    The Neural Engineering Data Consortium at Temple University has been providing key data resources to support the development of deep learning technology for electroencephalography (EEG) applications [1-4] since 2012. We currently have over 1,700 subscribers to our resources and have been providing data, software, and documentation from our web site [5] since 2012. In this poster, we introduce additions to our resources that have been developed within the past year to facilitate software development and big data machine learning research.

Major resources released in 2019 include:
● Data: The most current release of our open source EEG data is v1.2.0 of TUH EEG, which adds 3,874 sessions and 1,960 patients from mid-2015 through 2016.
● Software: We have recently released a package, PyStream, that demonstrates how to correctly read an EDF file and access samples of the signal. This software demonstrates how to properly decode channels based on their labels and how to implement montages. Most existing open source packages for reading EDF files do not directly address the problem of channel labels [6].
● Documentation: We have released two documents that describe our file formats and data representations: (1) electrodes and channels [6], which describes how to map channel labels to the physical locations of the electrodes and includes a description of every channel label appearing in the corpus; and (2) annotation standards [7], which describes our annotation file format and how to decode the data structures used to represent the annotations.

Additional significant updates to our resources include:
● NEDC TUH EEG Seizure (v1.6.0): This release expands the training dataset from 4,597 files to 4,702. Calibration sequences have been manually annotated and added to our existing documentation, and numerous corrections were made to existing annotations based on user feedback.
● IBM TUSZ Pre-Processed Data (v1.0.0): A preprocessed version of the TUH Seizure Detection Corpus using two methods [8], both of which use a sliding-window FFT (STFT) approach. In the first method, FFT log magnitudes are used. In the second method, the FFT values are normalized across frequency buckets and correlation coefficients are calculated; the eigenvalues and the upper triangle of the resulting correlation matrix are used to generate the features (a short numpy sketch of this construction follows the abstract).
● NEDC TUH EEG Artifact Corpus (v1.0.0): This corpus was developed to support modeling of non-seizure signals for problems such as seizure detection, and we have been using the data to build better background models. Five artifact events have been labeled: (1) eye movements (EYEM), (2) chewing (CHEW), (3) shivering (SHIV), (4) electrode pop, electrostatic artifacts, and lead artifacts (ELPP), and (5) muscle artifacts (MUSC). The data is cross-referenced to TUH EEG v1.1.0 so that patient numbers, sessions, etc. can be matched.
● NEDC Eval EEG (v1.3.0): In this release of our standardized scoring software, the False Positive Rate (FPR) definition of the Time-Aligned Event Scoring (TAES) metric has been updated [9]. The standard definition is the number of false positives divided by the number of false positives plus the number of true negatives: #FP / (#FP + #TN).

We also recently introduced the ability to download our data from an anonymous rsync server. The rsync command [10] synchronizes a remote directory with a local directory, copying the selected folder from the server to the desktop.
It is available as part of most, if not all, Linux and Mac distributions (unfortunately, there is not an acceptable port of this command for Windows). To use the rsync command to download the content from our website, both a username and password are needed. An automated registration process on our website grants both. An example of a typical rsync command to access our data on our website is: rsync -auxv nedc_tuh_eeg@www.isip.piconepress.com:~/data/tuh_eeg/ Rsync is a more robust option for downloading data. We have also experimented with Google Drive and Dropbox, but these types of technology are not suitable for such large amounts of data. All of the resources described in this poster are open source and freely available at https://www.isip.piconepress.com/projects/tuh_eeg/downloads/. We will demonstrate how to access and utilize these resources during the poster presentation and collect community feedback on the most needed additions to enable significant advances in machine learning performance. 
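As a minimal sketch of automating the download (the local destination directory "tuh_eeg/" is our addition, since rsync requires a destination; it is not part of the command shown above), one can wrap the command in Python:

    import subprocess

    # Mirror the TUH EEG tree into a local directory. The credentials come
    # from the registration step described above; "tuh_eeg/" is an assumed
    # local destination, not part of the command shown in the abstract.
    subprocess.run(
        ["rsync", "-auxv",
         "nedc_tuh_eeg@www.isip.piconepress.com:~/data/tuh_eeg/",
         "tuh_eeg/"],
        check=True,
    )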
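The second IBM TUSZ preprocessing method is described compactly above; the following numpy sketch is one reading of it (the window length, non-overlapping windows, and the normalization axis are our assumptions, not the released pipeline):

    import numpy as np

    def correlation_features(signal, fs, win_sec=1.0):
        """Sketch of the second preprocessing method described above:
        sliding-window FFT, per-window normalization across frequency
        buckets, then the eigenvalues plus the upper triangle of the
        channel correlation matrix as the feature vector.
        `signal` is (n_channels, n_samples)."""
        win = int(fs * win_sec)
        feats = []
        for start in range(0, signal.shape[1] - win + 1, win):
            seg = signal[:, start:start + win]
            mag = np.abs(np.fft.rfft(seg, axis=1))              # FFT magnitudes per channel
            mag = (mag - mag.mean(axis=0)) / (mag.std(axis=0) + 1e-8)  # normalize per bucket
            corr = np.corrcoef(mag)                             # channel-by-channel correlation
            eigvals = np.linalg.eigvalsh(corr)                  # eigenvalues of correlation matrix
            upper = corr[np.triu_indices_from(corr, k=1)]       # upper triangle (off-diagonal)
            feats.append(np.concatenate([eigvals, upper]))
        return np.stack(feats)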
  3. Purpose: The ability to identify the scholarship of individual authors is essential for performance evaluation. A number of factors hinder this endeavor. Common and similarly spelled surnames make it difficult to isolate the scholarship of individual authors indexed in large databases, and variations in the spelling of individual scholars' names further complicate matters. Common family names in scientific powerhouses like China make it problematic to distinguish between authors possessing ubiquitous and/or anglicized surnames (as well as the same or similar first names). The assignment of unique author identifiers is a major step toward resolving these difficulties. We maintain, however, that author identifiers are not, in and of themselves, sufficient to fully address the author uncertainty problem. In this study we build on the author identifier approach by considering commonalities in fielded data between authors sharing the same surname and first initial. We illustrate our approach using three case studies.

Design/methodology/approach: The approach we advance in this study is based on commonalities among fielded data in search results. We cast a broad initial net: a Web of Science (WOS) search for a given author's last name, followed by a comma, followed by the first initial of his or her first name (e.g., a search for 'John Doe' would assume the form 'Doe, J'). Results for this search typically contain all of the scholarship legitimately belonging to this author in the given database (i.e., all of his or her true positives), along with a large amount of noise, or scholarship not belonging to this author (i.e., a large number of false positives). From this corpus we proceed to iteratively weed out false positives and retain true positives. Author identifiers provide a good starting point; e.g., if 'Doe, J' and 'Doe, John' share the same author identifier, this is sufficient for us to conclude they are one and the same individual. We find email addresses similarly adequate; e.g., if two author names sharing the same surname and first initial have an email address in common, we conclude these authors are the same person. Author identifier and email address data are not always available, however. When this occurs, other fields are used to address the author uncertainty problem. Commonalities among author data other than unique identifiers and email addresses are less conclusive for name consolidation purposes. For example, if 'Doe, John' and 'Doe, J' have an affiliation in common, do we conclude that these names belong to the same person? They may or may not; an affiliation may employ two or more faculty members sharing the same surname and first initial. Similarly, it is conceivable that two individuals with the same last name and first initial publish in the same journal, publish with the same co-authors, and/or cite the same references. Should we then ignore commonalities among these fields and conclude they are too imprecise for name consolidation purposes? It is our position that such commonalities are indeed valuable for addressing the author uncertainty problem, but more so when used in combination. Our approach makes use of automation as well as manual inspection, relying initially on author identifiers, then on commonalities among fielded data other than author identifiers, and finally on manual verification.
To achieve name consolidation independent of author identifier matches, we developed a procedure for use with the bibliometric software VantagePoint (see www.thevantagepoint.com). While the application of our technique does not exclusively depend on VantagePoint, it is the software we found most efficient for this study. The script we developed implements our name disambiguation procedure in a way that significantly reduces manual effort on the user's part. Those who seek to replicate our procedure independent of VantagePoint can do so by manually following the method we outline, but we note that manual application takes a significant amount of time and effort, especially when working with larger datasets.

Our script begins by prompting the user for a surname and a first initial (for any author of interest). It then prompts the user to select a WOS field on which to consolidate author names. After this, the user is prompted to point to the authors field, and finally asked to identify a specific author name within this field (referred to by the script as the primary author) whom the user knows to be a true positive (a suggested approach is to point to an author name associated with one of the records that has the author's ORCID iD or email address attached to it). The script proceeds to identify and combine all author names sharing the primary author's surname and first initial that share commonalities in the selected WOS field. This typically results in a significant reduction in the initial dataset size. After the procedure completes, the user is usually left with a much smaller (and more manageable) dataset to inspect manually (and/or to apply additional name disambiguation techniques to).

Research limitations: Match field coverage can be an issue. When field coverage is paltry, dataset reduction is less substantial, which leaves more manual inspection for the user. Our procedure does not lend itself to scholars who have had a legal family name change (after marriage, for example). Moreover, the technique we advance is sometimes, though not always, likely to have a difficult time dealing with scholars who have changed careers or fields dramatically, as well as scholars whose work is highly interdisciplinary.

Practical implications: The procedure we advance can save a significant amount of time and effort for individuals engaged in name disambiguation research, especially when the name under consideration is a more common family name. It is more effective when match field coverage is high and a number of match fields exist.

Originality/value: The procedure combines preexisting approaches with more recent ones, harnessing the benefits of both.

Findings: Our study applies the name disambiguation procedure we advance to three case studies. Ideal match fields are not the same for each case study; we find that match field effectiveness is in large part a function of field coverage. The case studies also differ in original dataset size, timeframe analyzed, and the subject areas in which the authors publish. Our procedure is more effective when applied to our third case study, both in terms of list reduction and 100% retention of true positives.
We attribute this to excellent match field coverage, especially in the more specific match fields, as well as to a more modest and manageable number of publications. While machine learning is considered authoritative by many, we do not see it as practical or replicable. The procedure advanced herein is practical, replicable, and relatively user-friendly; it might be categorized into a space between ORCID and machine learning. Machine learning approaches typically look for commonalities among citation data, which is not always available, structured, or easy to work with. The procedure we advance is intended to be applied across numerous fields in a dataset of interest (e.g., emails, co-authors, affiliations), resulting in multiple rounds of reduction. Results indicate that effective match fields include author identifiers, emails, source titles, co-authors, and ISSNs. While the script we present is not likely to yield a dataset consisting solely of true positives (at least for more common surnames), it does significantly reduce manual effort on the user's part. Dataset reduction (after our procedure is applied) is in large part a function of (a) field availability and (b) field coverage.
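As an illustration of the iterative consolidation logic described above (our sketch, not the VantagePoint script; the record fields and values are invented for the example), two records are merged whenever they share a non-empty value in a match field:

    # Toy records: each is a dict of WOS-style fields (names are our own).
    records = [
        {"name": "Doe, J",    "email": "jdoe@uni.edu",   "orcid": None},
        {"name": "Doe, John", "email": "jdoe@uni.edu",   "orcid": "0000-0001"},
        {"name": "Doe, Jane", "email": "jane@other.org", "orcid": None},
    ]

    def consolidate(records, match_fields=("orcid", "email")):
        """Union-find over records: merge two records when they share a
        non-empty value in any match field (author identifier first, then
        email), mirroring the iterative reduction described above."""
        parent = list(range(len(records)))
        def find(i):
            while parent[i] != i:
                parent[i] = parent[parent[i]]
                i = parent[i]
            return i
        def union(i, j):
            parent[find(i)] = find(j)
        for field in match_fields:
            seen = {}
            for i, rec in enumerate(records):
                val = rec.get(field)
                if val:
                    if val in seen:
                        union(i, seen[val])
                    else:
                        seen[val] = i
        groups = {}
        for i in range(len(records)):
            groups.setdefault(find(i), []).append(records[i]["name"])
        return list(groups.values())

    print(consolidate(records))
    # -> [['Doe, J', 'Doe, John'], ['Doe, Jane']]  (shared email merges the first two)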
  4. Detecting and classifying solar filaments is critical in forecasting Earth-affecting transient solar events, including large solar flares and coronal mass ejections. Undetected, these space weather events can cause catastrophic geomagnetic storms, resulting in substantial economic damage and loss of life. A network of ground-based observatories, the Global Oscillation Network Group (GONG), was created to provide continuous observation of solar activity. However, parsing the large volume of image data continuously streaming in from GONG tests the limits of human analysis and presents challenges for space weather research. To address this, we proposed the Machine Learning Ecosystem for Filament Detection (MLEcoFi), an NSF-funded, multiyear project that will produce an open-source collection of filament data and computer vision software for space weather research. In cooperation with the National Solar Observatory (NSO), MLEcoFi will assist in automatically detecting, classifying, localizing, and segmenting solar filaments in full-disk H-alpha images. The present phase of research aims to produce a dataset of thousands of GONG H-alpha images, with each filament's chirality, bounding box, and segmentation mask manually annotated under strong quality assurance standards, advancing research on filaments and filament-related topics. This dataset, the first MLEcoFi product, will aid ongoing and future development of products, including a chirality-aware filament data augmentation engine, a high-precision image segmentation loss function, a deep neural network segmentation and classification model, and a filament detection module planned for deployment into NSO's live infrastructure for the research community. The MLEcoFi team is eager to present its progress and future milestones and to obtain valuable feedback from the future users of this ecosystem.
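To make the annotation targets concrete, here is a minimal sketch of what a per-filament record might look like; the field names and types are illustrative assumptions, not the MLEcoFi schema:

    from dataclasses import dataclass
    from typing import List, Tuple

    @dataclass
    class FilamentAnnotation:
        """One manually annotated filament in a full-disk H-alpha image.
        Field names and types are illustrative assumptions only."""
        image_id: str                    # source GONG H-alpha image
        chirality: str                   # e.g., "left" or "right"
        bbox: Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max) in pixels
        mask: List[Tuple[int, int]]      # segmentation mask as polygon vertices

    example = FilamentAnnotation(
        image_id="gong_halpha_20210101_0000",   # hypothetical identifier
        chirality="left",
        bbox=(512, 300, 640, 360),
        mask=[(512, 310), (600, 300), (640, 350), (530, 360)],
    )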
  5. We are now over four decades into digitally managing the names of Earth's species. As the number of federating efforts (i.e., software that brings together previously disparate projects under a common infrastructure, for example TaxonWorks) and aggregating efforts (e.g., International Plant Name Index, Catalog of Life (CoL)) increases, there remains an unmet need both for the migration forward of old data and for the production of new, precise, and comprehensive nomenclatural catalogs. Given this context, we provide an overview of how TaxonWorks seeks to contribute to this effort, and where it might evolve in the future.

In TaxonWorks, when we talk about governed names and relationships, we mean it in the sense of the existing international codes of nomenclature (e.g., the International Code of Zoological Nomenclature (ICZN)). More technically, nomenclature is defined as a set of objective assertions that describe the relationships between the names given to biological taxa and the rules that determine how those names are governed. It is critical to note that this is not the same thing as the relationship between a name and a biological entity; rather, nomenclature in TaxonWorks represents the details of the (governed) relationships between names. Rather than thinking of nomenclature as changing (a verb commonly used to express frustration with biological nomenclature), it is useful to think of nomenclature as a set of data points that grows over time. For example, when synonymy happens, we do not erase the past, but rather record a new context for the name(s) in question. The biological concept changes, but the nomenclature (names) simply keeps adding up.

Behind the scenes, nomenclature in TaxonWorks is represented by a set of nodes and edges, i.e., a mathematical graph, or network (e.g., Fig. 1). Most names (i.e., nodes in the network) are what TaxonWorks calls "protonyms": monomial epithets used to construct, for example, binomial names (not to be confused with "protonym" sensu the ICZN). Protonyms are linked to other protonyms via relationships defined in NOMEN, an ontology that encodes the governed rules of nomenclature. Within the system, all data, nodes and edges alike, can be cited, i.e., linked to a source and therefore anchored in time and tied to authorship, and can be annotated with a variety of annotation types (e.g., notes, confidence levels, tags). The actual building of the graphs is greatly simplified by multiple user interfaces that allow scientists to review (e.g., Fig. 2), create, filter, and add to (again, not "change") the nomenclatural history.

As in any complex knowledge-representation model, there are outlying scenarios, or edge cases, that emerge, making certain human tasks more complex than others. TaxonWorks is no exception; it has limitations in terms of what and how some things can be represented. While many complex representations are hidden by simplified user interfaces, some, for example the handling of the ICZN's family-group names, batch loading of invalid relationships, and comparative syncing against external resources, need more work to simplify the processes presently required to meet catalogers' needs.

The depth at which TaxonWorks can capture nomenclature is only really valuable if it can be used by others. This is facilitated by the application programming interface (API) serving its data (https://api.taxonworks.org), by serving text files, and by exports to standards like the emerging Catalog of Life Data Package.
With reference to real-world problems, we illustrate different ways in which the API can be used: for example, integrated into spreadsheets, driven by command-line scripts, and serving in the generation of public-facing websites. Behind all this effort is an increasing number of people recording help videos, developing documentation, and troubleshooting software and technical issues. Major contributions have come from developers at many skill levels, from high school students to senior software engineers, illustrating that TaxonWorks leads in enabling both technical and domain-based contributions. The health and growth of this community is a key factor in TaxonWorks's potential long-term impact on the effort to unify the names of Earth's species.
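As a minimal sketch of scripted access to the API mentioned above (the instance URL, endpoint path, parameter names, and token handling below are illustrative assumptions; consult https://api.taxonworks.org for the actual routes and authentication):

    import requests

    # Hypothetical TaxonWorks instance URL; replace with a real deployment.
    BASE = "https://example-taxonworks-instance.org"

    # Assumed endpoint and parameters: query taxon names by string match.
    resp = requests.get(
        BASE + "/api/v1/taxon_names",
        params={"name": "Aus bus", "token": "YOUR_API_TOKEN"},
        timeout=30,
    )
    resp.raise_for_status()
    for record in resp.json():
        print(record)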