NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Investigating user perceptions of conversational agents for software-related exploratory web search

https://doi.org/10.1145/3510455.3512778

Frazier, Matthew; Kumar, Shaayal; Damevski, Kostadin; Pollock, Lori (May 2022, Proceedings of the ACM/IEEE 44th International Conference on Software Engineering: New Ideas and Emerging Results)

Full Text Available
Automatically Identifying the Quality of Developer Chats for Post Hoc Use

https://doi.org/10.1145/3450503

Chatterjee, Preetha; Damevski, Kostadin; Kraft, Nicholas A.; Pollock, Lori (July 2021, ACM Transactions on Software Engineering and Methodology)
null (Ed.)
Software engineers are crowdsourcing answers to their everyday challenges on Q&A forums (e.g., Stack Overflow) and more recently in public chat communities such as Slack, IRC, and Gitter. Many software-related chat conversations contain valuable expert knowledge that is useful for both mining to improve programming support tools and for readers who did not participate in the original chat conversations. However, most chat platforms and communities do not contain built-in quality indicators (e.g., accepted answers, vote counts). Therefore, it is difficult to identify conversations that contain useful information for mining or reading, i.e., conversations of post hoc quality. In this article, we investigate automatically detecting developer conversations of post hoc quality from public chat channels. We first describe an analysis of 400 developer conversations that indicate potential characteristics of post hoc quality, followed by a machine learning-based approach for automatically identifying conversations of post hoc quality. Our evaluation of 2,000 annotated Slack conversations in four programming communities (python, clojure, elm, and racket) indicates that our approach can achieve precision of 0.82, recall of 0.90, F-measure of 0.86, and MCC of 0.57. To our knowledge, this is the first automated technique for detecting developer conversations of post hoc quality.
more » « less
Full Text Available
Automatic Extraction of Opinion-Based Q&A from Online Developer Chats

https://doi.org/10.1109/ICSE43902.2021.00115

Chatterjee, Preetha; Damevski, Kostadin; Pollock, Lori (May 2021, 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE))
null (Ed.)
Full Text Available
Automatically Selecting Follow-up Questions for Deficient Bug Reports

https://doi.org/10.1109/MSR52588.2021.00029

Imran, Mia Mohammad; Ciborowska, Agnieszka; Damevski, Kostadin (May 2021, 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR))
null (Ed.)
Full Text Available
Software-related Slack Chats with Disentangled Conversations

Chatterjee, Preetha; Damevski, Kostadin; Kraft, Nicholas A.; Pollock, Lori (January 2020, IEEE International Working Conference on Mining Software Repositories)

More than ever, developers are participating in public chat communities to ask and answer software development questions. With over ten million daily active users, Slack is one of the most popular chat platforms, hosting many active channels focused on software development technologies, e.g., python, react. Prior studies have shown that public Slack chat transcripts contain valuable information, which could provide support for improving automatic software maintenance tools or help researchers understand developer struggles or concerns. In this paper, we present a dataset of software-related chat conversations, curated for two years from three open Slack communities (python, clojure, elm). Our dataset consists of 38,955 conversations, 437,893 utterances, contributed by 12,171 users. We also share the code for a customized machine-learning based algorithm that automatically extracts (or disentangles) conversations from the downloaded chat transcripts.
more » « less
Full Text Available
Automatically Identifying Valid API Versions for Software Development Tutorials on the Web

Nishi, Manziba A.; Damevski, K. (August 2019, Journal of software)

Online tutorials are a valuable source of community created information used by numerous developers to learn new APIs and techniques. Once written, tutorials are rarely actively curated and can become dated over time. Tutorials often reference APIs that change rapidly, and deprecated classes, methods and fields can render tutorials inapplicable to newer releases of the API.Newer tutorials may not be compatible with older APIs that are still in use. In this paper, we first empirically study the tutorial versioning problem, confirming its presence in popular tutorials on the Web. We subsequently propose a technique, based on similar techniques in the literature, for automatically detecting the applicable API version ranges of tutorials, given access to the official API documentation they reference. The proposed technique identifies each API mention in a tutorial and maps the mention to the corresponding API element in the official documentation. The version of the tutorial is determined by combining the version ranges of all of the constituent API mentions. Our technique’s precision varies from 61% to 89% and recall varies from 42% to 84% based on different levels of granularity of API mentions and different problem constraints. We observe API methods are the most challenging to accurately disambiguate due to method overloading. As the API mentions in tutorials are often redundant, and each mention of a specific API element commonly occurs several times in a tutorial, the distance of the predicted version range from the true version range is low; 3.61 on average for the tutorials in our sample.
more » « less
Full Text Available
Exploratory Study of Slack Q&A Chats as a Mining Source for Software Engineering Tools

https://doi.org/10.1109/MSR.2019.00075

Chatterjee, P.; Damevski, K.; Pollock, L.; Augustine, V.; Kraft, N. (July 2019, Proceedings of the 16th International Conference on Mining Software Repositories (MSR’19))

Modern software development communities are increasingly social. Popular chat platforms such as Slack host public chat communities that focus on specific development topics such as Python or Ruby-on-Rails. Conversations in these public chats often follow a Q&A format, with someone seeking information and others providing answers in chat form. In this paper, we describe an exploratory study into the potential usefulness and challenges of mining developer Q&A conversations for supporting software maintenance and evolution tools. We designed the study to investigate the availability of information that has been successfully mined from other developer communications, particularly Stack Overflow. We also analyze characteristics of chat conversations that might inhibit accurate automated analysis. Our results indicate the prevalence of useful information, including API mentions and code snippets with descriptions, and several hurdles that need to be overcome to automate mining that information.
more » « less
Full Text Available

Search for: All records