Open-Source LLMs for Technical Q&A: Lessons from StackExchange

Citation Details

This content will become publicly available on May 23, 2026

In the rapidly evolving domain of software engineering (SE), Large Language Models (LLMs) are increasingly leveraged to automate developer support. Open source LLMs have grown competitive with pro- prietary models such as GPT-4 and Claude-3, without the associated financial and accessibility constraints. This study investigates whether state of the art open source LLMs including Solar-10.7B, CodeLlama-7B, Mistral-7B, Qwen2-7B, StarCoder2-7B, and LLaMA3-8B can generate responses to technical queries that align with those crafted by human experts. Leveraging retrieval augmented generation (RAG) and targeted fine tuning, we evaluate these models across critical performance dimen- sions, such as semantic alignment and contextual fluency. Our results show that Solar-10.7B, particularly when paired with RAG and fine tun- ing, most closely replicates expert level responses, o!ering a scalable and cost e!ective alternative to commercial models. This vision paper high- lights the potential of open-source LLMs to enable robust and accessible AI-powered developer assistance in software engineering. more »

Award ID(s):: 2020751

PAR ID:: 10639623

Author(s) / Creator(s):: Babar, Zeerak; Khan, Nafiz Imtiaz; Hassnain, Muhammad; Filkov, Vladimir

Publisher / Repository:: 2025 International Conference on Software Engineering of Emerging Technologies (SEET-25)

Date Published:: 2025-05-23

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on May 23, 2026
Conference Paper:
The DOI is not currently available.

More Like this