NSF PAR Search | NSF Public Access Repository

Open-Source LLMs for Technical Q&A: Lessons from StackExchange

Babar, Zeerak; Khan, Nafiz Imtiaz; Hassnain, Muhammad; Filkov, Vladimir (May 2025, 2025 International Conference on Software Engineering of Emerging Technologies (SEET-25))

In the rapidly evolving domain of software engineering (SE), Large Language Models (LLMs) are increasingly leveraged to automate developer support. Open source LLMs have grown competitive with pro- prietary models such as GPT-4 and Claude-3, without the associated financial and accessibility constraints. This study investigates whether state of the art open source LLMs including Solar-10.7B, CodeLlama-7B, Mistral-7B, Qwen2-7B, StarCoder2-7B, and LLaMA3-8B can generate responses to technical queries that align with those crafted by human experts. Leveraging retrieval augmented generation (RAG) and targeted fine tuning, we evaluate these models across critical performance dimen- sions, such as semantic alignment and contextual fluency. Our results show that Solar-10.7B, particularly when paired with RAG and fine tun- ing, most closely replicates expert level responses, o!ering a scalable and cost e!ective alternative to commercial models. This vision paper high- lights the potential of open-source LLMs to enable robust and accessible AI-powered developer assistance in software engineering.

Free, publicly-accessible full text available May 23, 2026

Search for: All records