RAGFix: Enhancing LLM Code Repair Using RAG and Stack Overflow Posts

Mansur, Elijah; Chen, Johnson; Raza, Muhammad A; Wardat, Mohammad

This content will become publicly available on January 16, 2026

Identifying, localizing, and resolving bugs in software engineering is challenging and costly. Approaches to resolving software bugs range from Large Language Model (LLM) code analysis and repair to automated code repair technology that aims to alleviate the technical burden of difficult-to-solve bugs. We propose RAGFix, which enhances LLMs' capabilities for bug localization and code repair using Retrieval Augmented Generation (RAG) based on dynamically collected Stack Overflow posts. These posts are searchable via a Question and Answer Knowledge Graph (KGQA). We evaluate our method on the HumanEvalFix benchmark for Python using relevant closed and open-source models. Our approach facilitates error resolution in Python coding problems by creating a searchable, embedded knowledge graph representation of bug and solution information from Stack Overflow, interlinking bugs and solutions through semi-supervised graph construction methods. To find relevant results that improve LLM in-context performance, we use cosine similarity on embeddings based on LLM-synthesized summaries and algorithmic features describing the coding problem and potential solution. Our results indicate that our system enhances small open-source models' ability to repair code effectively, particularly where these models have less parametric knowledge about relevant coding problems and can leverage nonparametric knowledge to provide accurate, actionable fixes.
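The retrieval step described in the abstract (ranking stored posts by cosine similarity between embeddings) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the post identifiers, and the three-dimensional toy vectors are all hypothetical, and a real system would obtain its embeddings from an embedding model rather than hard-coding them.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, posts, k=2):
    """Return the k (post_id, score) pairs most similar to the query, best first."""
    scored = [(post_id, cosine_similarity(query_vec, vec)) for post_id, vec in posts.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings standing in for embedded Stack Overflow posts.
posts = {
    "post_index_error": [0.9, 0.1, 0.0],
    "post_type_error": [0.1, 0.9, 0.0],
    "post_off_by_one": [0.8, 0.2, 0.1],
}

# Hypothetical embedding of the buggy code's summary.
query = [1.0, 0.0, 0.0]

results = top_k(query, posts, k=2)
for post_id, score in results:
    print(post_id, round(score, 3))
```

In a RAG setting, the text of the top-ranked posts would then be placed in the model's prompt as context for the repair; that step is omitted here.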

Award ID(s): 2349663

PAR ID: 10570231

Author(s) / Creator(s): Mansur, Elijah; Chen, Johnson; Raza, Muhammad A; Wardat, Mohammad

Publisher / Repository: IEEE

Date Published: 2025-01-16

ISSN: 2573-2978

ISBN: 979-8-3503-6248-0

Format(s): Medium: X

Location: Washington, DC, USA

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Conference Proceeding:
The DOI is not currently available.