Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
Abstract
Large language models face significant challenges in software issue resolution, prompting the development of autonomous coding agents through various training-free and training-based methodologies.
Issue resolution, a complex Software Engineering (SWE) task integral to real-world development, has emerged as a compelling challenge for artificial intelligence. The establishment of benchmarks like SWE-bench revealed this task as profoundly difficult for large language models, thereby significantly accelerating the evolution of autonomous coding agents. This paper presents a systematic survey of this emerging domain. We begin by examining data construction pipelines, covering automated collection and synthesis approaches. We then provide a comprehensive analysis of methodologies, spanning training-free frameworks with their modular components to training-based techniques, including supervised fine-tuning and reinforcement learning. Subsequently, we discuss critical analyses of data quality and agent behavior, alongside practical applications. Finally, we identify key challenges and outline promising directions for future research. An open-source repository is maintained at https://github.com/DeepSoftwareAnalytics/Awesome-Issue-Resolution to serve as a dynamic resource in this field.
Community
๐ Awesome issue resolution: a comprehensive survey!
This paper surveyed 175+ works to construct the first unified taxonomy serving as the comprehensive roadmap for issue resolution.
arXiv explained breakdown of this paper ๐ https://arxivexplained.com/papers/advances-and-frontiers-of-llm-based-issue-resolution-in-software-engineering-a-comprehensive-survey
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper