AI Summary
To localize buggy functions precisely from natural-language error descriptions in large-scale multilingual codebases, this paper proposes SweRank+, the first cross-lingual defect localization framework supporting multi-turn interactive retrieval. Methodologically, it combines a cross-lingual code embedding retriever with a listwise LLM-based reranker, and introduces a memory-augmented agentic search loop for progressive, context-aware function ranking. Key contributions include: (1) the first large-scale benchmark dataset designed specifically for multilingual defect localization; (2) a hybrid architecture, SweRankMulti, that combines cross-lingual embedding retrieval with LLM-based reranking; and (3) an iterative, memory-equipped search agent, SweRankAgent. Experiments show that SweRankMulti establishes new state-of-the-art performance in multilingual defect localization, while SweRankAgent improves average Top-1 accuracy by 12.7% over prior approaches.
Abstract
Maintaining large-scale, multilingual codebases hinges on accurately localizing issues, which requires mapping natural-language error descriptions to the relevant functions that need to be modified. However, existing ranking approaches are often Python-centric and perform a single-pass search over the codebase. This work introduces SweRank+, a framework that couples SweRankMulti, a cross-lingual code ranking tool, with SweRankAgent, an agentic search setup, for iterative, multi-turn reasoning over the code repository. SweRankMulti comprises a code embedding retriever and a listwise LLM reranker, and is trained using a carefully curated large-scale issue localization dataset spanning multiple popular programming languages. SweRankAgent adopts an agentic search loop that moves beyond single-shot localization with a memory buffer to reason and accumulate relevant localization candidates over multiple turns. Our experiments on issue localization benchmarks spanning various languages demonstrate new state-of-the-art performance with SweRankMulti, while SweRankAgent further improves localization over single-pass ranking.
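The retrieve-then-rerank pipeline and the multi-turn search loop described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the bag-of-words `embed` function stands in for a trained code embedding model, `rerank` uses token overlap where SweRankMulti uses a listwise LLM, and the query-refinement step in `agent_search` is a hypothetical placeholder for the agent's reasoning. All function names and the toy repository are assumptions made for illustration.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase alphanumeric tokens; snake_case identifiers split on underscores.
    return re.findall(r"[a-z0-9]+", text.lower())

def embed(text):
    # Toy bag-of-words vector (assumption: stands in for a trained code encoder).
    return Counter(tokenize(text))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query, functions, k):
    # Stage 1: rank every function by similarity to the query, keep the top k names.
    q = embed(query)
    ranked = sorted(functions, key=lambda n: cosine(q, embed(functions[n])), reverse=True)
    return ranked[:k]

def rerank(query, candidates, functions):
    # Stage 2 placeholder: orders candidates by distinct shared tokens.
    # A listwise LLM reranker would instead reorder the whole candidate list.
    q = set(tokenize(query))
    return sorted(candidates, key=lambda n: len(q & set(tokenize(functions[n]))), reverse=True)

def agent_search(issue, functions, turns=2, k=2):
    memory = []  # candidates accumulated across turns, in order of discovery
    query = issue
    for _ in range(turns):
        for name in rerank(query, retrieve(query, functions, k), functions):
            if name not in memory:
                memory.append(name)
        # Hypothetical refinement: extend the query with the current top candidate's source.
        query = issue + " " + functions[memory[0]]
    return memory

# Toy multilingual "repository": function name -> source text (illustrative only).
functions = {
    "parse_date": "def parse_date(s): return datetime.strptime(s, fmt)  # parse date string",
    "format_date": "func formatDate(t time.Time) string { return t.Format(layout) }",
    "load_config": "function loadConfig(path) { return JSON.parse(fs.readFileSync(path)) }",
}
issue = "Crash when parsing a date string with timezone"
candidates = agent_search(issue, functions)
```

In this sketch, later turns can surface functions that the first single-pass query missed, and the memory buffer keeps earlier candidates rather than discarding them, which mirrors the accumulate-over-turns behavior the abstract attributes to SweRankAgent.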