Applying Large Language Models to Issue Classification: Revisiting with Extended Data and New Models

📅 2025-05-01
🏛️ Science of Computer Programming
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Automated classification of issue reports in open-source projects remains challenging due to inefficiency of manual labeling and poor generalizability of existing automated approaches, which rely heavily on large-scale annotated data. Method: This paper proposes a lightweight adaptation framework for large language models (LLMs) to enable zero-shot and few-shot cross-project issue classification. It systematically evaluates the generalization capabilities of multiple generations of both open-source (LLaMA-3, Qwen2) and closed-source (Claude-3, GPT-4) LLMs, and introduces instruction tuning with context enhancement—integrating dynamic in-context example retrieval and structured output constraints. Contribution/Results: The approach achieves an 89.7% F1 score on cross-project classification, outperforming BERT by 12.3 percentage points; long-tail category accuracy improves by 27.6%, and domain adaptation cost is significantly reduced.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Automating issue classification in software engineering using LLMs
Reducing reliance on large training datasets for accurate classification
Comparing performance of LLMs like GPT-4o and DeepSeek R1
Innovation

Methods, ideas, or system contributions that make the work stand out.

Using LLMs for automated issue classification
Comparing GPT-4o and DeepSeek R1 performance
Reducing dependency on large training datasets
🔎 Similar Papers
No similar papers found.
G
Gabriel Aracena
Grand Canyon University, 3300 W. Camelback Rd., Phoenix, AZ, USA
K
Kyle Luster
Grand Canyon University, 3300 W. Camelback Rd., Phoenix, AZ, USA
Fabio Santos
Fabio Santos
Colorado State University
Software engineeringArtificial IntelligenceKnowledge ModelingOpen Source SoftwareSocial
Igor Steinmacher
Igor Steinmacher
Northern Arizona University
Software EngineeringCSCWMining Software RepositoriesOpen Source Software
M
M. Gerosa
Northern Arizona University, 1900 S. Knoles Dr., Flagstaff, AZ, USA