Commit-Aware Learning-Based Test Case Prioritization for Continuous Integration

📅 2026-04-28
📈 Citations: 0
Influential: 0
📄 PDF

career value

164K/year
🤖 AI Summary
This work addresses the high cost of regression testing in continuous integration by proposing a unified learning model that incorporates the structural semantics of code diffs into test prioritization—a dimension overlooked by existing approaches. The model integrates diff structural features, test coverage relationships, and historical execution behavior to predict the likelihood of test cases revealing faults. Evaluated through cross-project experiments on five projects from Defects4J, the approach demonstrates significantly superior fault detection effectiveness and model generalizability compared to baseline methods that do not account for commit-aware information.
📝 Abstract
Regression testing in Continuous Integration (CI) pipelines is increasingly costly due to the growing size and execution frequency of test suites. Test Case Prioritization (TCP) mitigates this problem by reordering tests to expose faults earlier. However, most existing techniques rely primarily on historical execution data and coverage metrics, neglecting the rich structural information contained in code changes. This paper proposes a commit-aware, learning-based TCP method that combines structural properties of version-control diffs, test coverage relations, and historical execution behavior into a unified predictive model. Given a new commit, the method estimates the probability that each test suite will reveal at least one failure and prioritizes test execution accordingly. We evaluate our method on five Defects4J projects using a leave-one-project-out cross-project validation setting. Results show that the commit-aware TCP significantly outperform non-commit-aware-baselines in both classification and prioritization effectiveness. Our findings show that including commit structural semantics substantially enhances regression fault detection and enables robust, generalizable learning-based TCP in CI environments.
Problem

Research questions and friction points this paper is trying to address.

Test Case Prioritization
Continuous Integration
Regression Testing
Commit-aware
Code Changes
Innovation

Methods, ideas, or system contributions that make the work stand out.

commit-aware
test case prioritization
learning-based
structural semantics
continuous integration
🔎 Similar Papers
No similar papers found.