RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform

📅 2026-03-05
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work introduces RepoLaunch, which the authors present as the first agent that automatically builds and tests software repositories across arbitrary programming languages and operating systems. Existing build workflows rely heavily on manual intervention in heterogeneous environments; RepoLaunch uses a large language model agent to resolve dependencies, compile source code, and extract test results. Built on it is a fully automated pipeline for creating Software Engineering (SWE) datasets in which task design is the only human step. The approach makes dataset construction for training and evaluating coding agents scalable, and several recent works on agentic benchmarking and training have already adopted it for automated task generation.

📝 Abstract
Building software repositories typically requires significant manual effort. Recent advances in large language model (LLM) agents have accelerated automation in software engineering (SWE). We introduce RepoLaunch, the first agent capable of automatically resolving dependencies, compiling source code, and extracting test results for repositories across arbitrary programming languages and operating systems. To demonstrate its utility, we further propose a fully automated pipeline for SWE dataset creation, where task design is the only human intervention. RepoLaunch automates the remaining steps, enabling scalable benchmarking and training of coding agents and LLMs. Notably, several works on agentic benchmarking and training have recently adopted RepoLaunch for automated task generation.
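
The three stages named in the abstract (dependency resolution, compilation, test-result extraction) can be pictured with a minimal sketch. This is not RepoLaunch's implementation or API: the stage names, the `StageResult` type, the `run_pipeline` and `extract_test_results` helpers, and the `<test> ... <STATUS>` log format are all hypothetical, chosen only for illustration. Each stage is a plain callable so the sketch runs without touching a real repository.

```python
import re
from dataclasses import dataclass

# Hypothetical log format, assumed for illustration: "<test id> ... <STATUS>"
STATUS_LINE = re.compile(r"^(?P<test>\S+)\s+\.\.\.\s+(?P<status>PASS|FAIL|SKIP)\s*$")


@dataclass
class StageResult:
    name: str   # stage label, e.g. "compile"
    ok: bool    # whether the stage succeeded
    log: str    # raw output captured from the stage


def run_pipeline(stages):
    """Run (name, step) pairs in order and stop at the first failing stage."""
    completed = []
    for name, step in stages:
        ok, log = step()
        completed.append(StageResult(name, ok, log))
        if not ok:
            break
    return completed


def extract_test_results(log):
    """Map each test id to its status, ignoring lines that do not match."""
    results = {}
    for line in log.splitlines():
        match = STATUS_LINE.match(line.strip())
        if match:
            results[match.group("test")] = match.group("status")
    return results


# Stand-in stages; a real setup would invoke a package manager, a compiler,
# and a test runner inside an isolated environment.
stages = [
    ("resolve_dependencies", lambda: (True, "dependencies installed")),
    ("compile", lambda: (True, "build finished")),
    ("run_tests", lambda: (True, "test_add ... PASS\ntest_div ... FAIL\nsummary line")),
]

results = run_pipeline(stages)
statuses = extract_test_results(results[-1].log)
print(statuses)
```

Stopping at the first failed stage keeps the test parser from running on the output of a build that never completed; in an agent-driven setting, that failure log is what would be handed back for another attempt.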
Problem

Research questions and friction points this paper is trying to address.

build automation
test pipeline
code repositories
software engineering
cross-platform
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM agents
cross-language automation
build-and-test pipeline
automated dataset generation
software engineering automation