The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers

📅 2026-01-05
🏛️ CCF Transactions on High Performance Computing
📈 Citations: 1
Influential: 1
📄 PDF
🤖 AI Summary
Traditional compilers face limitations in development accessibility, optimization capabilities, and application scope. This work proposes the first multidimensional classification framework for large language model (LLM)-driven compilation, offering a systematic survey of existing research through four analytical dimensions: design philosophy, methodology, level of code abstraction, and task type. The study identifies three core design paradigms—Selector, Translator, and Generator—and highlights three transformative directions: democratizing compiler development, discovering novel optimization strategies, and expanding functional boundaries. It further argues that hybrid systems represent a critical pathway forward and provides a technical roadmap for building correct, scalable, and intelligent compilation tools.

Technology Category

Application Category

📝 Abstract
This survey has provided a systematic overview of the emerging field of LLM-enabled compilation by addressing several key research questions. We first answered how LLMs are being integrated by proposing a comprehensive, multi-dimensional taxonomy that categorizes works based on their Design Philosophy (Selector, Translator, Generator), LLM Methodology, their operational Level of Code Abstraction, and the specific Task Type they address. In answering what advancements these approaches offer, we identified three primary benefits: the democratization of compiler development, the discovery of novel optimization strategies, and the broadening of the compiler's traditional scope. Finally, in addressing the field's challenges and opportunities, we highlighted the critical hurdles of ensuring correctness and achieving scalability, while identifying the development of hybrid systems as the most promising path forward. By providing these answers, this survey serves as a foundational roadmap for researchers and practitioners, charting the course for a new generation of LLM-powered, intelligent, adaptive and synergistic compilation tools.
Problem

Research questions and friction points this paper is trying to address.

LLM-enabled compilation
compiler correctness
scalability
compiler optimization
hybrid systems
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-enabled compilation
compiler democratization
hybrid compilation systems
code optimization with LLMs
taxonomy of compiler-LLM integration
🔎 Similar Papers
No similar papers found.
S
Shuoming Zhang
SKLP, Institute of Computing Technology, CAS, 6th Kexueyuan South Rd, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
Jiacheng Zhao
Jiacheng Zhao
Institute of Computing Technology, Chinese Academy of Scienses
Parallel ComputingParallel CompilingComputer ArchitectureProgramming ModelDatacenter
Q
Qiuchu Yu
SKLP, Institute of Computing Technology, CAS, 6th Kexueyuan South Rd, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
C
Chunwei Xia
University of Leeds, UK
Zheng Wang
Zheng Wang
Professor of Intelligent Software Technology, University of Leeds
Code optimisationparallel computingsoftware testingmachine learning
Xiaobing Feng
Xiaobing Feng
Professor of Institute of Computing Technology, Chinese Academy of Sciecnes
Programming ModelProgramming Analysis and Optimizationg
H
Huimin Cui
SKLP, Institute of Computing Technology, CAS, 6th Kexueyuan South Rd, Beijing, China; University of Chinese Academy of Sciences, Beijing, China