Large Language Models-Aided Program Debloating

📅 2025-03-12

📈 Citations: 0

✨ Influential: 0

career value

163K/year

🤖 AI Summary

Software bloat leads to functional degradation and security vulnerabilities, yet existing debloating techniques struggle to simultaneously preserve functionality and ensure security. This paper proposes LEADER, a novel framework that first leverages large language models (LLMs) for documentation understanding and test generation to guarantee functional equivalence post-debloating; it then integrates neuro-symbolic program analysis with a multi-expert collaborative decision-making mechanism to achieve security-aware code reduction. Its key innovations include: (i) the first documentation-guided test augmentation strategy, and (ii) the pioneering use of LLMs as a core reasoning engine for joint functional-safety optimization—overcoming the overfitting limitations of traditional input-dependent approaches. Evaluated on mainstream benchmarks, LEADER achieves a 23.7% higher functionality retention rate and a 68.4% lower security vulnerability introduction rate compared to the state-of-the-art tool CovA, attaining a Pareto-optimal trade-off between functionality and security.

Technology Category

Application Category

📝 Abstract

As software grows in complexity to accommodate diverse features and platforms, software bloating has emerged as a significant challenge, adversely affecting performance and security. However, existing approaches inadequately address the dual objectives of debloating: maintaining functionality by preserving essential features and enhancing security by reducing security issues. Specifically, current software debloating techniques often rely on input-based analysis, using user inputs as proxies for the specifications of desired features. However, these approaches frequently overfit provided inputs, leading to functionality loss and potential security vulnerabilities. To address these limitations, we propose LEADER, a program debloating framework enhanced by Large Language Models (LLMs), which leverages their semantic understanding, generative capabilities, and decision-making strengths. LEADER mainly consists of two modules: (1) a documentation-guided test augmentation module designed to preserve functionality, which leverages LLMs to comprehend program documentation and generates sufficient tests to cover the desired features comprehensively, and (2) a multi-advisor-aided program debloating module that employs a neuro-symbolic pipeline to ensure that the security of the software can be perceived during debloating. This module combines debloating and security advisors for analysis and employs an LLM as a decision-maker to eliminate undesired code securely. Extensive evaluations on widely used benchmarks demonstrate the efficacy of LEADER. These results demonstrate that LEADER surpasses the state-of-the-art tool CovA in functionality and security. These results underscore the potential of LEADER to set a new standard in program debloating by effectively balancing functionality and security.

Problem

Research questions and friction points this paper is trying to address.

Address software bloating affecting performance and security.

Enhance debloating by preserving functionality and reducing vulnerabilities.

Overcome overfitting in input-based debloating techniques using LLMs.

Innovation

Methods, ideas, or system contributions that make the work stand out.

LLMs enhance semantic understanding for debloating.

Documentation-guided test augmentation preserves functionality.

Neuro-symbolic pipeline ensures security during debloating.

🔎 Similar Papers

A Systematic Literature Review on Large Language Models for Automated Program Repair