Hunting Vulnerability Variants in AI Infra: Measurement and Reference-Driven Detection

๐Ÿ“… 2026-05-19
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

204K/year
๐Ÿค– AI Summary
This study addresses the growing concern of vulnerability variants in AI infrastructure caused by code reuse, whose prevalence and detectability remain poorly understood. The work presents the first systematic evidence of widespread cross-repository vulnerability variants and introduces a reference-driven, semantic-level detection approach. By leveraging a multi-agent framework to extract semantic features from known vulnerabilities and integrating static analysis, the method automatically identifies and validates variant vulnerabilities in new codebases. Evaluated across 20 prominent AI infrastructure projects, the approach uncovered over 20 previously unknown vulnerabilities, 11 of which were confirmed by developers and 4 assigned CVE identifiers.
๐Ÿ“ Abstract
AI infra has become a shared execution layer for model training, deployment, and agent orchestration. Because many projects reimplement similar model-centric workflows, a vulnerability disclosed in one repository can recur as a variant in another repository with a related design. Yet the prevalence and detectability of these variants remain poorly understood. This paper presents a measurement study of vulnerability variants in AI infra. Analyzing 688 GitHub repositories and 251 publicly disclosed vulnerabilities, we find that AI infra projects frequently share overlapping functionality and recurrent vulnerable patterns, creating a concrete basis for cross-repository variants. Building on this finding, we study how to automatically identify such variants from known disclosures. We propose INFRASCOPE, a reference-driven multi-agent framework that extracts transferable vulnerability semantics from known cases and uses them to locate and validate variants in new repositories. Evaluating INFRASCOPE on 20 real-world AI infra repositories, we uncover over 20 vulnerabilities, including 11 acknowledged cases and 4 cases that have been assigned CVEs so far.
Problem

Research questions and friction points this paper is trying to address.

vulnerability variants
AI infra
cross-repository
security measurement
recurrent vulnerabilities
Innovation

Methods, ideas, or system contributions that make the work stand out.

vulnerability variants
AI infrastructure
reference-driven detection
multi-agent framework
transferable vulnerability semantics
๐Ÿ”Ž Similar Papers
No similar papers found.
Tian Dong
Tian Dong
Shanghai Jiao Tong University
Computer SecurityMachine Learning
Yanjun Chen
Yanjun Chen
University of Illinois Urbana-Champaign
Human Computer InteractionHaptics
S
Shoufeng Zhang
The University of Hong Kong
H
Huaien Zhang
The University of Hong Kong
Y
Yunlong Lyu
The University of Hong Kong
K
Keke Lian
Tencent
D
Dong Zhang
Tencent
Shaofeng Li
Shaofeng Li
Southeast University
AI SecurityBackdoor Attacks
H
Hao Chen
The University of Hong Kong