AI-RAN: Transforming RAN with AI-driven Computing Infrastructure

📅 2025-01-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the growing demand for deep integration of AI and wireless networks, this paper proposes the AI-RAN three-layer architectural paradigm—the first systematic definition of key requirements and a reference architecture for synergistic communication and AI computing. Leveraging the NVIDIA Grace-Hopper GH200 platform, we realize real-time co-location of RAN baseband processing and AI inference on homogeneous hardware, enabled by software-hardware co-scheduling, ultra-low-latency task orchestration, and heterogeneous workload co-placement. End-to-end proof-of-concept validation demonstrates a 40% improvement in resource utilization and end-to-end latency meeting uRLLC requirements (<1 ms). This work establishes a unified infrastructure foundation supporting all three AI-RAN paradigms—AI-for-RAN, AI-on-RAN, and AI-and-RAN—thereby advancing standardization and industrial deployment of integrated communication-and-computation wireless access networks.

Technology Category

Application Category

📝 Abstract
The radio access network (RAN) landscape is undergoing a transformative shift from traditional, communication-centric infrastructures towards converged compute-communication platforms. This article introduces AI-RAN which integrates both RAN and artificial intelligence (AI) workloads on the same infrastructure. By doing so, AI-RAN not only meets the performance demands of future networks but also improves asset utilization. We begin by examining how RANs have evolved beyond mobile broadband towards AI-RAN and articulating manifestations of AI-RAN into three forms: AI-for-RAN, AI-on-RAN, and AI-and-RAN. Next, we identify the key requirements and enablers for the convergence of communication and computing in AI-RAN. We then provide a reference architecture for advancing AI-RAN from concept to practice. To illustrate the practical potential of AI-RAN, we present a proof-of-concept that concurrently processes RAN and AI workloads utilizing NVIDIA Grace-Hopper GH200 servers. Finally, we conclude the article by outlining future work directions to guide further developments of AI-RAN.
Problem

Research questions and friction points this paper is trying to address.

Artificial Intelligence
Wireless Networks
Optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-RAN
Integrated Computing and Communication
Resource Optimization
🔎 Similar Papers
No similar papers found.
L
Lopamudra Kundu
NVIDIA
Xingqin Lin
Xingqin Lin
NVIDIA
5G/6GAI/MLNon-Terrestrial NetworksUAV/satellite/HAPS/A2GNR/NB-IoT/eMTC/LTE
R
Rajesh Gadiyar
NVIDIA
J
Jean-Francois Lacasse
NVIDIA
S
Shuvo Chowdhury
NVIDIA