π€ AI Summary
Compliance testing in O-RANβs multi-vendor environment suffers from fragmented, error-prone, and non-scalable manual processes. Method: This paper proposes the first specification-aware, AI-driven automated testing framework, featuring a collaborative tri-modal LLM architecture (Gen-LLM, Val-LLM, Debug-LLM) that enables end-to-end, trustworthy mapping from 3GPP/O-RAN specifications to executable test logic. It integrates formal specification semantic parsing, signaling flow modeling, human-in-the-loop review, and automated root-cause analysis to establish a closed loop: βstandard parsing β test generation β signaling conformance verification β diagnosis & repair.β Contribution/Results: Evaluated on O-RAN TIFG and WG5-IoT standard test cases, the framework reduces test cycle time significantly, improves verification accuracy, and effectively overcomes consistency and scalability bottlenecks inherent in manual testing.
π Abstract
The advent of Open Radio Access Networks (O-RAN) has transformed the telecommunications industry by promoting interoperability, vendor diversity, and rapid innovation. However, its disaggregated architecture introduces complex testing challenges, particularly in validating multi-vendor components against O-RAN ALLIANCE and 3GPP specifications. Existing frameworks, such as those provided by Open Testing and Integration Centres (OTICs), rely heavily on manual processes, are fragmented and prone to human error, leading to inconsistency and scalability issues. To address these limitations, we present AI5GTest -- an AI-powered, specification-aware testing framework designed to automate the validation of O-RAN components. AI5GTest leverages a cooperative Large Language Models (LLM) framework consisting of Gen-LLM, Val-LLM, and Debug-LLM. Gen-LLM automatically generates expected procedural flows for test cases based on 3GPP and O-RAN specifications, while Val-LLM cross-references signaling messages against these flows to validate compliance and detect deviations. If anomalies arise, Debug-LLM performs root cause analysis, providing insight to the failure cause. To enhance transparency and trustworthiness, AI5GTest incorporates a human-in-the-loop mechanism, where the Gen-LLM presents top-k relevant official specifications to the tester for approval before proceeding with validation. Evaluated using a range of test cases obtained from O-RAN TIFG and WG5-IOT test specifications, AI5GTest demonstrates a significant reduction in overall test execution time compared to traditional manual methods, while maintaining high validation accuracy.