🤖 AI Summary
This work addresses the limitations of existing deep reinforcement learning–based agents in 6G networks, which commonly suffer from poor interpretability, weak adaptability, and catastrophic forgetting. The authors propose the BRAIN agent, which introduces the active inference framework to mobile networking for the first time. By unifying perception and action through variational free energy minimization, BRAIN achieves strong adaptability without retraining, supports causal reasoning, and enables interpretable decision-making. The approach integrates deep generative models, Bayesian inference, and the O-RAN xApp architecture, and is validated on a GPU-accelerated platform. Experimental results demonstrate that BRAIN effectively maintains slice QoS under dynamic traffic conditions, improves robustness to traffic bursts by 28.3%, and facilitates human-understandable diagnosis through its belief-state representations.
📝 Abstract
Future sixth-generation (6G) mobile networks will demand artificial intelligence (AI) agents that are not only autonomous and efficient, but also capable of real-time adaptation in dynamic environments and transparent in their decisionmaking. However, prevailing agentic AI approaches in networking, exhibit significant shortcomings in this regard. Conventional deep reinforcement learning (DRL)-based agents lack explainability and often suffer from brittle adaptation, including catastrophic forgetting of past knowledge under non-stationary conditions. In this paper, we propose an alternative solution for these challenges: Bayesian reasoning via Active Inference (BRAIN) agent. BRAIN harnesses a deep generative model of the network environment and minimizes variational free energy to unify perception and action in a single closed-loop paradigm. We implement BRAIN as O-RAN eXtended application (xApp) on GPU-accelerated testbed and demonstrate its advantages over standard DRL baselines. In our experiments, BRAIN exhibits (i) robust causal reasoning for dynamic radio resource allocation, maintaining slice-specific quality of service (QoS) targets (throughput, latency, reliability) under varying traffic loads, (ii) superior adaptability with up to 28.3% higher robustness to sudden traffic shifts versus benchmarks (achieved without any retraining), and (iii) real-time interpretability of its decisions through human-interpretable belief state diagnostics.