Bridging the Gap Between Simulated and Real Network Data Using Transfer Learning

📅 2025-10-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Network machine learning models trained on simulated data suffer from degraded prediction accuracy in real-world deployments due to domain shift between simulation and reality. Method: This paper proposes a lightweight hybrid transfer learning framework that bridges the simulation-to-reality gap using only a small number of real-network samples. Built upon RouteNet-Fermi, it jointly leverages OMNeT++-generated synthetic traffic data and real traffic traces collected from a custom testbed for efficient fine-tuning. Contribution/Results: The approach significantly reduces reliance on large-scale, labeled real-world data—addressing critical data scarcity in rare but pivotal scenarios (e.g., network failures). Experimental evaluation on packet delay prediction shows up to an 88% reduction in mean absolute percentage error (MAPE). With merely 10 real-world scenario samples, MAPE decreases by 37%; with 50 samples, the reduction reaches 48%. These results demonstrate substantial improvements in model robustness and generalization capability for practical deployment.

Technology Category

Application Category

📝 Abstract
Machine Learning (ML)-based network models provide fast and accurate predictions for complex network behaviors but require substantial training data. Collecting such data from real networks is often costly and limited, especially for critical scenarios like failures. As a result, researchers commonly rely on simulated data, which reduces accuracy when models are deployed in real environments. We propose a hybrid approach leveraging transfer learning to combine simulated and real-world data. Using RouteNet-Fermi, we show that fine-tuning a pre-trained model with a small real dataset significantly improves performance. Our experiments with OMNeT++ and a custom testbed reduce the Mean Absolute Percentage Error (MAPE) in packet delay prediction by up to 88%. With just 10 real scenarios, MAPE drops by 37%, and with 50 scenarios, by 48%.
Problem

Research questions and friction points this paper is trying to address.

Reducing accuracy gap between simulated and real network data
Minimizing real data requirements for machine learning models
Improving packet delay prediction in network deployments
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses transfer learning to combine simulated and real data
Fine-tunes pre-trained models with small real datasets
Reduces packet delay prediction error by up to 88%
🔎 Similar Papers
No similar papers found.
C
Carlos Güemes-Palau
Barcelona Neural Networking Center, Universitat Politècnica de Catalunya
M
Miquel Ferriol-Galmés
Barcelona Neural Networking Center, Universitat Politècnica de Catalunya
J
Jordi Paillisse-Vilanova
Barcelona Neural Networking Center, Universitat Politècnica de Catalunya
A
Albert López-Brescó
Barcelona Neural Networking Center, Universitat Politècnica de Catalunya
Pere Barlet-Ros
Pere Barlet-Ros
Universitat Politècnica de Catalunya (UPC)
CybersecurityPrivacyMachine LearningGraph Neural Networks
Albert Cabellos-Aparicio
Albert Cabellos-Aparicio
UPC-BarcelonaTech
Computer Networks