🤖 AI Summary
This work addresses the absence of open-source, self-calibrating digital twin systems in existing data centers, which hinders real-time autonomous operations and dynamic optimization. To bridge this gap, we propose OpenDT—the first open, self-calibrating digital twin framework tailored for data centers—that integrates continuous telemetry streams, discrete-event simulation driven by real-world ICT workloads, and SLO-aware human-in-the-loop feedback to enable online modeling and optimization of both performance and energy efficiency. Designed in accordance with FAIR and FOSS principles, OpenDT supports online recalibration to enhance model fidelity. Experimental evaluation demonstrates that OpenDT not only reproduces prior findings but also extends energy-efficiency analysis capabilities, significantly reducing the mean absolute percentage error (MAPE) from 7.86% to 4.39%.
📝 Abstract
Datacenters are the backbone of our digital society, but raise numerous operational challenges. We envision digital twins becoming primary instruments in datacenter operations, continuously and autonomously helping with major operational decisions and with adapting ICT infrastructure, live, with a human-in-the-loop. Although fields such as aviation and autonomous driving successfully employ digital twins, an open-source digital twin for datacenters has not been demonstrated to the community. Addressing this challenge, we design, implement, and experiment using OpenDT, an Open-source, Digital Twin for monitoring and operating datacenters through a continuous integration cycle that includes: (1) live and continuous telemetry data; (2) discrete-event simulation using live telemetry from the physical ICT, with self-calibration; and (3) SLO-aware and human-approved feedback to physical ICT. Through trace-driven experiments with a prototype mainly covering stages 1 and 2 of the cycle, we show that (i) OpenDT can be used to reproduce peer-reviewed experiments and extend the analysis with performance and energy-efficiency results; (ii) OpenDT's online re-calibration can increase digital-twinning accuracy, quantified to a MAPE of 4.39% vs. 7.86% in peer-reviewed work. OpenDT adheres to FAIR/FOSS principles and is available at: https://github.com/atlarge-research/opendt/tree/hcp.