๐ค AI Summary
This work addresses the joint optimization of carbon budget compliance and low-latency service availability in microservice deployment within data centers. We propose the first holistic framework for hourly carbon-intensity-aware microservice version selection and horizontal autoscaling. Methodologically, we formulate an integer linear programming model that integrates real-time carbon intensity signals, dynamic workload fluctuations, and hard QoS constraints (latency and availability), enabling carbon-aware scheduling at the microservice level in real time. Our key contributions are: (i) extending carbon-aware scheduling to the fine-grained, highly dynamic microservice layer for the first time; and (ii) introducing a version-aware autoscaling mechanism that supports heterogeneous deployment of multiple service versions. Experiments demonstrate that our approach strictly satisfies hourly carbon budgets across diverse application configurations, improves average user satisfaction by 18.7%, increases revenue by 12.3%, and responds to minute-scale variations in both carbon intensity and workload.
๐ Abstract
The carbon footprint of data centers has recently become a critical concern. So far, most carbon-aware strategies have focused on leveraging the flexibility of scheduling decisions for batch processing by shifting the time and location of workload executions. However, such approaches cannot be applied to service-oriented cloud applications, since they have to be reachable at every point in time and often at low latencies. We propose a carbon-aware approach for operating microservices under hourly carbon budgets. By choosing the most appropriate version and horizontal scaleout for each microservice, our strategy maximizes user experience and revenue while staying within budget constraints. Experiments across various application configurations and carbon budgets demonstrate that the approach adapts properly to changing workloads and carbon intensities.