🤖 AI Summary
AI workloads in multi-tenant clouds suffer from inefficient resource utilization due to bidirectional architectural and semantic misalignment between tenants and cloud providers.
Method: This paper proposes the first tenant–provider co-designed cloud abstraction paradigm,重构 the cloud abstraction layer via cross-layer interface design, lightweight workload feature embedding, and verifiable collaborative policy negotiation—enabling joint optimization of task partitioning, scheduling, and fault tolerance.
Contribution/Results: The work systematically identifies and models four key collaboration opportunities, establishing the first comprehensive framework for AI workload co-optimization. Experiments demonstrate significant improvements in performance, energy efficiency, and elasticity—while maintaining task stability and supporting sustainability objectives.
📝 Abstract
AI workloads, often hosted in multi-tenant cloud environments, require vast computational resources but suffer inefficiencies due to limited tenant-provider coordination. Tenants lack infrastructure insights, while providers lack workload details to optimize tasks like partitioning, scheduling, and fault tolerance. We propose the HarmonAIze project to redefine cloud abstractions, enabling cooperative optimization for improved performance, efficiency, resiliency, and sustainability. This paper outlines key opportunities, challenges, and a research agenda to realize this vision.