🤖 AI Summary
Financial institutions face capacity planning and job scheduling challenges in hybrid cloud and on-premise grid environments, where both resource requirements and execution durations exhibit dual uncertainty. Method: This paper proposes a co-optimization framework that jointly minimizes resource provisioning while maximizing service quality—specifically, on-time completion rate. Innovatively, it is the first to jointly model resource and duration uncertainty within capacity planning, employing a constraint programming framework based on paired sampling that integrates deterministic estimation with stochastic sampling for efficient approximate optimization. Contribution/Results: Experiments demonstrate that the method significantly reduces peak resource demand compared to manual scheduling, while maintaining a high on-time completion rate—validating its effectiveness in balancing these conflicting objectives under uncertainty.
📝 Abstract
Organizations around the world schedule jobs (programs) regularly to perform various tasks dictated by their end users. With the major movement towards using a cloud computing infrastructure, our organization follows a hybrid approach with both cloud and on-prem servers. The objective of this work is to perform capacity planning, i.e., estimate resource requirements, and job scheduling for on-prem grid computing environments. A key contribution of our approach is handling uncertainty in both resource usage and duration of the jobs, a critical aspect in the finance industry where stochastic market conditions significantly influence job characteristics. For capacity planning and scheduling, we simultaneously balance two conflicting objectives: (a) minimize resource usage, and (b) provide high quality-of-service to the end users by completing jobs by their requested deadlines. We propose approximate approaches using deterministic estimators and pair sampling-based constraint programming. Our best approach (pair sampling-based) achieves much lower peak resource usage compared to manual scheduling without compromising on the quality-of-service.