Capacity Planning and Scheduling for Jobs with Uncertainty in Resource Usage and Duration

📅 2025-07-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Financial institutions face capacity planning and job scheduling challenges in hybrid cloud and on-premise grid environments, where both resource requirements and execution durations exhibit dual uncertainty. Method: This paper proposes a co-optimization framework that jointly minimizes resource provisioning while maximizing service quality—specifically, on-time completion rate. Innovatively, it is the first to jointly model resource and duration uncertainty within capacity planning, employing a constraint programming framework based on paired sampling that integrates deterministic estimation with stochastic sampling for efficient approximate optimization. Contribution/Results: Experiments demonstrate that the method significantly reduces peak resource demand compared to manual scheduling, while maintaining a high on-time completion rate—validating its effectiveness in balancing these conflicting objectives under uncertainty.

Technology Category

Application Category

📝 Abstract
Organizations around the world schedule jobs (programs) regularly to perform various tasks dictated by their end users. With the major movement towards using a cloud computing infrastructure, our organization follows a hybrid approach with both cloud and on-prem servers. The objective of this work is to perform capacity planning, i.e., estimate resource requirements, and job scheduling for on-prem grid computing environments. A key contribution of our approach is handling uncertainty in both resource usage and duration of the jobs, a critical aspect in the finance industry where stochastic market conditions significantly influence job characteristics. For capacity planning and scheduling, we simultaneously balance two conflicting objectives: (a) minimize resource usage, and (b) provide high quality-of-service to the end users by completing jobs by their requested deadlines. We propose approximate approaches using deterministic estimators and pair sampling-based constraint programming. Our best approach (pair sampling-based) achieves much lower peak resource usage compared to manual scheduling without compromising on the quality-of-service.
Problem

Research questions and friction points this paper is trying to address.

Estimate resource needs for hybrid cloud and on-prem grid computing
Handle uncertainty in job resource usage and duration
Balance minimal resource usage and meeting job deadlines
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid cloud and on-prem servers approach
Handling uncertainty in resource and duration
Pair sampling-based constraint programming technique
🔎 Similar Papers
No similar papers found.