🤖 AI Summary
This work addresses the challenge of dynamic load balancing for block-structured grid–particle coupled simulations within the AMReX framework on heterogeneous supercomputing systems. To tackle hardware-aware partitioning inefficiencies, we propose two topology-aware algorithms: (1) a hybrid partitioning scheme integrating the knapsack optimization with space-filling curves (SFCs), and (2) an SFC-based refinement inspired by painter’s partitioning. Compared to AMReX’s default percentage-based load balancing, our methods significantly improve balance quality—particularly under low-weight-variance scenarios. Rigorous evaluation via knapsack optimization, topology-aware domain decomposition, and brute-force enumeration benchmarks demonstrates superior partitioning quality and runtime efficiency across most test cases, with pronounced advantages when particle weight distributions are uniform. The proposed approaches establish a scalable, high-performance paradigm for dynamic load balancing in large-scale, multi-physics HPC simulations.
📝 Abstract
Load balancing is critical for successful large-scale high-performance computing (HPC) simulations. With modern supercomputers increasing in complexity and variability, dynamic load balancing is becoming more critical to use computational resources efficiently. In this study, performed during a summer collaboration at Lawrence Berkeley National Laboratory, we investigate various standard dynamic load-balancing algorithms. This includes the time evaluation of a brute-force solve for application in algorithmic evaluation, as well as quality and time evaluations of the Knapsack algorithm, an SFC algorithm, and two novel algorithms: a painter's partition-based SFC algorithm and a combination Knapsack+SFC methodology-based on hardware topology. The results suggest Knapsack and painter's partition-based algorithms should be among the first algorithms evaluated by HPC codes for cases with limited weight deviation and will perform at least slightly better than AMReX's percentage-tracking partitioning strategy across most simulations, although effects diminish as weight variety increases.