Understanding Power and Energy Utilization in Large Scale Production Physics Simulation Codes

📅 2022-01-04
🏛️ arXiv.org
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Processor thermal design power (TDP) is widely misused as a proxy for actual power consumption in physics simulations, leading to inaccurate energy-efficiency assessments. Method: This study conducts the first empirical power and energy measurements of major production-scale physics simulation codes on heterogeneous exascale supercomputers at LLNL and Sandia. Leveraging multi-granularity energy modeling, cross-platform benchmarking, and real-time monitoring across commercial and advanced CPU–GPU heterogeneous nodes, it systematically quantifies runtime energy efficiency. Contribution/Results: Under typical simulation workloads, measured power draw is only 30–60% of TDP—substantially lower than nominal ratings. This work challenges the longstanding practice of substituting TDP for measured power, establishing an empirically grounded methodology for evaluating energy efficiency in exascale systems. It provides critical, reproducible, and generalizable energy benchmarks to guide hardware deployment and energy-aware optimization, thereby advancing low-carbon scientific computing.
📝 Abstract
Power is an often-cited reason for moving to advanced architectures on the path to Exascale computing. This is due to the practical concern of delivering enough power to successfully site and operate these machines, as well as concerns over energy usage while running large simulations. Since accurate power measurements can be difficult to obtain, processor thermal design power (TDP) is a possible surrogate due to its simplicity and availability. However, TDP is not indicative of typical power usage while running simulations. Using commodity and advance technology systems at Lawrence Livermore National Laboratory (LLNL) and Sandia National Laboratory, we performed a series of experiments to measure power and energy usage in running simulation codes. These experiments indicate that large scale LLNL simulation codes are significantly more efficient than a simple processor TDP model might suggest.
Problem

Research questions and friction points this paper is trying to address.

Measure power usage in large-scale physics simulations
Compare TDP with actual simulation energy efficiency
Evaluate energy efficiency of advanced computing architectures
Innovation

Methods, ideas, or system contributions that make the work stand out.

Measure power usage in simulation codes
Compare TDP with actual energy consumption
Use advanced systems for accurate experiments
🔎 Similar Papers
No similar papers found.
B
Brian S. Ryujin
Lawrence Livermore National Laboratory, Livermore, CA, USA
A
Arturo Vargas
Lawrence Livermore National Laboratory, Livermore, CA, USA
Ian Karlin
Ian Karlin
Lawrence Livermore National Laboratory
S
Shawn A. Dawson
Lawrence Livermore National Laboratory, Livermore, CA, USA
Kenneth Weiss
Kenneth Weiss
Computer Scientist, Lawrence Livermore National Laboratory
Scientific VisualizationSpatial Data StructuresShape ModelingGeometry ProcessingHigh Performance Computing
A
Adam Bertsch
Lawrence Livermore National Laboratory, Livermore, CA, USA
M
M. Scott McKinley
Lawrence Livermore National Laboratory, Livermore, CA, USA
M
Michael R. Collette
Lawrence Livermore National Laboratory, Livermore, CA, USA
S
Si D. Hammond
National Nuclear Security Administration US Department of Energy, Washington, DC, USA
Kevin Pedretti
Kevin Pedretti
Sandia National Laboratories
High Performance Computing - Operating Systems - Networking
Robert N. Rieben
Robert N. Rieben
Computational Physicist, Lawrence Livermore National Laboratory
MagnetohydrodynamicsComputational PhysicsFinite Element MethodsHigh Order MethodsALE Methods