Enabling Seamless Transitions from Experimental to Production HPC for Interactive Workflows

📅 2025-06-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Experimental HPC environments struggle to support seamless migration of interactive scientific workflows into production systems. Method: This paper proposes a tripartite transition framework integrating dataflow-driven execution, zero-trust security interfaces, and QoS-aware adaptive scheduling. It innovatively unifies dynamic dataflow architecture, zero-trust–based secure service interfaces, and elastic, timeliness-sensitive resource scheduling to enable the paradigm shift from batch-oriented HPC to near-real-time interactive ecosystems. Contribution/Results: Evaluated on the Oak Ridge Leadership Computing Facility (OLCF) platform, the framework achieves a 40% reduction in end-to-end latency, sub-second interactive response times, and enables multi-disciplinary real-time closed-loop experiments. It has been deployed in production on the Summit and Frontier exascale supercomputers, providing a reusable, structured migration pathway for cross-facility collaboration, dynamic experimental control, and near-real-time analysis.

Technology Category

Application Category

📝 Abstract
The evolving landscape of scientific computing requires seamless transitions from experimental to production HPC environments for interactive workflows. This paper presents a structured transition pathway developed at OLCF that bridges the gap between development testbeds and production systems. We address both technological and policy challenges, introducing frameworks for data streaming architectures, secure service interfaces, and adaptive resource scheduling for time-sensitive workloads and improved HPC interactivity. Our approach transforms traditional batch-oriented HPC into a more dynamic ecosystem capable of supporting modern scientific workflows that require near real-time data analysis, experimental steering, and cross-facility integration.
Problem

Research questions and friction points this paper is trying to address.

Bridging experimental and production HPC for interactive workflows
Addressing technological and policy challenges in HPC transitions
Enhancing HPC interactivity for real-time data analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Structured transition pathway for HPC environments
Frameworks for data streaming and secure interfaces
Adaptive resource scheduling for real-time workloads
🔎 Similar Papers
No similar papers found.
B
Brian D. Etz
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
David M. Rogers
David M. Rogers
Staff Scientist, Oak Ridge National Laboratory
Computational Physical ChemistryNonequilibrium Statistical MechanicsSolvation Dynamics
Michael J. Brim
Michael J. Brim
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Ketan Maheshwari
Ketan Maheshwari
Engineer, Oak Ridge National Laboratory
Science ApplicationsPerformant ComputingStorage
K
Kellen Leland
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Tyler J. Skluzacek
Tyler J. Skluzacek
Oak Ridge National Lab
workflowshpcinformation extraction
Jack Lange
Jack Lange
Oak Ridge National Laboratory and University of Pittsburgh
High Performance ComputingOperating SystemsDistributed Computing
D
Daniel Pelfrey
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
J
Jordan Webb
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Patrick Widener
Patrick Widener
Oak Ridge National Laboratory
Operating systemshigh-performance computinglarge-data applicationsmiddleware
R
Ryan Adamson
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Christopher Zimmer
Christopher Zimmer
Oak Ridge National Laboratory
HPCHigh Performance NetworksReliabilityStorage
Verónica G. Melesse Vergara
Verónica G. Melesse Vergara
Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
Mallikarjun Shankar
Mallikarjun Shankar
Distinguished Research Scientist, Oak Ridge National Laboratory
MiddlewareHPC Data AnalyticsSensor NetworksEnergy GridsHealthcare
Sarp Oral
Sarp Oral
Oak Ridge National Laboratory
HPCParallel I/OStorage
Rafael Ferreira da Silva
Rafael Ferreira da Silva
Oak Ridge National Laboratory
Scientific WorkflowsDistributed ComputingWorkflow ManagementModeling and SimulationHigh Performance Computing