Hybrid Edge-HPC Systems for Low-Latency Data-Driven Inference

📅 2026-05-19
📈 Citations: 0
Influential: 0
🤖 AI Summary
This work addresses the mismatch between the responsiveness that edge inference requires and the update delays of high-fidelity simulations and training run on remote, batch-scheduled high-performance computing (HPC) systems. To bridge this gap, the authors propose RBF (Reverse Backfill), an architecture that deploys lightweight surrogate models at the edge for real-time inference and asynchronously incorporates improved models as HPC systems produce them, thereby decoupling inference from simulation and training. RBF reinterprets HPC backfilling: opportunistic computation is used to improve model accuracy rather than system utilization. The architecture supports pluggable surrogate models and orchestrates computation across edge devices, private 5G, cloud, and HPC resources. Evaluated in a digital agriculture deployment that infers airflow patterns in a large agricultural screenhouse, the approach sustains low-latency inference while improving model fidelity over time despite delayed and irregular model updates.
📝 Abstract
Emerging cyber-physical systems increasingly require low-latency inference from streaming sensor data while maintaining models that reflect complex and evolving physical processes. In many domains, however, model updates depend on high-fidelity simulations and training executed on remote high-performance computing (HPC) systems under batch scheduling. This creates a fundamental mismatch between the responsiveness required at the edge and the cost, throughput, and availability of simulation-driven model updates. We present RBF (Reverse Backfill), a hybrid edge-HPC learning and inference architecture that integrates low-latency edge inference with asynchronous, simulation-driven model improvement. RBF targets simulation-bounded settings in which model updates are constrained by simulation throughput and HPC scheduling delays, and reinterprets HPC backfilling by using opportunistic computation to improve model accuracy rather than system utilization. RBF decouples inference from simulation and training by deploying lightweight surrogate models at the edge while incorporating improved models asynchronously as they become available. The architecture supports pluggable surrogate models and orchestrates computation across heterogeneous infrastructure spanning edge devices, private 5G, cloud, and HPC resources. We instantiate RBF using a real-world digital agriculture deployment that couples edge sensing with computational fluid dynamics (CFD) simulations to infer airflow patterns in a large agricultural screenhouse. Our evaluation characterizes end-to-end system behavior under realistic constraints, quantifying simulation latency, training cost, inference throughput, and the impact of delayed model updates on prediction accuracy. Results demonstrate that RBF enables continuous, low-latency inference while improving model fidelity over time despite delayed and irregular model updates.
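The decoupling described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class and function names (`Surrogate`, `EdgeInferenceNode`, `offer_update`) are hypothetical, the two "models" are toy functions standing in for trained surrogates, and the rule of discarding updates with an older version number is an assumption about how delayed, out-of-order deliveries might be handled.

```python
import threading
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass(frozen=True)
class Surrogate:
    """A lightweight model plus a version number (hypothetical structure)."""
    version: int
    predict: Callable[[Sequence[float]], float]


class EdgeInferenceNode:
    """Serves predictions from the current surrogate; updates arrive separately."""

    def __init__(self, initial: Surrogate) -> None:
        self._model = initial
        self._lock = threading.Lock()

    def infer(self, reading: Sequence[float]) -> Tuple[int, float]:
        # Only the model reference is read under the lock, so inference
        # never waits on simulation or training.
        with self._lock:
            model = self._model
        return model.version, model.predict(reading)

    def offer_update(self, candidate: Surrogate) -> bool:
        # Assumption: a late-arriving model with an older version is dropped.
        with self._lock:
            if candidate.version <= self._model.version:
                return False
            self._model = candidate
            return True


def mean_model(reading: Sequence[float]) -> float:
    """Toy stand-in for an initial surrogate."""
    return sum(reading) / len(reading)


def max_model(reading: Sequence[float]) -> float:
    """Toy stand-in for an improved surrogate delivered later."""
    return max(reading)


node = EdgeInferenceNode(Surrogate(version=1, predict=mean_model))
before = node.infer([1.0, 2.0, 3.0])

# An improved model arrives at some later, unpredictable time.
accepted = node.offer_update(Surrogate(version=2, predict=max_model))
# A stale model arriving out of order is ignored.
stale = node.offer_update(Surrogate(version=1, predict=mean_model))

after = node.infer([1.0, 2.0, 3.0])
print(before, accepted, stale, after)
```

In this sketch the inference path only ever reads the current model reference, so irregular or delayed updates change which model answers but never block a prediction.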
Problem

Research questions and friction points this paper is trying to address.

low-latency inference
edge-HPC systems
simulation-driven model updates
cyber-physical systems
model fidelity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hybrid Edge-HPC
Reverse Backfill
Surrogate Models
Asynchronous Model Updates
Simulation-Driven Inference
Liubov Kurafeeva
Dept. of Computer Science, University of California Santa Barbara
Ryan Hartung
Dept. of Computer Science and Engineering, University of Notre Dame
Benjamin Carter
Dept. of Computer Science, University of California Santa Barbara
Alan Subedi
School of Computing, University of Nebraska-Lincoln
Avhishek Biswas
School of Computing, University of Nebraska-Lincoln
Michael Fay
School of Computing, University of Nebraska-Lincoln
Shantenu Jha
Rutgers University and Brookhaven National Laboratory
High-performance and Distributed Computing, Cyberinfrastructure, Computational Science
Chandra Krintz
Dept. of Computer Science and Engineering, University of Notre Dame
Andre Merzky
Rutgers University
Douglas Thain
Professor, University of Notre Dame
Distributed systems, clouds, workflows, filesystems, scientific computing
Memet Can Vuran
School of Computing, University of Nebraska-Lincoln
Rich Wolski
Dept. of Computer Science and Engineering, University of Notre Dame