JANUS: Resilient and Adaptive Data Transmission for Enabling Timely and Efficient Cross-Facility Scientific Workflows

📅 2025-06-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address bottlenecks in wide-area, large-scale data transfer for cross-institutional scientific workflows—including excessive bandwidth consumption, inefficient TCP packet-loss recovery, and high overhead of conventional fault-tolerance mechanisms—this paper proposes a UDP-based elastic and adaptive transfer framework. The framework innovatively integrates error-bounded lossy compression (e.g., SZ), dynamically tunable fountain-code erasure coding, and a real-time network-aware integer programming optimization model. This enables on-demand trade-offs between timeliness and fidelity, as well as online, self-adaptive transmission policy adjustment. Experimental evaluation demonstrates that, compared to Globus and standard TCP-based approaches, the framework achieves up to a 3.2× improvement in transfer throughput, accelerates packet-loss recovery by up to 5.8× under bandwidth fluctuations, and ensures controllable, mathematically verifiable data fidelity.

Technology Category

Application Category

📝 Abstract
In modern science, the growing complexity of large-scale projects has increased reliance on cross-facility workflows, where institutions share resources and expertise to accelerate discovery. These workflows often involve transferring massive data over wide-area networks. While high-speed networks like ESnet and data transfer services like Globus have improved data mobility, challenges remain. Large data volumes can strain bandwidth, TCP suffers from retransmissions due to packet loss, and traditional fault-tolerance methods like erasure coding introduce significant overhead. This paper presents JANUS, a resilient and adaptive data transmission approach for cross-facility scientific workflows. JANUS uses UDP, integrates erasure coding for fault tolerance, and applies error-bounded lossy compression to reduce overhead. This design enables users to balance transmission time and accuracy based on specific needs. JANUS also adapts coding parameters to real-time network conditions and uses optimization models to determine ideal configurations. Experiments show that JANUS significantly improves data transfer efficiency while preserving fidelity.
Problem

Research questions and friction points this paper is trying to address.

Improving data transfer efficiency in cross-facility workflows
Reducing overhead from fault-tolerance methods like erasure coding
Adapting to real-time network conditions for optimal performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

UDP-based resilient data transmission
Error-bounded lossy compression technique
Real-time adaptive coding parameters
🔎 Similar Papers
No similar papers found.