๐ค AI Summary
Addressing the challenge of simultaneously achieving progressive decompression, random access, and high compression quality/speed in lossy scientific data compression, this paper proposes the first unified framework supporting both decompression modes. Our method employs hierarchical data partitioning and hierarchical prediction, integrated with error-bounded lossy compression and streaming-based codec design. It maintains reconstruction accuracy comparable to state-of-the-art non-streaming compressors (e.g., SZ3), while significantly improving decompression throughputโup to 6.7ร faster than SZ3. To our knowledge, this is the first approach to jointly optimize high fidelity, high throughput, and flexible data access. The framework enables practical online analysis and on-demand data retrieval, bridging a critical gap between theoretical compression efficiency and real-world scientific computing workflows.
๐ Abstract
Error-bounded lossy compression is one of the most efficient solutions to reduce the volume of scientific data. For lossy compression, progressive decompression and random-access decompression are critical features that enable on-demand data access and flexible analysis workflows. However, these features can severely degrade compression quality and speed. To address these limitations, we propose a novel streaming compression framework that supports both progressive decompression and random-access decompression while maintaining high compression quality and speed. Our contributions are three-fold: (1) we design the first compression framework that simultaneously enables both progressive decompression and random-access decompression; (2) we introduce a hierarchical partitioning strategy to enable both streaming features, along with a hierarchical prediction mechanism that mitigates the impact of partitioning and achieves high compression quality -- even comparable to state-of-the-art (SOTA) non-streaming compressor SZ3; and (3) our framework delivers high compression and decompression speed, up to 6.7$ imes$ faster than SZ3.