STZ: A High Quality and High Speed Streaming Lossy Compression Framework for Scientific Data

๐Ÿ“… 2025-09-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Addressing the challenge of simultaneously achieving progressive decompression, random access, and high compression quality/speed in lossy scientific data compression, this paper proposes the first unified framework supporting both decompression modes. Our method employs hierarchical data partitioning and hierarchical prediction, integrated with error-bounded lossy compression and streaming-based codec design. It maintains reconstruction accuracy comparable to state-of-the-art non-streaming compressors (e.g., SZ3), while significantly improving decompression throughputโ€”up to 6.7ร— faster than SZ3. To our knowledge, this is the first approach to jointly optimize high fidelity, high throughput, and flexible data access. The framework enables practical online analysis and on-demand data retrieval, bridging a critical gap between theoretical compression efficiency and real-world scientific computing workflows.

Technology Category

Application Category

๐Ÿ“ Abstract
Error-bounded lossy compression is one of the most efficient solutions to reduce the volume of scientific data. For lossy compression, progressive decompression and random-access decompression are critical features that enable on-demand data access and flexible analysis workflows. However, these features can severely degrade compression quality and speed. To address these limitations, we propose a novel streaming compression framework that supports both progressive decompression and random-access decompression while maintaining high compression quality and speed. Our contributions are three-fold: (1) we design the first compression framework that simultaneously enables both progressive decompression and random-access decompression; (2) we introduce a hierarchical partitioning strategy to enable both streaming features, along with a hierarchical prediction mechanism that mitigates the impact of partitioning and achieves high compression quality -- even comparable to state-of-the-art (SOTA) non-streaming compressor SZ3; and (3) our framework delivers high compression and decompression speed, up to 6.7$ imes$ faster than SZ3.
Problem

Research questions and friction points this paper is trying to address.

Enabling progressive and random-access decompression without quality loss
Maintaining high compression speed while supporting streaming features
Overcoming partitioning impact to achieve state-of-the-art compression quality
Innovation

Methods, ideas, or system contributions that make the work stand out.

Streaming compression with progressive and random-access decompression
Hierarchical partitioning strategy for streaming features
High compression speed and quality comparable to SZ3
๐Ÿ”Ž Similar Papers
No similar papers found.