🤖 AI Summary
This work proposes DeXOR, a novel compression framework addressing the inefficiency of existing methods in handling high-precision or non-smooth streaming floating-point data. DeXOR introduces XOR-based encoding directly in decimal space for the first time, leveraging longest common prefix and suffix extraction to eliminate redundancy. The framework integrates several optimization strategies—including error-tolerant rounding, scaled truncation, exponent outlier handling, and bit-level management—to enhance compression while preserving decompression accuracy. Evaluated on 22 real-world datasets, DeXOR achieves an average 15% improvement in compression ratio and a 20% increase in decompression speed, demonstrating robustness and efficiency even under extreme conditions.
📝 Abstract
With streaming floating-point numbers being increasingly prevalent, effective and efficient compression of such data is critical. Compression schemes must be able to exploit the similarity, or smoothness, of consecutive numbers and must be able to contend with extreme conditions, such as high-precision values or the absence of smoothness. We present DeXOR, a novel framework that enables decimal XOR procedure to encode decimal-space longest common prefixes and suffixes, achieving optimal prefix reuse and effective redundancy elimination. To ensure accurate and low-cost decompression even with binary-decimal conversion errors, DeXOR incorporates 1) scaled truncation with error-tolerant rounding and 2) different bit management strategies optimized for decimal XOR. Additionally, a robust exception handler enhances stability by managing floating-point exponents, maintaining high compression ratios under extreme conditions. In evaluations across 22 datasets, DeXOR surpasses state-of-the-art schemes, achieving a 15% higher compression ratio and a 20% faster decompression speed while maintaining a competitive compression speed. DeXOR also offers scalability under varying conditions and exhibits robustness in extreme scenarios where other schemes fail.