Tight Additive Sensitivity on LZ-style Compressors and String Attractors

📅 2025-06-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper investigates the worst-case additive sensitivity of string repetitiveness measures under single-character edits. Specifically, it analyzes how γ (minimum attractor size), b (minimum bidirectional scheme size), and three Lempel–Ziv compression measures—LZSS, LZ-End, and LZ78—change when one character is inserted, deleted, or substituted. The work establishes the first tight bounds: Θ(√n) for γ and b; Θ(n²/³) for LZSS and LZ-End; and Θ(n) for LZ78. Methodologically, it combines combinatorial construction with compression-theoretic analysis, achieving matching upper and lower bounds via explicit adversarial examples and rigorous upper-bound proofs. The results fully characterize the edit sensitivity of these fundamental repetitiveness measures, resolving several long-standing open problems. Moreover, this work provides the first systematic theoretical framework for analyzing the robustness of string compression under small perturbations.

Technology Category

Application Category

📝 Abstract
The worst-case additive sensitivity of a string repetitiveness measure $c$ is defined to be the largest difference between $c(w)$ and $c(w')$, where $w$ is a string of length $n$ and $w'$ is a string that can be obtained by performing a single-character edit operation on $w$. We present $O(sqrt{n})$ upper bounds for the worst-case additive sensitivity of the smallest string attractor size $γ$ and the smallest bidirectional scheme size $b$, which match the known lower bounds $Ω(sqrt{n})$ for $γ$ and $b$ [Akagi et al. 2023]. Further, we present matching upper and lower bounds for the worst-case additive sensitivity of the Lempel-Ziv family - $Θ(n^{frac{2}{3}})$ for LZSS and LZ-End, and $Θ(n)$ for LZ78.
Problem

Research questions and friction points this paper is trying to address.

Study worst-case additive sensitivity of string repetitiveness measures
Establish upper bounds for sensitivity of string attractor size
Derive bounds for Lempel-Ziv family sensitivity variations
Innovation

Methods, ideas, or system contributions that make the work stand out.

O(√n) bounds for string attractor sensitivity
Matching bounds for bidirectional scheme size
Θ(n^(2/3)) and Θ(n) for LZSS and LZ78
🔎 Similar Papers
No similar papers found.
Y
Yuto Fujie
Joint Graduate School of Mathematics for Innovation, Kyushu University, Japan
H
Hiroki Shibata
Joint Graduate School of Mathematics for Innovation, Kyushu University, Japan
Yuto Nakashima
Yuto Nakashima
Kyushu University
Computer Science
Shunsuke Inenaga
Shunsuke Inenaga
Professor, Department of Informatics, Kyushu University
Algorithms and Data StructuresString AlgorithmsCompressionCombinatorics on Words