🤖 AI Summary
This work investigates optimal repair of $(k+2,k)$ MDS array codes—i.e., MDS codes with two parity nodes and subpacketization level $alpha = 2$—under single-node failure. First, it rigorously derives tight theoretical lower bounds on both repair bandwidth and I/O cost for this setting. Second, it proposes two explicit code constructions: one achieving the minimum repair bandwidth bound, and another attaining the minimum I/O bound; remarkably, a unified construction simultaneously meets both bounds—the first such result. The approach integrates regenerating code theory, algebraic structure of MDS codes, and combinatorial design, employing fine-grained subpacketization parameterization and carefully engineered parity-check matrices to minimize network transmission and disk reads during repair. Experimental evaluation demonstrates that the proposed schemes reduce repair bandwidth by approximately 30% and I/O overhead by about 40% compared to state-of-the-art alternatives, establishing a new paradigm for theoretically optimal and practically implementable repair coding in highly reliable distributed storage systems.
📝 Abstract
Maximum distance separable (MDS) codes are widely used in distributed storage systems as they provide optimal fault tolerance for a given amount of storage overhead.
The seminal work of Dimakis~emph{et al.} first established a lower bound on the repair bandwidth for a single failed node of MDS codes, known as the emph{cut-set bound}. MDS codes that achieve this bound are called minimum storage regenerating (MSR) codes. Numerous constructions and theoretical analyses of MSR codes reveal that they typically require exponentially large sub-packetization levels, leading to significant disk I/O overhead. To mitigate this issue, many studies explore the trade-offs between the sub-packetization level and repair bandwidth, achieving reduced sub-packetization at the cost of suboptimal repair bandwidth. Despite these advances, the fundamental question of determining the minimum repair bandwidth for a single failure of MDS codes with fixed sub-packetization remains open.
In this paper, we address this challenge for the case of two parity nodes ($n-k=2$) and sub-packetization $ell=2$. We derive tight lower bounds on both the minimum repair bandwidth and the minimum I/O overhead. Furthermore, we present two explicit MDS array code constructions that achieve these bounds, respectively, offering practical code designs with provable repair efficiency.