🤖 AI Summary
This paper addresses fault-tolerant communication in the congested clique model under adversarial edge failures: an adaptive adversary may corrupt up to αn incident edges per node per round—implying nearly linear or even quadratic total edge failures—thus surpassing the classical fixed bound of f faulty edges. To tackle this, we introduce the first generic compiler framework that integrates error-correcting codes, locally decodable codes, covering-free families, and sparse recovery sketches to construct a resilient routing mechanism. Our compiler robustly simulates any congested clique algorithm with only constant-round overhead, without increasing communication bandwidth or computational cost. It achieves Ω(n)-fold improvement in tolerable fault capacity compared to prior approaches and uniformly supports non-adaptive, adaptive, and deterministic algorithms.
📝 Abstract
Resilient computation in all-to-all-communication models has attracted tremendous attention over the years. Most of these works assume the classical faulty model which restricts the total number of corrupted edges (or vertices) by some integer fault parameter $f$. A recent work by [Bodwin, Haeupler and Parter, SODA 2024] introduced a stronger notion of fault-tolerance, in the context of graph sparsification, which restricts the degree of the failing edge set $F$, rather than its cardinality. For a subset of faulty edges $F$, the faulty-degree $mathrm{deg}(F)$ is the largest number of faults in $F$ incident to any given node. In this work, we study the communication aspects of this faulty model which allows us to handle almost linearly more edge faults (possibly quadratic), with no extra cost. Our end results are general compilers that take any Congested Clique algorithm and simulate it, in a round by round manner, in the presence of a $alpha$-Byzantine mobile adversary that controls a $alpha$-fraction of the edges incident to each node in the fully connected network. For every round $i$, the mobile adversary is allowed to select a distinct set of corrupted edges $F_i$ under the restriction that $mathrm{deg}(F_i)leq alpha n$. In the non-adaptive setting, the $F_i$ sets are selected at the beginning of the simulation, while in the adaptive setting, these edges can be chosen based on the entire history of the protocol up to round $i$. We show general compilers for the non-adaptive, adaptive, and deterministic settings. A key component of our algorithms is a new resilient routing scheme which may be of independent interest. Our approach is based on a combination of techniques, including error-correcting-code, locally decodable codes, cover-free families, and sparse recovery sketches.