🤖 AI Summary
This paper studies the graph edge partitioning problem: partitioning the edges of a graph into $k$ nearly equal-sized subsets to minimize the vertex replication factor—the number of vertices assigned to multiple partitions. To overcome the limited expressiveness of conventional symmetric intersecting families, we introduce the “balanced intersecting system,” a novel combinatorial structure that relaxes symmetry constraints and improves adaptability to arbitrary $k$. Leveraging tools from combinatorial design, intersecting family theory, and asymptotic analysis, we construct an edge partition achieving a replication factor of $sqrt{n}(1+o(1))$, matching the theoretical lower bound for this problem. Our method ensures near-perfect load balancing across partitions and is universally applicable to any $k$, thereby significantly advancing both the theoretical foundations and practical applicability of graph partitioning in distributed graph processing.
📝 Abstract
We study the problem of edge-centric graph partitioning, where the goal is to distribute the edges of a graph among several almost equally sized partitions in order to minimize the replication factor of vertices. We build a partitioning algorithm that guarantees near-perfect balance and replication factor $sqrt{n}(1 + o(1))$ for arbitrary number of partitions $n$. This asymptotical bound cannot be improved. To do so, we introduce balanced intersecting systems. It is a construction similar to symmetric intersecting families, but the symmetry condition is replaced by a weaker balance condition. We build an algorithm that uses such a system, and prove that by using a system of optimal cardinality we achieve exactly optimal guarantees for the replication factor. Finally, we build balanced intersecting systems with asymptotically optimal cardinality.