🤖 AI Summary
Broadcast communication in large-scale systems (e.g., supercomputers) is constrained by network topology, bandwidth bottlenecks, and synchronization overhead. To address these challenges, this paper proposes the Balanced Saturated Broadcast (BBS) algorithm. BBS employs a topology-adaptive, precisely periodic scheduling scheme coupled with persistent node participation to construct a repeatable, progressive broadcast framework—achieving high node utilization while significantly reducing synchronization costs. Simulation results across representative topologies—including Mesh, Torus, and Fat-Tree—demonstrate that BBS reduces average broadcast latency by 23%–41% compared to standard algorithms such as MPI_Bcast and Binomial Tree, with superior scalability. Its core innovation lies in explicitly modeling “communication saturation” as an optimization objective for the first time, enabling topology-agnostic, efficient broadcast. This yields strong generality and robustness without requiring topology-specific tuning.
📝 Abstract
We present Broadcast by Balanced Saturation (BBS), a general broadcast algorithm designed to optimize communication efficiency across diverse network topologies. BBS maximizes node utilization, addressing challenges in broadcast operations such as topology constraints, bandwidth limitations, and synchronization overhead, particularly in large-scale systems like supercomputers. The algorithm ensures sustained activity with nodes throughout the broadcast, thereby enhancing data propagation and significantly reducing latency. Through a precise communication cycle, BBS provides a repeatable, streamlined, stepwise broadcasting framework. Simulation results across various topologies demonstrate that the BBS algorithm consistently outperforms common general broadcast algorithms, often by a substantial margin. These findings suggest that BBS is a versatile and robust framework with the potential to redefine broadcast strategies across network topologies.