🤖 AI Summary
This work addresses the lack of tight and practical time complexity bounds in bandwidth-constrained decentralized non-convex stochastic optimization. By introducing graph-theoretic concepts—specifically the min-cut/max-flow quantity, Gomory-Hu trees, and Steiner tree packing—we develop a bandwidth-aware communication-computation co-analysis framework and derive a near-optimal lower bound on time complexity. Building on this foundation, we propose two algorithms: Grace SGD for homogeneous settings and Leon SGD for heterogeneous environments. Both algorithms achieve time complexity that is optimal up to logarithmic factors in their respective scenarios, significantly outperforming existing methods.
📝 Abstract
We consider a realistic decentralized setup with bandwidth-constrained communication and derive optimal time complexities for non-convex stochastic parallel and asynchronous optimization (up to logarithmic factors). We develop the corresponding methods, Grace SGD and Leon SGD, for both homogeneous and heterogeneous settings. Unlike previous work, our optimal bounds are characterized in terms of min-cut/max-flow quantities and rely on tools from Gomory-Hu trees and Steiner Tree Packing problems, providing tighter and more practical complexities.