DICE: Data Influence Cascade in Decentralized Learning

📅 2025-07-09
📈 Citations: 0
Influential: 0
📄 PDF

career value

217K/year
🤖 AI Summary
In decentralized learning, quantifying data influence across sparsely connected nodes remains challenging due to localized communication topologies, impeding fair contribution assessment and incentive design. This paper introduces the first cascade modeling framework for data influence in decentralized networks, theoretically establishing that influence propagation is jointly governed by data distribution, communication topology, and loss function curvature. Building on this analysis, we propose a computationally tractable multi-hop influence approximation method enabling node-level contribution attribution. The resulting traceable influence framework provides principled foundations for partner selection and adversarial behavior detection, while also enabling verifiable, incentive-compatible mechanism design. Empirical evaluation demonstrates significant improvements in collaboration incentives and system robustness.

Technology Category

Application Category

📝 Abstract
Decentralized learning offers a promising approach to crowdsource data consumptions and computational workloads across geographically distributed compute interconnected through peer-to-peer networks, accommodating the exponentially increasing demands. However, proper incentives are still in absence, considerably discouraging participation. Our vision is that a fair incentive mechanism relies on fair attribution of contributions to participating nodes, which faces non-trivial challenges arising from the localized connections making influence ``cascade'' in a decentralized network. To overcome this, we design the first method to estimate extbf{D}ata extbf{I}nfluence extbf{C}ascad extbf{E} (DICE) in a decentralized environment. Theoretically, the framework derives tractable approximations of influence cascade over arbitrary neighbor hops, suggesting the influence cascade is determined by an interplay of data, communication topology, and the curvature of loss landscape. DICE also lays the foundations for applications including selecting suitable collaborators and identifying malicious behaviors. Project page is available at https://raiden-zhu.github.io/blog/2025/DICE/.
Problem

Research questions and friction points this paper is trying to address.

Estimating data influence cascade in decentralized learning networks
Designing fair incentive mechanisms for decentralized participation
Addressing challenges from localized connections and influence cascades
Innovation

Methods, ideas, or system contributions that make the work stand out.

Estimates data influence in decentralized networks
Uses tractable approximations for influence cascade
Determines influence by data, topology, loss curvature