🤖 AI Summary
This study investigates the burstiness of collective developer behavior in open-source communities and its underlying social diffusion mechanisms to model and predict long-term community sustainability. Method: We propose an activity-cascade identification method based on co-editing temporal networks to detect bursty code-commit events and uncover their propagation patterns across developer collaboration relationships. Contribution/Results: Empirical analysis across 50 mainstream open-source projects reveals statistically significant activity cascades in over half of them. These cascades effectively capture collaborative dynamics among developers and substantially improve prediction of developer attrition—yielding an average AUC gain of 12.3%. To our knowledge, this is the first work to formally link activity-cascade modeling with community sustainability, establishing a novel paradigm and a scalable analytical framework for understanding the socio-dynamics of open-source collaboration.
📝 Abstract
Understanding the collective social behavior of software developers is crucial to model and predict the long-term dynamics and sustainability of Open Source Software (OSS) communities. To this end, we analyze temporal activity patterns of developers, revealing an inherently ``bursty'' nature of commit contributions. To investigate the social mechanisms behind this phenomenon, we adopt a network-based modelling framework that captures developer interactions through co-editing networks. Our framework models social interactions, where a developer editing the code of other developers triggers accelerated activity among collaborators. Using a large data set on 50 major OSS communities, we further develop a method that identifies activity cascades, i.e. the propagation of developer activity in the underlying co-editing network. Our results suggest that activity cascades are a statistically significant phenomenon in more than half of the studied projects. We further show that our insights can be used to develop a simple yet practical churn prediction method that forecasts which developers are likely to leave a project. Our work sheds light on the emergent collective social dynamics in OSS communities and highlights the importance of activity cascades to understand developer churn and retention in collaborative software projects.