🤖 AI Summary
The accountability, sustainability, and robustness of open-source projects remain poorly understood as they transition from founder-led governance to community-driven models.
Method: Leveraging GOVERNANCE.md files from 637 GitHub repositories, we develop a scalable semantic parsing pipeline that systematically traces governance evolution—integrating version-control analysis, extraction of institutional roles/behaviors/permissions, and clustering-based modeling.
Contribution/Results: We find governance maturation does not stem from discursive shifts but from progressive responsibility layering and refinement: persistent role and behavioral differentiation, increasing clarity of oversight functions, strengthened ecosystem-level collaboration, and growing regulatory balance. This work establishes the first large-scale empirical framework and reusable methodology for studying governance evolution in open-source digital public infrastructure.
📝 Abstract
Open digital public infrastructure needs community management to ensure accountability, sustainability, and robustness. Yet open-source projects often rely on centralized decision-making, and the determinants of successful community management remain unclear. We analyze 637 GitHub repositories to trace transitions from founder-led to shared governance. Specifically, we document trajectories to community governance by extracting institutional roles, actions, and deontic cues from version-controlled project constitutions GOVERNANCE.md. With a semantic parsing pipeline, we cluster elements into broader role and action types. We find roles and actions grow, and regulation becomes more balanced, reflecting increases in governance scope and differentiation over time. Rather than shifting tone, communities grow by layering and refining responsibilities. As transitions to community management mature, projects increasingly regulate ecosystem-level relationships and add definition to project oversight roles. Overall, this work offers a scalable pipeline for tracking the growth and development of community governance regimes from open-source software's familiar default of founder-ownership.