🤖 AI Summary
This study investigates the temporal deployment, content characteristics, and community impact mechanisms of README and CONTRIBUTING documents in Free/Libre Open Source Software (FLOSS) projects. Method: Leveraging historical version data from 4,226 Debian-ecosystem projects, we apply text classification, temporal sequence analysis, and contributor behavior modeling to examine document evolution and its relationship with contribution activity. Contribution/Results: We establish, for the first time, causal associations between early documentation practices and subsequent contributor engagement. README files are typically created proactively during project inception and emphasize technical specifications; in contrast, CONTRIBUTING files frequently appear *after* the initial contribution peak—indicating reactive adoption—and tend to be terse, process-oriented, and weak in community onboarding guidance. These findings reveal that current early-stage documentation practices inadequately support sustainable community growth, challenging the prevailing “documentation-first” paradigm in open-source governance. The work provides empirical foundations and actionable recommendations for improving FLOSS documentation strategies.
📝 Abstract
README and CONTRIBUTING files can serve as the first point of contact for potential contributors to free/libre and open source software (FLOSS) projects. Prominent open source software organizations such as Mozilla, GitHub, and the Linux Foundation advocate that projects provide community-focused and process-oriented documentation early to foster recruitment and activity. In this paper we investigate the introduction of these documents in FLOSS projects, including whether early documentation conforms to these recommendations or explains subsequent activity. We use a novel dataset of FLOSS projects packaged by the Debian GNU/Linux distribution and conduct a quantitative analysis to examine README (n=4226) and CONTRIBUTING (n=714) files when they are first published into projects' repositories. We find that projects create minimal READMEs proactively, but often publish CONTRIBUTING files following an influx of contributions. The initial versions of these files rarely focus on community development, instead containing descriptions of project procedure for library usage or code contribution. The findings suggest that FLOSS projects do not create documentation with community-building in mind, but rather favor brevity and standardized instructions.