Beyond the Tip of the Iceberg: Understanding SATD in Dockerfiles through the Lens of Co-evolution

📅 2026-05-20
📈 Citations: 0
Influential: 0
📄 PDF

career value

190K/year
🤖 AI Summary
This study addresses the limitation of existing research on self-admitted technical debt (SATD) in Dockerfiles, which typically focuses on individual Dockerfiles in isolation and overlooks their co-evolution with other artifacts such as source code, thereby hindering a comprehensive understanding of SATD origins and resolution mechanisms. To bridge this gap, this work pioneers a multi-artifact co-evolution perspective, integrating quantitative methods (event tracing and statistical testing) with qualitative analysis (open and axial coding) to model Dockerfiles and their associated source code across large-scale projects. The findings reveal that 27% of SATD introductions and 40% of repayments involve non-Dockerfile artifacts; coupled SATD is generally resolved faster (p = 0.0201), though missing functionality persists longer; external dependency issues are the primary trigger for SATD introduction, while architectural refactoring serves as a critical enabler for repayment.
📝 Abstract
Dockerfiles enable the creation of portable container-based execution environments for the application code, and have become an important part of the modern software development process. As Dockerfiles are a form of Infrastructure-as-Code (IaC), they can include temporary workarounds and other suboptimal implementations, leading to the accrual of technical debt that affects their reliability, security, and maintainability in the future. Prior work characterized self-admitted technical debt (SATD) in Dockerfile comments and the surrounding file chunks. This single-file view is incomplete since source code evolution involves changes across different types of software artifacts such as production, test, build, and other configuration files. Thus, we address this gap by studying SATD events in Dockerfiles alongside the related source code. We find that approximately 27% of admission events and 40% of repayment events are coupled to non-Dockerfile artifacts, and coupling sources are subtype-specific. We also observed that coupled SATD in general are repaid significantly faster overall (p = 0.0201), while coupled SATD regarding missing functionalities persists longer than its isolated counterparts; Lastly, we conducted open and axial coding of coupled SATD events, and we observe that external dependency issues, more particularly regarding unreleased upstream packages and bug fixes, are the most common cause of admission triggers in the source code; we also observe that architectural refactoring is the most common prerequisite for the repayment of SATD in Dockerfiles. These findings indicate that both practitioners (e.g. developers and project managers) and SATD researchers should integrate the source code-side co-evolution, rather than the single-file view, as the primary unit of analysis.
Problem

Research questions and friction points this paper is trying to address.

Self-Admitted Technical Debt
Dockerfiles
Co-evolution
Infrastructure-as-Code
Software Artifacts
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-Admitted Technical Debt
Dockerfiles
Co-evolution
Infrastructure-as-Code
Technical Debt Repayment
🔎 Similar Papers