🤖 AI Summary
This study investigates developers’ actual adherence to the Arrange-Act-Assert (AAA) pattern in unit tests, the root causes of deviations, and their willingness to adopt refactoring. Method: We conducted an empirical analysis of 12,846 test cases from 435 open-source projects, systematically identifying three AAA anti-patterns and four Arrange-block design defects. A qualitative study—combining manual annotation, code review, and GitHub issue analysis—was performed to understand underlying motivations. Contribution/Results: We find that 71.5% of test cases conform to AAA guidelines. Based on our findings, we propose 18 actionable, automation-friendly refactoring suggestions, 78% of which received positive developer adoption. Crucially, developers’ refactoring decisions are strongly driven by perceived return on investment (ROI). This work delivers the first large-scale, evidence-based AAA practice map and a practical, empirically grounded refactoring guide to enhance test maintainability.
📝 Abstract
The AAA pattern, i.e. arrange, act, and assert, provides a unified structure for unit test cases, which benefits comprehension and maintenance. However, there is little understanding regarding whether and how common real-life developers structure unit test cases following AAA in practice. In particular, are there recurring anti-patterns that deviate from the AAA structure and merit refactoring? And, if test cases follow the AAA structure, could they contain design flaws in the A blocks? If we propose refactoring to fix the design of test cases following the AAA, how do developers receive the proposals? Do they favor refactoring? If not, what are their considerations? This study presents an empirical study on 435 real-life unit test cases randomly selected from four open-source projects. Overall, the majority (71.5%) of test cases follow the AAA structure. And, we observed three recurring anti-patterns that deviate from the AAA structure, as well as four design flaws that may reside inside of the A blocks. Each issue type has its drawbacks and merits corresponding refactoring resolutions. We sent a total of 18 refactoring proposals as issue tickets for fixing these problems. We received 78% positive feedback favoring the refactoring. From the rejections, we learned that return-on-investment is a key consideration for developers. The findings provide insights for practitioners to structure unit test cases with AAA in mind, and for researchers to develop related techniques for enforcing AAA in test cases.