Automated Mobile App Test Script Intent Generation via Image and Code Understanding

📅 2021-07-12

🏛️ arXiv.org

📈 Citations: 1

✨ Influential: 0

🤖 AI Summary

Problem: Mobile test scripts suffer from opaque intent and loose coupling with application logic, leading to high maintenance costs. Method: This paper proposes an end-to-end joint modeling approach, described as the first of its kind, that semantically aligns UI screenshots with source code. It parses interface images via OCR and object detection, identifies response code through AST analysis and method localization, and builds an operation-sequence model with cross-modal intent alignment that ties widget selectors to functional code; an encoder-decoder model then generates natural-language intent descriptions. Contribution/Results: Evaluated on a real-world app dataset, the method achieves a BLEU-4 score of 58.3%, outperforming all baselines. A user study shows that developers' script comprehension time decreases by 62% on average. The work infers intent by coordinating GUI visual elements with code semantics, with the aim of making test scripts easier to understand and maintain.

📝 Abstract

Testing is the most direct and effective technique to ensure software quality. However, developers are burdened with understanding poorly commented tests, which are common in industrial projects. Mobile applications (apps) are GUI-intensive and event-driven, so test scripts that focus on GUI interactions play a more important role in mobile app testing, alongside the test cases for the source code. Therefore, more attention should be paid to user interactions and the corresponding user event responses. However, test scripts are only loosely linked to the apps under test (AUT) through widget selectors, making it hard to map operations to the functionality code of the AUT. In such a situation, code understanding algorithms may lose efficacy if applied directly to mobile app test scripts. We present a novel approach, TestIntent, to infer the intent of mobile app test scripts. TestIntent combines GUI image understanding and code understanding technologies. The test script is transformed into an operation sequence model. For each operation, TestIntent extracts the operated widget selector and links the selector to the UI layout structure, which stores detailed information about the widgets, including coordinates, type, etc. With code understanding technologies, TestIntent can locate response methods in the source code. Afterwards, NLP algorithms are adopted to understand the code and generate descriptions. TestIntent can also locate widgets on the app GUI images and then understand the widget intent with an encoder-decoder model. By combining the results from GUI and code understanding, TestIntent generates the test intents in natural language. We also conduct an empirical experiment, and the results demonstrate the strong performance of TestIntent. A user study also shows that TestIntent can save developers time in understanding test scripts.
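
The first two steps in the abstract (turning a test script into an operation sequence, then linking each widget selector to the UI layout structure) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the Appium-style script lines, the uiautomator-style layout dump, the regular expressions, and the names `Operation`, `parse_operations`, and `link_widget`.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Optional

# Hypothetical test script in an Appium-like style (illustrative only).
SCRIPT = '''
driver.find_element_by_id("com.example:id/login_btn").click()
driver.find_element_by_xpath("//android.widget.EditText[@text='Name']").send_keys("bob")
'''

# Hypothetical UI layout dump, shaped like a uiautomator hierarchy.
LAYOUT = '''
<hierarchy>
  <node class="android.widget.Button" resource-id="com.example:id/login_btn"
        text="Login" bounds="[100,200][300,260]"/>
  <node class="android.widget.EditText" resource-id=""
        text="Name" bounds="[100,100][500,160]"/>
</hierarchy>
'''

@dataclass
class Operation:
    selector_type: str  # "id" or "xpath"
    selector: str       # the raw widget selector from the script
    action: str         # e.g. "click", "send_keys"

# Matches one selector lookup followed by one action call.
OP_PATTERN = re.compile(r'find_element_by_(id|xpath)\("(.+?)"\)\.(\w+)\(')

# Only handles the simple form //Class[@attr='value'] (an assumption).
XPATH_PATTERN = re.compile(r"//([\w.]+)\[@([\w-]+)='(.*?)'\]")

def parse_operations(script: str) -> list:
    """Turn the script text into an ordered list of operations."""
    return [Operation(*m.groups()) for m in OP_PATTERN.finditer(script)]

def link_widget(op: Operation, root: ET.Element) -> Optional[dict]:
    """Return the layout attributes of the widget the selector points at."""
    xpath = XPATH_PATTERN.fullmatch(op.selector)
    for node in root.iter("node"):
        if op.selector_type == "id":
            if node.get("resource-id") == op.selector:
                return dict(node.attrib)
        elif xpath and node.get("class") == xpath.group(1) \
                and node.get(xpath.group(2)) == xpath.group(3):
            return dict(node.attrib)
    return None

root = ET.fromstring(LAYOUT.strip())
for op in parse_operations(SCRIPT):
    widget = link_widget(op, root)
    print(op.action, "->", widget["class"], widget["bounds"])
```

Once a selector is linked to a layout node, the node's coordinates and type are available for later steps such as cropping the widget from a screenshot.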

Problem

Research questions and friction points this paper is trying to address.

Generates test script intentions for mobile apps using GUI and code understanding.

Addresses the difficulty of understanding test scripts that are only loosely linked to the app's business logic.

Utilizes deep learning and code understanding to infer user expectations from test scripts.

Innovation

Methods, ideas, or system contributions that make the work stand out.

TestIntent infers the intent of GUI test scripts.

Uses image understanding for XPath-identified widgets.

Maps ID-identified widgets to response methods.
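
The split described in the last two points, where XPath-identified widgets go to image understanding and ID-identified widgets are mapped to response methods, can be sketched as a small dispatcher. This is a hedged illustration only: the Java snippet, the `route` and `locate_response_method` names, and the regex lookup are all hypothetical, and the regex is a deliberately simple stand-in for real code analysis rather than the paper's technique.

```python
import re
from typing import Optional, Tuple

# Hypothetical Android source snippet (illustrative only).
JAVA_SOURCE = '''
public class LoginActivity extends Activity {
    void bind() {
        findViewById(R.id.login_btn).setOnClickListener(v -> submitLogin());
    }
    void submitLogin() { /* ... */ }
}
'''

def locate_response_method(resource_id: str, source: str) -> Optional[str]:
    """Find the method called by the click listener bound to a resource id.

    A regex stand-in for proper code analysis; it only recognizes the
    single lambda-listener pattern used in JAVA_SOURCE above.
    """
    short_id = resource_id.rsplit("/", 1)[-1]  # "com.example:id/x" -> "x"
    pattern = re.compile(
        r"findViewById\(R\.id\." + re.escape(short_id)
        + r"\)\.setOnClickListener\(\s*\w+\s*->\s*(\w+)\("
    )
    match = pattern.search(source)
    return match.group(1) if match else None

def route(selector_type: str, selector: str, source: str) -> Tuple[str, Optional[str]]:
    """Pick an understanding path based on how the widget was identified."""
    if selector_type == "id":
        # Code path: the id can be traced to a response method in the source.
        return ("code", locate_response_method(selector, source))
    # Image path: a real system would crop the widget from the screenshot
    # and describe it with a vision model; this sketch only marks the route.
    return ("image", None)

print(route("id", "com.example:id/login_btn", JAVA_SOURCE))
print(route("xpath", "//android.widget.EditText[@text='Name']", JAVA_SOURCE))
```

The design point is that an ID gives a stable handle into the source code, while an XPath selector only describes a position in the layout, so the widget's appearance becomes the main evidence of its purpose.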
