LDMDroid: Leveraging LLMs for Detecting Data Manipulation Errors in Android Apps

๐Ÿ“… 2026-04-01
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenge of detecting data manipulation errors (DMEs) in Android applications, which are difficult for existing automated testing tools to uncover due to their dependence on specific UI interaction sequences and manifestation as logical faults. To tackle this, the authors propose LDMDroid, the first automated testing framework that integrates large language models (LLMs), state-aware UI exploration, and visual verification. LDMDroid leverages a state-aware mechanism to guide the LLM in generating high-coverage UI event sequences that effectively trigger data manipulation functionalities, while employing visual features to detect changes in data statesโ€”enabling precise, script-free error identification. Evaluation on 24 real-world apps demonstrates that LDMDroid significantly improves functional triggering success rates and uncovers 17 previously unknown bugs, 14 of which have been confirmed by developers and 11 already fixed.
๐Ÿ“ Abstract
Android apps rely heavily on Data Manipulation Functionalities (DMFs) for handling app-specific data through CRUDS operations, making their correctness vital for reliability. However, detecting Data Manipulation Errors (DMEs) is challenging due to their dependence on specific UI interaction sequences and manifestation as logic bugs. Existing automated UI testing tools face two primary challenges: insufficient UI path coverage for adequate DMF triggering and reliance on manually written test scripts. To address these issues, we propose an automated approach using Large Language Models (LLMs) for DME detection. We developed LDMDroid, an automated UI testing framework for Android apps. LDMDroid enhances DMF triggering success by guiding LLMs through a state-aware process for generating UI event sequences. It also uses visual features to identify changes in data states, improving DME verification accuracy. We evaluated LDMDroid on 24 real-world Android apps, demonstrating improved DMF triggering success rates compared to baselines. LDMDroid discovered 17 unique bugs, with 14 confirmed by developers and 11 fixed. The tool is publicly available at https://github.com/runnnnnner200/LDMDroid.
Problem

Research questions and friction points this paper is trying to address.

Data Manipulation Errors
Android Apps
UI Testing
Logic Bugs
Automated Testing
Innovation

Methods, ideas, or system contributions that make the work stand out.

Large Language Models
Data Manipulation Errors
Automated UI Testing
State-aware Generation
Visual State Verification
๐Ÿ”Ž Similar Papers
No similar papers found.
X
Xiangyang Xiao
Xiamen University, China
H
Huaxun Huang
Xiamen University, China
Rongxin Wu
Rongxin Wu
Xiamen University
software securityprogram analysissoftware engineering