๐ค AI Summary
Prior studies lack large-scale longitudinal analysis of code obfuscation practices in Android apps, suffer from low detection accuracy, and neglect temporal dynamics. Method: We propose a multi-dimensional static analysis classifier integrating APK decompilation, obfuscation feature extraction, and time-series modeling to analyze the evolution of obfuscation usage across 500,000+ Android apps on Google Play from 2016 to 2023. Contribution/Results: We observe a 13% overall increase in obfuscation adoption, with ProGuard and Allatori dominating tool usage. High-ranking apps and casino-themed applications exhibit significantly higher obfuscation rates, revealing strong domain-specific preferences. This work establishes the most comprehensive empirical benchmark of obfuscation adoption trends to date and provides the first systematic characterization of tool evolution, distribution heterogeneity, and industry practice biasesโoffering actionable evidence for mobile security defense mechanisms and regulatory policy formulation.
๐ Abstract
The Android ecosystem is vulnerable to issues such as app repackaging, counterfeiting, and piracy, threatening both developers and users. To mitigate these risks, developers often employ code obfuscation techniques. However, while effective in protecting legitimate applications, obfuscation also hinders security investigations as it is often exploited for malicious purposes. As such, it is important to understand code obfuscation practices in Android apps. In this paper, we analyze over 500,000 Android APKs from Google Play, spanning an eight-year period, to investigate the evolution and prevalence of code obfuscation techniques. First, we propose a set of classifiers to detect obfuscated code, tools, and techniques and then conduct a longitudinal analysis to identify trends. Our results show a 13% increase in obfuscation from 2016 to 2023, with ProGuard and Allatori as the most commonly used tools. We also show that obfuscation is more prevalent in top-ranked apps and gaming genres such as Casino apps. To our knowledge, this is the first large-scale study of obfuscation adoption in the Google Play Store, providing insights for developers and security analysts.