🤖 AI Summary
This study investigates real-user perceptions, challenges, and expectations regarding functional dimensions—AI performance, content quality, and policy moderation—in generative AI applications (Gen-AI apps). Method: Leveraging 676,000 user reviews from 173 Gen-AI apps on Google Play, we propose the SARA framework (Screening, Acquisition, Refinement, Analysis) and introduce a novel LLM-driven few-shot (5-shot) thematic extraction method, achieving 91% precision in identifying themes from unstructured text. Temporal trend modeling and empirical analysis uncover dynamic shifts in user concerns and generational differences. Contribution/Results: We identify the ten most frequent thematic issues and their evolving sentiment trajectories. Based on findings, we derive 12 actionable, evidence-based recommendations spanning functional optimization, ethical governance, and human-AI interaction design—providing practitioners and researchers with a data-driven roadmap for responsible Gen-AI development.
📝 Abstract
The release of ChatGPT in 2022 triggered a rapid surge in generative artificial intelligence mobile apps (i.e., Gen-AI apps). Despite widespread adoption, little is known about how end users perceive and evaluate these Gen-AI functionalities in practice. In this work, we conduct a user-centered analysis of 676,066 reviews from 173 Gen-AI apps on the Google Play Store. We introduce a four-phase methodology, SARA (Selection, Acquisition, Refinement, and Analysis), that enables the systematic extraction of user insights using prompt-based LLM techniques. First, we demonstrate the reliability of LLMs in topic extraction, achieving 91% accuracy through five-shot prompting and non-informative review filtering. Then, we apply this method to the informative reviews, identify the top 10 user-discussed topics (e.g., AI Performance, Content Quality, and Content Policy&Censorship) and analyze the key challenges and emerging opportunities. Finally, we examine how these topics evolve over time, offering insight into shifting user expectations and engagement patterns with Gen-AI apps. Based on our findings and observations, we present actionable implications for developers and researchers.