🤖 AI Summary
Prior studies lack empirical evidence on the clinical viability of large language models (LLMs) for behavioral weight-loss interventions in real-world settings.
Method: This study conducted the first randomized controlled trial comparing LLM-generated (GPT-series-based) behavioral guidance with human coach–delivered guidance across efficacy, credibility, and user acceptability. Multimodal evaluation employed validated questionnaires, mixed-effects modeling, qualitative thematic analysis, and a novel interpretable assessment framework.
Contribution/Results: LLM-generated guidance achieved near-human performance in information quality and perceived empathy, while significantly outperforming human guidance in conciseness and actionability. Overall user acceptability reached 82%. These findings demonstrate the clinical feasibility and high acceptability of LLMs in behavioral weight-loss support, providing empirical validation and a methodological paradigm for AI-driven, personalized health coaching.