Transforming Wearable Data into Health Insights using Large Language Model Agents

📅 2024-06-10
🏛️ arXiv.org
📈 Citations: 18
Influential: 3
📄 PDF
🤖 AI Summary
To address the challenge of extracting personalized health insights from wearable-device data—tasks requiring complex numerical reasoning that general-purpose large language models (LLMs) inadequately support—this paper proposes Personal Health Insights Agent (PHIA), the first systematic LLM-agent framework for personal behavioral health data analysis. PHIA integrates tool invocation, code generation, external knowledge retrieval, and multi-turn collaborative reasoning. We construct a dual-dimension benchmark comprising 4,000+ questions—covering numeric factual queries and open-ended health questions. Experiments show PHIA achieves ≥84% accuracy on numeric factual questions and ≥83% on open-ended health questions. Rigorous evaluation over 650 person-hours by domain experts confirms its practical utility and reliability. Key contributions include: (1) introducing the PHIA agent paradigm; (2) releasing the first wearable-health-analysis-specific benchmark; and (3) realizing a tool-augmented multimodal reasoning architecture.

Technology Category

Application Category

📝 Abstract
Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising opportunity to enable such personalized analysis at scale. Yet, the application of LLM agents in analyzing personal health is still largely untapped. In this paper, we introduce the Personal Health Insights Agent (PHIA), an agent system that leverages state-of-the-art code generation and information retrieval tools to analyze and interpret behavioral health data from wearables. We curate two benchmark question-answering datasets of over 4000 health insights questions. Based on 650 hours of human and expert evaluation we find that PHIA can accurately address over 84% of factual numerical questions and more than 83% of crowd-sourced open-ended questions. This work has implications for advancing behavioral health across the population, potentially enabling individuals to interpret their own wearable data, and paving the way for a new era of accessible, personalized wellness regimens that are informed by data-driven insights.
Problem

Research questions and friction points this paper is trying to address.

Deriving personalized insights from wearable tracker data
Overcoming complex numerical reasoning challenges with LLMs
Enabling accessible data-driven personal health analysis
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM agents with code generation
Multistep reasoning with information retrieval
Personal Health Insights Agent system
🔎 Similar Papers
No similar papers found.