When the Domain Expert Has No Time and the LLM Developer Has No Clinical Expertise: Real-World Lessons from LLM Co-Design in a Safety-Net Hospital

📅 2025-08-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In resource-constrained settings, LLM developers lack clinical expertise while domain experts face severe time constraints, hindering effective co-design for clinical NLP applications. Method: This paper proposes a novel co-design framework for generating patient social needs summaries in safety-net hospitals. It decomposes the summarization task into semantically distinct, independently optimizable attributes and employs a multi-level cascaded LLM inference pipeline with cross-validation to align developer and clinician perspectives efficiently. Contribution/Results: The attribute-based modeling lowers the clinical knowledge barrier for developers, while cascaded validation ensures output accuracy, completeness, and traceability. Deployed in real clinical workflows, the system achieves a 3.2× improvement in social needs information extraction efficiency over baselines, satisfying both clinical utility and lightweight deployment requirements. This work establishes a reusable methodological paradigm for LLM co-design in underserved communities.

Technology Category

Application Category

📝 Abstract
Large language models (LLMs) have the potential to address social and behavioral determinants of health by transforming labor intensive workflows in resource-constrained settings. Creating LLM-based applications that serve the needs of underserved communities requires a deep understanding of their local context, but it is often the case that neither LLMs nor their developers possess this local expertise, and the experts in these communities often face severe time/resource constraints. This creates a disconnect: how can one engage in meaningful co-design of an LLM-based application for an under-resourced community when the communication channel between the LLM developer and domain expert is constrained? We explored this question through a real-world case study, in which our data science team sought to partner with social workers at a safety net hospital to build an LLM application that summarizes patients' social needs. Whereas prior works focus on the challenge of prompt tuning, we found that the most critical challenge in this setting is the careful and precise specification of what information to surface to providers so that the LLM application is accurate, comprehensive, and verifiable. Here we present a novel co-design framework for settings with limited access to domain experts, in which the summary generation task is first decomposed into individually-optimizable attributes and then each attribute is efficiently refined and validated through a multi-tier cascading approach.
Problem

Research questions and friction points this paper is trying to address.

Bridging gap between LLM developers and time-constrained domain experts
Ensuring LLM outputs are accurate, comprehensive, and verifiable for underserved communities
Developing co-design framework for resource-limited settings with expert scarcity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Decompose summary task into optimizable attributes
Multi-tier cascading refinement and validation
Co-design framework for limited expert access
Avni Kothari
Avni Kothari
University of California, San Francisco
Machine LearningFairnessInterpretabilityHealthcare
Patrick Vossler
Patrick Vossler
University of California, San Francisco
StatisticsCausal InferenceMachine LearningHigh-dimensional statisticsfeature selection
J
Jean Digitale
University of California, San Francisco
M
Mohammad Forouzannia
University of California, San Francisco
E
Elise Rosenberg
Zuckerberg San Francisco General Hospital
M
Michele Lee
Zuckerberg San Francisco General Hospital
J
Jennee Bryant
Zuckerberg San Francisco General Hospital
M
Melanie Molina
University of California, San Francisco
J
James Marks
University of California, San Francisco
L
Lucas Zier
University of California, San Francisco
Jean Feng
Jean Feng
Department of Epidemiology and Biostatistics, University of California, San Francisco
machine learningstatisticsbiostatistics