Can Domain Experts Rely on AI Appropriately? A Case Study on AI-Assisted Prostate Cancer MRI Diagnosis

📅 2025-02-03
🤖 AI Summary
This study investigates whether radiologists rely on AI appropriately when diagnosing prostate cancer from MRI. Method: Two experiments with the same participants compare two workflows. In Study 1, clinicians give an initial diagnosis, view the AI's prediction, and then finalize their decision. In Study 2, after a memory wash-out period, they first receive aggregated performance statistics from Study 1 (their own, the AI's, and the human-AI team's) and then see the AI's prediction before making a diagnosis. The diagnostic interface builds on existing tools for teaching prostate cancer diagnosis. Contribution/Results: Human-AI teams consistently outperform humans alone but still underperform the AI alone because of under-reliance. Performance feedback did not significantly improve team performance, although showing the AI's prediction up front nudged clinicians to follow the AI more. An ensemble of human-AI teams can outperform the AI alone, which suggests a promising direction for human-AI collaboration.

📝 Abstract
Despite the growing interest in human-AI decision making, experimental studies with domain experts remain rare, largely due to the complexity of working with domain experts and the challenges in setting up realistic experiments. In this work, we conduct an in-depth collaboration with radiologists in prostate cancer diagnosis based on MRI images. Building on existing tools for teaching prostate cancer diagnosis, we develop an interface and conduct two experiments to study how AI assistance and performance feedback shape the decision making of domain experts. In Study 1, clinicians were asked to provide an initial diagnosis (human), then view the AI's prediction, and subsequently finalize their decision (human-AI team). In Study 2 (after a memory wash-out period), the same participants first received aggregated performance statistics from Study 1, specifically their own performance, the AI's performance, and their human-AI team performance, and then directly viewed the AI's prediction before making their diagnosis (i.e., no independent initial diagnosis). These two workflows represent realistic ways that clinical AI tools might be used in practice; the second study simulates a scenario in which doctors can adjust their reliance on and trust in AI based on prior performance feedback. Our findings show that, while human-AI teams consistently outperform humans alone, they still underperform the AI due to under-reliance, similar to prior studies with crowdworkers. Providing clinicians with performance feedback did not significantly improve the performance of human-AI teams, although showing AI decisions in advance nudges people to follow AI more. Meanwhile, we observe that the ensemble of human-AI teams can outperform AI alone, suggesting promising directions for human-AI collaboration.
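The comparisons described in the abstract (human vs. AI vs. human-AI team vs. an ensemble of teams) can be illustrated with a small sketch. The abstract does not say how the ensemble is formed or how reliance is measured, so the majority vote and the "switch rate" below are assumptions chosen for illustration, and all function names are hypothetical. The numbers in the usage note are invented toy values, not results from the study.

```python
def accuracy(preds, labels):
    """Fraction of binary predictions that match the ground-truth labels."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def majority_vote(votes_per_case):
    """Combine several readers' binary decisions for each case.

    votes_per_case holds one inner list per case, with one 0/1 vote per
    reader. A case is called positive only on a strict majority, so ties
    (possible with an even number of readers) resolve to 0.
    ASSUMPTION: majority voting is one plausible ensemble rule; the
    abstract does not specify the rule used in the paper.
    """
    return [1 if 2 * sum(votes) > len(votes) else 0 for votes in votes_per_case]


def switch_rate(initial, ai, final):
    """Among cases where the initial diagnosis disagreed with the AI,
    return the fraction where the final decision matches the AI.

    ASSUMPTION: a simple reliance proxy for a Study 1 style workflow
    (initial diagnosis, then AI prediction, then final decision).
    """
    disagreements = [(a, f) for i, a, f in zip(initial, ai, final) if i != a]
    if not disagreements:
        return 0.0
    return sum(f == a for a, f in disagreements) / len(disagreements)
```

As a usage example with made-up inputs: given labels `[1, 0, 1, 1, 0, 0]` and three readers' final decisions `[1, 0, 1, 1, 0, 1]`, `[1, 1, 1, 1, 0, 0]`, and `[0, 0, 1, 1, 0, 0]`, transposing the readers into per-case votes with `list(zip(*readers))` and passing the result to `majority_vote` gives one combined decision per case, which can then be scored with `accuracy` next to each individual reader and the AI.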
Problem

Research questions and friction points this paper is trying to address.

AI-assisted prostate cancer MRI diagnosis
impact of AI on domain expert decisions
human-AI collaboration effectiveness
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI-assisted MRI diagnosis
Human-AI team workflows
Performance feedback integration
Authors
Chacha Chen, University of Chicago (Human-centered ML)
Han Liu, University of Chicago
Jiamin Yang, Toyota Technological Institute at Chicago
Benjamin M. Mervak, University of Michigan
B. Kalaycıoğlu, University of Chicago
Grace Lee, University of Chicago
Emre Cakmakli, Bagcilar Training and Research Hospital
Matteo Bonatti, MD, Bolzano Central Hospital (Radiology, MRI, DECT)
Sridhar Pudu, Radiology Associates of North Texas
Osman Kahraman, İstanbul Medipol University Hospital
Gül Gizem Pamuk, Bagcilar Training and Research Hospital
A. Oto, University of Chicago
Aritrick Chatterjee, University of Chicago
Chenhao Tan, University of Chicago (Human-centered AI, Communication & Intelligence, Scientific Discovery, AI alignment, AI governance)