SFT-TA: Supervised Fine-Tuned Agents in Multi-Agent LLMs for Automated Inductive Thematic Analysis

📅 2025-09-21
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Manual thematic analysis of clinical interview transcripts is time-consuming and inefficient, while existing large language model (LLM)-based automated approaches exhibit insufficient agreement with human annotations. To address these limitations, this paper proposes a supervised fine-tuning (SFT)-driven multi-agent LLM framework specifically designed for inductive thematic analysis of clinical interview texts. The framework explicitly assigns distinct roles—such as coder, integrator, and verifier—to specialized SFT agents, embedding them within a collaborative workflow that enhances semantic understanding and enables iterative validation. Experimental results demonstrate that our method significantly outperforms GPT-4o and mainstream single-agent or unsupervised baselines in thematic consistency metrics, including Cohen’s κ and F1-score. These findings validate the efficacy of SFT-powered multi-agent architectures in improving both the reliability and interpretability of automated thematic analysis.

Technology Category

Application Category

📝 Abstract
Thematic Analysis (TA) is a widely used qualitative method that provides a structured yet flexible framework for identifying and reporting patterns in clinical interview transcripts. However, manual thematic analysis is time-consuming and limits scalability. Recent advances in LLMs offer a pathway to automate thematic analysis, but alignment with human results remains limited. To address these limitations, we propose SFT-TA, an automated thematic analysis framework that embeds supervised fine-tuned (SFT) agents within a multi-agent system. Our framework outperforms existing frameworks and the gpt-4o baseline in alignment with human reference themes. We observed that SFT agents alone may underperform, but achieve better results than the baseline when embedded within a multi-agent system. Our results highlight that embedding SFT agents in specific roles within a multi-agent system is a promising pathway to improve alignment with desired outputs for thematic analysis.
Problem

Research questions and friction points this paper is trying to address.

Manual thematic analysis of clinical interviews is time-consuming and limits scalability
Existing LLM approaches for automated thematic analysis lack alignment with human results
Individual supervised fine-tuned agents may underperform without proper system integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses supervised fine-tuned agents in multi-agent system
Embeds SFT agents in specific roles for thematic analysis
Combines SFT with multi-agent architecture for better alignment
🔎 Similar Papers
No similar papers found.
S
Seungjun Yi
Department of Biomedical Engineering, University of Texas at Austin, Austin, TX, USA
J
Joakim Nguyen
Department of Computer Science, University of Texas at Austin, Austin, TX, USA
Huimin Xu
Huimin Xu
Ph.D. Student, School of Information, University of Texas at Austin
computational social sciencescience of team science
T
Terence Lim
College of Natural Sciences, University of Texas at Austin, Austin, TX, USA Graphen, Inc., New York, NY, USA
J
Joseph Skrovan
School of Information, University of Texas at Austin, Austin, TX, USA
M
Mehak Beri
School of Information, University of Texas at Austin, Austin, TX, USA
Hitakshi Modi
Hitakshi Modi
The University of Texas at Austin
Congenital Heart DiseaseQualitative Research
Andrew Well
Andrew Well
Research Assistant Professor of Cardiac Surgery
Liu Leqi
Liu Leqi
UT Austin
Artificial IntelligenceMachine Learning
M
Mia Markey
Department of Biomedical Engineering, University of Texas at Austin, Austin, TX, USA
Ying Ding
Ying Ding
Bill & Lewis Suit Professor, School of Information, Dell Med, University of Texas at Austin
AI in HealthKnowledge GraphScience of Science