🤖 AI Summary
Radiotherapy treatment planning suffers from high subjectivity, lengthy manual workflows, and substantial inter-physician variability; moreover, conventional AI approaches pose patient data privacy risks. This paper introduces DOLA (Dose Optimization Language Agent), the first locally deployed large language model (LLM) agent embedded directly into a commercial radiotherapy planning system, enabling fully automated, privacy-preserving dose optimization. We propose a novel architecture tightly coupling a local LLM with clinical planning software and pioneer the synergistic integration of retrieval-augmented generation (RAG) and reinforcement learning (RL) for autonomous treatment plan optimization—all executed entirely offline. Experiments demonstrate that the 70B-parameter LLM achieves a 16.4% higher optimization score than the 8B variant; the RAG+RL strategy accelerates convergence by 19.8% over the RAG-free baseline; and an optimal sampling temperature of 0.4 is identified, confirming clinical feasibility.
📝 Abstract
Radiotherapy treatment planning is a complex and time-intensive process, often impacted by inter-planner variability and subjective decision-making. To address these challenges, we introduce Dose Optimization Language Agent (DOLA), an autonomous large language model (LLM)-based agent designed for optimizing radiotherapy treatment plans while rigorously protecting patient privacy. DOLA integrates the LLaMa3.1 LLM directly with a commercial treatment planning system, utilizing chain-of-thought prompting, retrieval-augmented generation (RAG), and reinforcement learning (RL). Operating entirely within secure local infrastructure, this agent eliminates external data sharing. We evaluated DOLA using a retrospective cohort of 18 prostate cancer patients prescribed 60 Gy in 20 fractions, comparing model sizes (8 billion vs. 70 billion parameters) and optimization strategies (No-RAG, RAG, and RAG+RL) over 10 planning iterations. The 70B model demonstrated significantly improved performance, achieving approximately 16.4% higher final scores than the 8B model. The RAG approach outperformed the No-RAG baseline by 19.8%, and incorporating RL accelerated convergence, highlighting the synergy of retrieval-based memory and reinforcement learning. Optimal temperature hyperparameter analysis identified 0.4 as providing the best balance between exploration and exploitation. This proof of concept study represents the first successful deployment of locally hosted LLM agents for autonomous optimization of treatment plans within a commercial radiotherapy planning system. By extending human-machine interaction through interpretable natural language reasoning, DOLA offers a scalable and privacy-conscious framework, with significant potential for clinical implementation and workflow improvement.