🤖 AI Summary
Medical large language models (LLMs) suffer from hallucinations due to incomplete knowledge understanding. Method: This paper proposes the Source Planning Optimization (SPO) framework, which constructs MedOmniKB—a multi-source, heterogeneous medical knowledge base—and formalizes multi-source retrieval as an explicit source planning task. SPO innovatively decouples expert-model-based planning exploration from lightweight student-model-based alignment learning, employing positive/negative sample-driven planning alignment training to mitigate planning failure caused by source-content semantic mismatch. The method integrates heterogeneous multi-source knowledge, models source-selection logic, and enables expert-student co-optimization. Results: Experiments demonstrate that the lightweight student model achieves state-of-the-art performance across multiple medical RAG benchmarks, significantly suppressing hallucinations and enhancing the accuracy and reliability of clinical reasoning.
📝 Abstract
Large language models (LLMs) hold promise for addressing healthcare challenges but often generate hallucinations due to limited integration of medical knowledge. Incorporating external medical knowledge is therefore critical, especially considering the breadth and complexity of medical content, which necessitates effective multi-source knowledge acquisition. We address this challenge by framing it as a source planning problem, where the task is to formulate context-appropriate queries tailored to the attributes of diverse knowledge sources. Existing approaches either overlook source planning or fail to achieve it effectively due to misalignment between the model's expectation of the sources and their actual content. To bridge this gap, we present MedOmniKB, a comprehensive repository comprising multigenre and multi-structured medical knowledge sources. Leveraging these sources, we propose the Source Planning Optimisation (SPO) method, which enhances multi-source utilisation through explicit planning optimisation. Our approach involves enabling an expert model to explore and evaluate potential plans while training a smaller model to learn source alignment using positive and negative planning samples. Experimental results demonstrate that our method substantially improves multi-source planning performance, enabling the optimised small model to achieve state-of-the-art results in leveraging diverse medical knowledge sources.