Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction

📅 2026-02-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Legal sentencing prediction poses a significant challenge due to the need to integrate fine-grained objective knowledge with flexible subjective reasoning. To address this, this work proposes the MSR² framework, which uniquely combines multi-source information retrieval with the reasoning capabilities of large language models and introduces a process-level reward mechanism optimized via reinforcement learning to refine intermediate reasoning steps. Evaluated on two real-world sentencing datasets, the approach substantially improves both prediction accuracy and reasoning interpretability. The core innovation lies in the synergistic design of multi-source retrieval and process-level supervision, establishing a new paradigm for legal AI that achieves both high performance and transparency.

Technology Category

Application Category

📝 Abstract
Legal judgment prediction (LJP) aims to predict judicial outcomes from case facts and typically includes law article, charge, and sentencing prediction. While recent methods perform well on the first two subtasks, legal sentencing prediction (LSP) remains difficult due to its need for fine-grained objective knowledge and flexible subjective reasoning. To address these limitations, we propose $MSR^2$, a framework that integrates multi-source retrieval and reasoning in LLMs with reinforcement learning. $MSR^2$ enables LLMs to perform multi-source retrieval based on reasoning needs and applies a process-level reward to guide intermediate subjective reasoning steps. Experiments on two real-world datasets show that $MSR^2$ improves both accuracy and interpretability in LSP, providing a promising step toward practical legal AI. Our code is available at https://anonymous.4open.science/r/MSR2-FC3B.
Problem

Research questions and friction points this paper is trying to address.

Legal Sentencing Prediction
Legal Judgment Prediction
Subjective Reasoning
Objective Knowledge
Multi-Source Retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-source retrieval
legal sentencing prediction
large language models
reinforcement learning
reasoning
Junjie Chen
Junjie Chen
Tsinghua University
Information RetrievalNatural Language ProcessingAutomatic Evaluation of LLM
Haitao Li
Haitao Li
TsingHua University
Information Retrieval
Q
Qilei Zhang
Quan Cheng Laboratory, Beijing, China
Z
Zhenghua Li
DCST, Tsinghua University, Beijing, China
Ya Zhang
Ya Zhang
Shanghai Jiao Tong University
Machine learningComputer visionMedical Imaging
Q
Quan Zhou
Quan Cheng Laboratory & MegaTech.AI, Beijing, China
Cheng Luo
Cheng Luo
MegaTech.AI
AI in LegalNatural Language ProcessingMachine LearningInformation RetrievalRecommendation
Y
Yiqun Liu
DCST, Tsinghua University & Quan Cheng Laboratory, Beijing, China
D
Dongsheng Guo
Quan Cheng Laboratory, Beijing, China
Qingyao Ai
Qingyao Ai
Associate Professor, Dept. of CS&T, Tsinghua University
Information RetrievalMachine Learning