PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs

📅 2026-04-19

📈 Citations: 0

✨ Influential: 0

career value

172K/year

🤖 AI Summary

This work addresses key challenges faced by general-purpose large language models in the legal domain—namely, citation hallucination, insufficient knowledge coverage, and weak structured reasoning—by proposing a domain-specific large language model tailored for political and legal affairs. The authors introduce a unified training paradigm that integrates continual pretraining on high-quality legal corpora, progressive supervised fine-tuning, and preference-based reinforcement learning to systematically enhance the model’s factual accuracy, task alignment, and reasoning capabilities. Experimental results demonstrate that the proposed model significantly outperforms same-scale baselines on established benchmarks such as LawBench and LexEval, as well as on PoliLegal, a real-world legal dataset, exhibiting superior practicality and reliability in authentic legal applications.

Technology Category

Application Category

📝 Abstract

Large language models (LLMs) have achieved remarkable success in general-domain tasks, yet their direct application to the legal domain remains challenging due to hallucinated legal citations, incomplete knowledge coverage, and weak structured reasoning. To address these issues, we propose PoliLegalLM, a domain-specific large language model tailored for political and legal applications. Our approach adopts a unified training framework that integrates continued pretraining, progressive supervised fine-tuning, and preference-based reinforcement learning to jointly enhance legal knowledge grounding, task alignment, and reasoning capability. We construct a large-scale, high-quality legal corpus and design a structured post-training pipeline, enabling the model to effectively learn domain-specific knowledge and adapt to diverse legal tasks. We evaluate PoliLegalLM on three representative benchmarks, including LawBench, LexEval, and a real-world dataset, PoliLegal. Experimental results demonstrate that PoliLegalLM achieves strong and consistent performance, outperforming competitive models of similar scale and remaining highly competitive with significantly larger models, while achieving the best results on real-world legal scenarios. These results highlight the effectiveness of our training paradigm and the practical value of domain-specific LLMs for real-world legal applications.

Problem

Research questions and friction points this paper is trying to address.

legal hallucination

knowledge coverage

structured reasoning

domain-specific LLM

legal applications

Innovation

Methods, ideas, or system contributions that make the work stand out.

domain-specific LLM

structured reasoning

legal knowledge grounding