Kongzi: A Historical Large Language Model with Fact Enhancement

📅 2025-04-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Large language models (LLMs) suffer from factual inaccuracies in multi-step historical reasoning, weak cross-temporal-spatial association, and difficulty integrating fragmented historical sources. To address these challenges, we propose HistoLLM—a domain-specific LLM tailored for historical analysis. Methodologically, we introduce a novel fact-augmented reinforcement learning framework that integrates high-fidelity historical corpus fine-tuning with three synergistic mechanisms: historical knowledge injection, fact-consistency constraints, and multi-source historiographical alignment modeling—jointly optimizing factual accuracy and reasoning depth. Experimental results demonstrate that HistoLLM significantly outperforms baseline models on historical question answering and narrative generation tasks, achieving an 18.7% absolute gain in factual accuracy and setting a new state-of-the-art in reasoning coherence. This work establishes a trustworthy AI inference infrastructure for knowledge-intensive historical research.

Technology Category

Application Category

📝 Abstract
The capabilities of the latest large language models (LLMs) have been extended from pure natural language understanding to complex reasoning tasks. However, current reasoning models often exhibit factual inaccuracies in longer reasoning chains, which poses challenges for historical reasoning and limits the potential of LLMs in complex, knowledge-intensive tasks. Historical studies require not only the accurate presentation of factual information but also the ability to establish cross-temporal correlations and derive coherent conclusions from fragmentary and often ambiguous sources. To address these challenges, we propose Kongzi, a large language model specifically designed for historical analysis. Through the integration of curated, high-quality historical data and a novel fact-reinforcement learning strategy, Kongzi demonstrates strong factual alignment and sophisticated reasoning depth. Extensive experiments on tasks such as historical question answering and narrative generation demonstrate that Kongzi outperforms existing models in both factual accuracy and reasoning depth. By effectively addressing the unique challenges inherent in historical texts, Kongzi sets a new standard for the development of accurate and reliable LLMs in professional domains.
Problem

Research questions and friction points this paper is trying to address.

Addresses factual inaccuracies in long reasoning chains of LLMs
Enhances historical analysis with accurate cross-temporal correlations
Improves factual accuracy and reasoning depth in knowledge-intensive tasks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates curated high-quality historical data
Uses novel fact-reinforcement learning strategy
Enhances factual alignment and reasoning depth
🔎 Similar Papers
No similar papers found.
J
Jiashu Yang
Dalian University of Technology
N
Ningning Wang
Dalian University of Technology
Yian Zhao
Yian Zhao
Peking University
3D Gaussian SplattingMLLM
Chaoran Feng
Chaoran Feng
🎓 Peking University
3D VisionEvent-based VisionmLLM/VLM
J
Junjia Du
Nanyang Technological University
H
Hao Pang
Dalian University of Technology
Zhirui Fang
Zhirui Fang
Master of Artificial Intelligence Tsinghua University
Embodied AIReinforcement Learning
Xuxin Cheng
Xuxin Cheng
University of California, San Diego