XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion

📅 2025-02-08
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To lower the barrier to understanding nuclear fusion science and promote public engagement and scientific literacy, this work introduces XiHeFusion—the first large language model (LLM) specifically designed for nuclear fusion outreach. Built upon Qwen2.5-14B, it undergoes supervised fine-tuning on a diverse, multi-source knowledge corpus comprising arXiv preprints, electronic textbooks, theses, and web-scraped fusion-related materials, augmented with chain-of-thought (CoT) prompting to enhance physical reasoning and interpretability. We propose a novel, comprehensive evaluation benchmark—FusionSciBench—covering 180+ questions spanning conceptual understanding, quantitative reasoning, and real-world applications in fusion science communication. Experimental results demonstrate that XiHeFusion significantly outperforms general-purpose LLMs on fusion-specific科普 QA tasks. The model and benchmark are fully open-sourced to foster community-driven development, validation, and deployment in science education and outreach.

Technology Category

Application Category

📝 Abstract
Nuclear fusion is one of the most promising ways for humans to obtain infinite energy. Currently, with the rapid development of artificial intelligence, the mission of nuclear fusion has also entered a critical period of its development. How to let more people to understand nuclear fusion and join in its research is one of the effective means to accelerate the implementation of fusion. This paper proposes the first large model in the field of nuclear fusion, XiHeFusion, which is obtained through supervised fine-tuning based on the open-source large model Qwen2.5-14B. We have collected multi-source knowledge about nuclear fusion tasks to support the training of this model, including the common crawl, eBooks, arXiv, dissertation, etc. After the model has mastered the knowledge of the nuclear fusion field, we further used the chain of thought to enhance its logical reasoning ability, making XiHeFusion able to provide more accurate and logical answers. In addition, we propose a test questionnaire containing 180+ questions to assess the conversational ability of this science popularization large model. Extensive experimental results show that our nuclear fusion dialogue model, XiHeFusion, can perform well in answering science popularization knowledge. The pre-trained XiHeFusion model is released on https://github.com/Event-AHU/XiHeFusion.
Problem

Research questions and friction points this paper is trying to address.

Develops XiHeFusion for nuclear fusion communication
Enhances logical reasoning with chain of thought
Assesses model with 180+ question test questionnaire
Innovation

Methods, ideas, or system contributions that make the work stand out.

Supervised fine-tuning of Qwen2.5-14B
Chain of thought for logical reasoning
Multi-source knowledge training support
🔎 Similar Papers
No similar papers found.
X
Xiao Wang
Anhui University, Hefei 230601, China
Q
Qingquan Yang
Institute of Plasma Physics, Chinese Academy of Sciences, Hefei, China
Fuling Wang
Fuling Wang
Anhui University
Medical Report Generation
Q
Qiang Chen
Anhui University, Hefei 230601, China
W
Wentao Wu
Anhui University, Hefei 230601, China
Y
Yu Jin
Anhui University, Hefei 230601, China
J
Jingtao Jiang
Anhui University, Hefei 230601, China
L
Liye Jin
Anhui University, Hefei 230601, China
B
Bo Jiang
Anhui University, Hefei 230601, China
Dengdi Sun
Dengdi Sun
Anhui University
Machine LearningComputer Vision
W
Wanli Lv
Anhui University, Hefei 230601, China
M
Meiwen Chen
Anhui University, Hefei 230601, China
Zehua Chen
Zehua Chen
PostDoc at Tsinghua University | Ph.D. from Imperial College
Generative ModelsMulti-modal GenerationHealth Monitoring
Guosheng Xu
Guosheng Xu
Institute of Plasma Physics, Chinese Academy of Sciences, Hefei, China
Jin Tang
Jin Tang
Anhui University
Computer visionintelligent video analysis