Structured Document Translation via Format Reinforcement Learning

📅 2025-12-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing structured text translation research primarily operates at the sentence level, failing to preserve document-level structural integrity in XML/HTML documents. To address this, we propose a formatting-aware reinforcement learning framework that jointly optimizes translation quality and structural fidelity via a structure-aware reward mechanism. We introduce two novel structural rewards: TreeSim, measuring tree-structure similarity, and Node-chrF, a node-level character n-gram F-score; additionally, we propose StrucAUC as a fine-grained, integrated evaluation metric for structural and semantic alignment. Our method builds upon a supervised fine-tuned base model and employs Group Relative Policy Optimization for end-to-end optimization. Evaluated on the SAP software documentation benchmark, our approach achieves statistically significant improvements across all six evaluation metrics—demonstrating its effectiveness in simultaneously ensuring structural preservation and semantic accuracy.

Technology Category

Application Category

📝 Abstract
Recent works on structured text translation remain limited to the sentence level, as they struggle to effectively handle the complex document-level XML or HTML structures. To address this, we propose extbf{Format Reinforcement Learning (FormatRL)}, which employs Group Relative Policy Optimization on top of a supervised fine-tuning model to directly optimize novel structure-aware rewards: 1) TreeSim, which measures structural similarity between predicted and reference XML trees and 2) Node-chrF, which measures translation quality at the level of XML nodes. Additionally, we apply StrucAUC, a fine-grained metric distinguishing between minor errors and major structural failures. Experiments on the SAP software-documentation benchmark demonstrate improvements across six metrics and an analysis further shows how different reward functions contribute to improvements in both structural and translation quality.
Problem

Research questions and friction points this paper is trying to address.

Optimizes document-level structured text translation
Measures structural similarity and translation quality
Improves performance across multiple evaluation metrics
Innovation

Methods, ideas, or system contributions that make the work stand out.

Format Reinforcement Learning optimizes structure-aware rewards
TreeSim measures structural similarity between XML trees
Node-chrF evaluates translation quality at XML node level
🔎 Similar Papers
No similar papers found.
H
Haiyue Song
National Institute of Information and Communications Technology, Japan
J
Johannes Eschbach-Dymanus
SAP, Germany
H
Hour Kaing
National Institute of Information and Communications Technology, Japan
S
Sumire Honda
SAP, Germany
H
Hideki Tanaka
National Institute of Information and Communications Technology, Japan
B
Bianka Buschbeck
SAP, Germany
Masao Utiyama
Masao Utiyama
NICT
Machine Translation