Two-stage Incomplete Utterance Rewriting on Editing Operation

📅 2025-03-20
🏛️ International Conference on Computational Linguistics
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address incomplete utterance rewriting (IUR) in dialogues—caused by anaphora and ellipsis—existing methods predominantly rely on context-aware direct generation, overlooking the intrinsic structural decomposition into coreference resolution and ellipsis recovery. This paper proposes an edit-operation-driven two-stage IUR paradigm: Stage 1 predicts fine-grained edit actions (e.g., replace, insert, restore), and Stage 2 performs action- and context-conditioned rewriting. We introduce adversarial perturbation training to mitigate exposure bias and error propagation, and design a context-aware edit representation mechanism. Evaluated on three standard IUR benchmarks, our approach achieves significant improvements over state-of-the-art methods in both rewriting accuracy and BLEU score, demonstrating the effectiveness and robustness of the edit-based modeling paradigm.

Technology Category

Application Category

📝 Abstract
Previous work on Incomplete Utterance Rewriting (IUR) has primarily focused on generating rewritten utterances based solely on dialogue context, ignoring the widespread phenomenon of coreference and ellipsis in dialogues. To address this issue, we propose a novel framework called TEO (emph{Two-stage approach on Editing Operation}) for IUR, in which the first stage generates editing operations and the second stage rewrites incomplete utterances utilizing the generated editing operations and the dialogue context. Furthermore, an adversarial perturbation strategy is proposed to mitigate cascading errors and exposure bias caused by the inconsistency between training and inference in the second stage. Experimental results on three IUR datasets show that our TEO outperforms the SOTA models significantly.
Problem

Research questions and friction points this paper is trying to address.

Addresses coreference and ellipsis in dialogue utterances.
Proposes a two-stage framework for incomplete utterance rewriting.
Mitigates cascading errors and exposure bias in rewriting.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Two-stage approach for utterance rewriting
Editing operations enhance context utilization
Adversarial perturbation reduces training-inference inconsistency
🔎 Similar Papers
No similar papers found.
Z
Zhiyu Cao
School of Computer Science and Technology, Soochow University, Suzhou, China
P
Peifeng Li
School of Computer Science and Technology, Soochow University, Suzhou, China
Qiaoming Zhu
Qiaoming Zhu
Soochow University
Natural Language Processing
Y
Yaxin Fan
School of Computer Science and Technology, Soochow University, Suzhou, China