🤖 AI Summary
Traditional Minimum Bayes Risk (MBR) decoding relies on internal model sampling, limiting its ability to incorporate out-of-domain knowledge and thus suffering from poor cross-domain robustness. To address this, we propose Case-Based Decision-Theoretic (CBDT) decoding—a novel framework that integrates high-quality external domain examples into the MBR paradigm for semantic-aware decoding decisions via case retrieval and expected utility estimation. Unlike conventional MBR, CBDT bypasses internal sampling and instead dynamically evaluates candidate outputs’ utility using semantically similar cases retrieved from an external memory bank, substantially improving cross-domain generalization. Empirical results across seven De↔En/Ja↔En machine translation tasks and MSCOCO/nocaps image captioning demonstrate consistent superiority of CBDT over both MAP and standard MBR decoding—particularly under low-resource and out-of-distribution conditions. Our core contribution lies in systematically embedding case-based reasoning into Bayesian decoding, establishing a new paradigm for controllable text generation grounded in external memory.
📝 Abstract
Minimum Bayes risk (MBR) decoding is a decision rule of text generation, which selects the hypothesis that maximizes the expected utility and robustly generates higher-quality texts than maximum a posteriori (MAP) decoding. However, it depends on sample texts drawn from the text generation model; thus, it is difficult to find a hypothesis that correctly captures the knowledge or information of out-of-domain. To tackle this issue, we propose case-based decision-theoretic (CBDT) decoding, another method to estimate the expected utility using examples of domain data. CBDT decoding not only generates higher-quality texts than MAP decoding, but also the combination of MBR and CBDT decoding outperformed MBR decoding in seven domain De--En and Ja$leftrightarrow$En translation tasks and image captioning tasks on MSCOCO and nocaps datasets.