Decoder-based Sense Knowledge Distillation

📅 2026-02-25

📈 Citations: 0

✨ Influential: 0

career value

133K/year

🤖 AI Summary

Existing decoder-only large language models struggle to effectively leverage structured lexical semantic knowledge during text generation, resulting in insufficient inheritance of semantic relationships. This work proposes a lexical semantic knowledge distillation framework tailored for such models, which, for the first time, integrates structured semantic knowledge from lexical resources into the training process. By combining lexical knowledge distillation, contextual embedding alignment, and decoder fine-tuning, the approach enhances the model’s semantic expressiveness without requiring on-the-fly dictionary lookups during inference. Experimental results demonstrate that the method significantly improves the model’s ability to inherit lexical semantic relations across multiple benchmarks, achieving a favorable balance between semantic richness and training efficiency.

Technology Category

Application Category

📝 Abstract

Large language models (LLMs) learn contextual embeddings that capture rich semantic information, yet they often overlook structured lexical knowledge such as word senses and relationships. Prior work has shown that incorporating sense dictionaries can improve knowledge distillation for encoder models, but their application to decoder as generative models remains challenging. In this paper, we introduce Decoder-based Sense Knowledge Distillation (DSKD), a framework that integrates lexical resources into the training of decoder-style LLMs without requiring dictionary lookup at inference time. Extensive experiments on diverse benchmarks demonstrate that DSKD significantly enhances knowledge distillation performance for decoders, enabling generative models to inherit structured semantics while maintaining efficient training.

Problem

Research questions and friction points this paper is trying to address.

knowledge distillation

decoder-based LLMs

lexical knowledge

word senses

structured semantics

Innovation

Methods, ideas, or system contributions that make the work stand out.

knowledge distillation

decoder-based LLMs

lexical resources