TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain

πŸ“… 2010-07-15
πŸ›οΈ International Workshop on Semantic Evaluation
πŸ“ˆ Citations: 7
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses domain-specific Word Sense Disambiguation (WSD), proposing the first fully unsupervised, dependency-knowledge-driven framework. Unlike conventional supervised or semi-supervised approaches that rely on annotated corpora or general-purpose lexical resources (e.g., WordNet), our method leverages only domain-customized dependency relations extracted from a domain-specific knowledge base to model word semantics. It performs unsupervised semantic similarity computation guided by dependency structures and conducts coarse-grained sense matchingβ€”both without any human-annotated training data or external dictionaries. The key contribution lies in the explicit integration of domain-specific dependency knowledge into the WSD pipeline, enabling effective sense discrimination in a purely unsupervised setting. Evaluated on the SemEval 2010 Task 17 benchmark, our approach substantially outperforms the First Sense Baseline, demonstrating both the efficacy and transferability of domain dependency knowledge for unsupervised WSD.

Technology Category

Application Category

πŸ“ Abstract
Word sense disambiguation (WSD) is one of the main challenges in Computational Linguistics. TreeMatch is a WSD system originally developed using data from SemEval 2007 Task 7 (Coarse-grained English All-words Task) that has been adapted for use in SemEval 2010 Task 17 (All-words Word Sense Disambiguation on a Specific Domain). The system is based on a fully unsupervised method using dependency knowledge drawn from a domain specific knowledge base that was built for this task. When evaluated on the task, the system precision performs above the First Sense Baseline.
Problem

Research questions and friction points this paper is trying to address.

Automatic Word Sense Disambiguation
Computer Understanding
Specific Domains
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automatic Semantic Disambiguation
Unsupervised Learning
TreeMatch System
πŸ”Ž Similar Papers
No similar papers found.
Andrew Tran
Andrew Tran
Temple University
Generative AIHuman-computer Interaction
C
Chris Bowes
University of Houston-Downtown
D
David Brown
University of Houston-Downtown
P
Ping Chen
University of Houston-Downtown
M
Max Choly
University of Massachusetts-Boston
W
Wei Ding
University of Massachusetts-Boston