🤖 AI Summary
Entity disambiguation and linking in IT domains suffer from poor domain adaptability and difficulty in incorporating domain-specific knowledge when relying solely on general-purpose knowledge graphs (e.g., Wikidata, DBpedia).
Method: This paper proposes a lightweight, extensible ontology construction method tailored for the IT domain. Starting from general Linked Open Data (LOD) resources, it employs a domain-agnostic pipeline and—novelly—integrates an IT-specific terminology lexicon to drive ontology schema expansion. The approach synergistically combines SPARQL querying, RDF reasoning, and ontology alignment.
Contribution/Results: The resulting paradigm balances generality and domain specificity, significantly improving accuracy in IT entity disambiguation and linking. It establishes a low-barrier, reusable ontology engineering framework that supports continuous injection of proprietary domain knowledge, thereby enabling sustainable, scalable domain ontology development.
📝 Abstract
Logical and probabilistic reasoning tasks that require a deeper knowledge of semantics are increasingly relying on general purpose ontologies such as Wikidata and DBpedia. However, tasks such as entity disambiguation and linking may benefit from domain-specific knowledge graphs, which make it more efficient to consume the knowledge and easier to extend with proprietary content. We discuss our experience bootstrapping one such ontology for IT with a domain-agnostic pipeline, and extending it using domain-specific glossaries.