Representing provenance and track changes of cultural heritage metadata in RDF: a survey of existing approaches

📅 2023-05-15
🏛️ Digital Humanities Conference
📈 Citations: 4
Influential: 0
📄 PDF
🤖 AI Summary
Digital humanities face challenges in provenance tracking and change management for cultural heritage metadata, as existing RDF-based approaches suffer from weak standards compliance (e.g., W3C RDF reification, n-ary relations) and poor cross-domain interoperability. Method: This study conducts a systematic, multidimensional empirical evaluation of six mainstream semantic models—Named Graphs, RDF*, PROV-O, among others—assessing their standards conformance, extensibility, and domain adaptability specifically within cultural heritage contexts. Contribution/Results: We propose a practice-oriented provenance modeling selection framework that explicitly characterizes trade-offs among trustworthiness assurance, computational overhead, and interoperability. The framework delivers reusable, verifiable decision support for metadata provenance modeling in digital humanities projects, thereby bridging a critical methodological gap in the deep adaptation of Semantic Web technologies to humanities scholarship.
📝 Abstract
In the realm of Digital Humanities, the management of cultural heritage metadata is pivotal for ensuring data trustworthiness. Provenance information - contextual metadata detailing the origin and history of data - plays a crucial role in this process. However, tracking provenance and changes in metadata using the Resource Description Framework (RDF) presents significant challenges due to the limitations of foundational Semantic Web technologies. This article offers a comprehensive review of existing models and approaches for representing provenance and tracking changes in RDF, with a specific focus on cultural heritage metadata. It examines W3C standard proposals such as RDF Reification and n-ary relations, along with various alternative systems. Through an in-depth analysis, the study identifies Named Graphs, RDF*, the Provenance Ontology (PROV-O), Dublin Core (DC), Conjectural Graphs, and the OpenCitations Data Model (OCDM) as the most effective solutions. These models are evaluated based on their compliance with RDF standards, scalability, and applicability across different domains. The findings underscore the importance of selecting the appropriate model to ensure robust and reliable management of provenance in RDF datasets, thereby contributing to the ongoing discourse on provenance representation in the Digital Humanities.
Problem

Research questions and friction points this paper is trying to address.

Challenges in tracking provenance and metadata changes in RDF
Review of models for cultural heritage metadata in RDF
Evaluation of RDF solutions for scalability and compliance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses Named Graphs for RDF provenance tracking
Applies PROV-O ontology for metadata history
Evaluates RDF* for scalable change management