The Causal-Effect Score in Data Management

📅 2025-02-04
🤖 AI Summary
Problem: Existing database attribution methods, such as Shapley values, fail to accurately quantify the causal contribution strength of individual tuples to query results.

Method: We propose the Causal-Effect Score (CES), the first framework to integrate structural causal models (SCMs) and counterfactual reasoning into data management, enabling unified tuple-level causal attribution for both deterministic and probabilistic databases. CES combines query semantics modeling, probabilistic inference, and efficient approximation algorithms.

Contribution/Results: We provide an axiomatized definition of CES, analyze its computational complexity, and prove it satisfies key causal properties, including causal sensitivity and consistency. Experiments demonstrate that CES significantly outperforms baseline methods in attribution accuracy while maintaining strong scalability. By bridging causal inference and database systems, CES establishes a novel paradigm for interpretable, causally grounded database explanations.

📝 Abstract
The Causal Effect (CE) is a numerical measure of causal influence of variables on observed results. Despite being widely used in many areas, only preliminary attempts have been made to use CE as an attribution score in data management, to measure the causal strength of tuples for query answering in databases. In this work, we introduce, generalize and investigate the so-called Causal-Effect Score in the context of classical and probabilistic databases.
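
To make the idea of a tuple-level causal effect concrete, here is a minimal brute-force sketch. It assumes the interventional reading used in earlier work on causality in databases: the score of a tuple is the expected value of a Boolean query when the tuple is forced into the database, minus the expected value when it is forced out, with every other endogenous tuple kept independently with probability `p` (1/2 by default). The paper's own definition and generalizations may differ; the names `causal_effect`, `q`, and `db`, and the toy relations `R` and `S`, are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def causal_effect(query, endogenous, tau, exogenous=frozenset(), p=0.5):
    """Brute-force causal effect of tuple `tau` on a Boolean query.

    Assumed definition (not taken from the paper):
        CE(tau) = E[Q | do(tau present)] - E[Q | do(tau absent)]
    where each other endogenous tuple is present independently with
    probability `p`, and exogenous tuples are always present.
    Runs in time exponential in the number of endogenous tuples.
    """
    others = [t for t in endogenous if t != tau]
    effect = 0.0
    for k in range(len(others) + 1):
        for subset in combinations(others, k):
            world = set(subset) | set(exogenous)
            # Probability of this particular sub-instance.
            weight = p ** k * (1 - p) ** (len(others) - k)
            with_tau = 1 if query(world | {tau}) else 0
            without_tau = 1 if query(world) else 0
            effect += weight * (with_tau - without_tau)
    return effect


def q(world):
    """Boolean conjunctive query: exists x, y such that R(x, y) and S(y)."""
    return any(t[0] == "R" and ("S", t[2]) in world for t in world)


# Toy instance: every tuple is treated as endogenous.
db = [("R", "a", "b"), ("R", "c", "b"), ("S", "b")]

scores = {t: causal_effect(q, db, t) for t in db}
for t, s in scores.items():
    print(t, s)
```

On this toy instance, `S(b)` receives a score of 0.75 and each `R` tuple a score of 0.25: the query can never hold without `S(b)`, whereas either `R` tuple can substitute for the other. The enumeration over all sub-instances is only meant to illustrate the definition and is not practical for real databases.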
Problem

Research questions and friction points this paper is trying to address.

Measure causal influence in databases
Use CE as attribution score
Investigate CE in probabilistic databases
Innovation

Methods, ideas, or system contributions that make the work stand out.

Causal Effect Score
Data Management Application
Probabilistic Databases Analysis