🤖 AI Summary
To address result explosion and ambiguous path semantics in navigational graph database queries, this paper proposes a formal framework for path attributes: quantitative constraints—such as path length and accumulated cost—are natively embedded into the operational semantics of the query language, enabling the first seamless integration of path properties with query processing. Unlike register automata, our framework offers strictly greater expressive power; its completeness and consistency are rigorously established via co-modeling of operational and logical semantics. We implement a prototype system based on this framework. Experiments demonstrate that incorporating path-attribute filtering significantly reduces result set size (by 72% on average), improves query throughput (up to 4.3× speedup), and effectively identifies and eliminates malformed paths.
📝 Abstract
This paper presents a formalism for defining properties of paths in graph databases, which can be used to restrict the number of solutions to navigational queries. In particular, our formalism allows us to define quantitative properties such as length or accumulated cost, which can be used as query filters. Furthermore, it enables the identification and removal of paths that may be considered ill-formed.
The new formalism is defined in terms of an operational semantics for the query language that incorporates these new constructs, demonstrating its soundness and completeness by proving its compatibility with a simple logical semantics. We also analyze its expressive power, showing that path properties are more expressive than register automata. Finally, after discussing some complexity issues related to this new approach, we present an empirical analysis carried out using our prototype implementation of the graph database that serves as a running example throughout the paper. The results show that queries using path properties as filters outperform standard queries that do not use them.