🤖 AI Summary
This work addresses the quadratic-time bottleneck in counting k-order subhypergraphs (hypergraphlets) in hypergraphs by proposing the first subquadratic-time algorithm. Leveraging a novel structural assumption termed (α,β)-niceness, the input hypergraph is decomposed into low-rank and low-degree components. The algorithm combines color coding, rank- and degree-based divide-and-conquer strategies, and randomized sampling to process these components separately before merging the results. This approach enables both efficient exact counting and uniform sampling of hypergraphlets. Empirical evaluation on real-world hypergraph datasets demonstrates over an order-of-magnitude speedup compared to the naive quadratic algorithm, substantially breaking through the prevailing complexity barrier.
📝 Abstract
We study the problem of counting $k$-\emph{hyper}graphlets, an interesting but surprisingly ignored primitive, with the aim of understanding if efficient algorithms exist. To this end we consider \emph{color coding}, a well-known technique for approximately counting $k$-graphlets in graphs. Our first result is that, on hypergraphs, color coding encounters a \emph{quadratic barrier}: under the Orthogonal Vector Conjecture, no implementation of it can run in time sub-quadratic in the size of the input. We then introduce a simple property, $(α,β)$-niceness, that hypergraphs from real-world datasets appear to satisfy for small values of $α$ and $β$. Intuitively, an $(α,β)$-nice hypergraph can be split into two sub-hypergraphs having respectively rank at most $α$ and degree at most $β$. By applying different techniques to each sub-hypergraph and carefully combining the outputs, we show how to run color coding in time $2^{O(k)} \cdot \big(2^β|V| + α^k |E| + α^2 β\size{H}\big)$, where $H=(V,E)$ is the input hypergraph. Afterwards, we can sample colorful $k$-hypergraphlets uniformly in expected $k^{O(k)} \cdot (β^2+\ln |V|)$ time per sample. Experiments on real-world hypergraphs show that our algorithm neatly outperforms the naive quadratic algorithm, sometimes by more than an order of magnitude.