🤖 AI Summary
This paper addresses the NP-hard problem of mining maximum disjoint $k$-clique ($k geq 3$) sets in large-scale dynamic social networks, targeting real-time team formation in applications such as online gaming. Methodologically, it proposes the first lightweight $k$-approximation algorithm, integrating node ordering, clique-graph degree estimation, and dynamic index maintenance, along with an intelligent swap mechanism supporting high-frequency edge insertions and deletions—significantly reducing update overhead. Evaluated on multiple real-world large graphs, the method achieves up to two orders-of-magnitude speedup over state-of-the-art approaches while improving the count of discovered disjoint $k$-cliques by 13.3% on average. Key contributions include: (i) the first theoretically guaranteed $k$-approximation ratio; (ii) a lightweight architecture balancing accuracy, efficiency, and dynamic adaptability; and (iii) an open-source, scalable framework for dynamic clique matching.
📝 Abstract
A $k$-clique is a dense graph, consisting of $k$ fully-connected nodes, that finds numerous applications, such as community detection and network analysis. In this paper, we study a new problem, that finds a maximum set of disjoint $k$-cliques in a given large real-world graph with a user-defined fixed number $k$, which can contribute to a good performance of teaming collaborative events in online games. However, this problem is NP-hard when $k geq 3$, making it difficult to solve. To address that, we propose an efficient lightweight method that avoids significant overheads and achieves a $k$-approximation to the optimal, which is equipped with several optimization techniques, including the ordering method, degree estimation in the clique graph, and a lightweight implementation. Besides, to handle dynamic graphs that are widely seen in real-world social networks, we devise an efficient indexing method with careful swapping operations, leading to the efficient maintenance of a near-optimal result with frequent updates in the graph. In various experiments on several large graphs, our proposed approaches significantly outperform the competitors by up to 2 orders of magnitude in running time and 13.3% in the number of computed disjoint $k$-cliques, which demonstrates the superiority of the proposed approaches in terms of efficiency and effectiveness.