Finding the maximum clique in massive graphs.

PROCEEDINGS OF THE VLDB ENDOWMENT(2017)

引用 72|浏览70
暂无评分
摘要
Cliques refer to subgraphs in an undirected graph such that vertices in each subgraph are pairwise adjacent. The maximum clique problem, to find the clique with most vertices in a given graph, has been extensively studied. Besides its theoretical value as an NPhard problem, the maximum clique problem is known to have direct applications in various fields, such as community search in social networks and social media, team formation in expert networks, gene expression and motif discovery in bioinformatics and anomaly detection in complex networks, revealing the structure and function of networks. However, algorithms designed for the maximum clique problem are expensive to deal with real-world networks. In this paper, we devise a randomized algorithm for the maximum clique problem. Different from previous algorithms that search from each vertex one after another, our approach RMC, for the randomized maximum clique problem, employs a binary search while maintaining a lower bound (omega c) under bar and an upper bound (omega c) over bar of omega(G). In each iteration, RMC attempts to find a omega t-clique where omega t = b vertical bar((omega c) under bar + (omega c) over bar)/2 vertical bar. As finding omega t in each iteration is NPcomplete, we extract a seed set S such that the problem of finding a omega t-clique in G is equivalent to finding a omega t-clique in S with probability guarantees (>= 1 - n(-c)). We propose a novel iterative algorithm to determine the maximum clique by searching a k-clique in S starting from k = (omega c) under bar + 1 until S becomes empty set, when more iterations benefit marginally. As confirmed by the experiments, our approach is much more efficient and robust than previous solutions and can always find the exact maximum clique.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要