Global triangle estimation based on first edge sampling in large graph streams

JOURNAL OF SUPERCOMPUTING(2023)

引用 0|浏览20
暂无评分
摘要
Triangle approximate counting has emerged as a prominent issue in graph stream research in the past few years, with applications ranging from social network analysis to web topic mining and motif detection in informatics. Many graph stream sampling and triangle approximate counting algorithms have been proposed, with the majority of them guaranteeing unbiased estimation. However, they either cannot ensure that the memory overhead or the result’s uncertainty is too great due to the use of an excessively large sampling space. In this article, we propose RFES, a set of one-pass stream algorithms for counting the global number of triangles in a fully dynamic graph stream in an unbiased, low-variance, and high-precision manner. RFES has three algorithms: RFESBASE, RFES-IMPR, and RFES-FD, which represent the basic, improved, and complete dynamic versions, respectively. Each algorithm is based on our proposed first-edge reservoir sampling method, which shrinks the sampling space while increasing the uncertainty of triangles in the sample. It can deal with fully dynamic data with a lower theoretical estimation variance than state-of-the-art algorithms. A significant number of experimental results demonstrated that our RFES algorithm is more accurate and takes less time. The source codes of RFES can be downloaded from the website: https://github.com/BioLab310/RFES .
更多
查看译文
关键词
Graph stream,Triangle counting,First-edge sampling,Probability and statistics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要