Feature Driven and Point Process Approaches for Popularity Prediction

ACM International Conference on Information and Knowledge Management(2016)

引用 210|浏览220
暂无评分
摘要
Predicting popularity, or the total volume of information outbreaks, is an important subproblem for understanding collective behavior in networks. Each of the two main types of recent approaches to the problem, feature-driven and generative models, have desired qualities and clear limitations. This paper bridges the gap between these solutions with a new hybrid approach and a new performance benchmark. We model each social cascade with a marked Hawkes self-exciting point process, and estimate the content virality, memory decay, and user influence. We then learn a predictive layer for popularity prediction using a collection of cascade history. We compare the performance of feature driven and point-process models using existing tweet data and a new public benchmark on news tweets. To our surprise, Hawkes process with predictive tuning outperform recent feature-driven and generative approaches. We also found that a basic set of user features and event time summary statistics performs competitively in both classification and regression tasks, and that adding point process information to the feature set further improves predictions. From these observations, we argue that future work on popularity prediction should compare across feature-driven and generative modeling approaches in both classification and regression tasks.
更多
查看译文
关键词
social media,self-exciting point process,information diffusion,cascade prediction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要