A Large Scholarly Corpus: A Bird's-Eye View

2017 IEEE 13th International Conference on e-Science (e-Science) (2017)

Abstract
In this paper we present a new, very large, rich Comprehensive Scholarly Corpus (CompScholarCorp) as a platform and data source for future research. Our corpus contains records of 1,044,454 papers and 472,365 unique authors, with substantial publication metadata for each record. We have integrated the data we collected from 276 publishers using a uniform and consistent XML data format within the corpus. The corpus is designed to be compatible with DBLP, enabling existing research to utilise our new corpus directly. As an initial analysis, we present a number of visualisations of the corpus to better understand the data, provide some analytics of the data, and present a rule of thumb we have observed for citations.
Keywords
Corpus, Scholar, Collaborative network, Social Network, Citation, Network Visualisation