Plagiarism Detection in arXiv

Hong Kong(2007)

引用 98|浏览1
暂无评分
摘要
Abstract We describe a large-scale application of methods,for finding plagiarism,and self-plagiarism in research document collections. The methods,are applied to a collection of 284,834 documents collected by arXiv.org over a 14 year period, covering a few different research disciplines. The methodology,efficiently detects a variety of problematic au - thor behaviors, and heuristics are developed to reduce the number,of false positives. The methods,are also efficient enough,to implement,as a real-time submission screen for a collection many,times larger.
更多
查看译文
关键词
plagiarism detection,real-time submission screen,problematic author behavior,false positive,ciently detects,large-scale application,year period,different research discipline,methodology effi,research document collection,technical report,text analysis,real time,computer science
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要