A Web Crawler System Design Based on Distributed Technology.

JNW(2011)

Cited 4|Views6
No score
Abstract
A practical distributed web crawler architecture is designed. The distributed cooperative grasping algorithm is put forward to solve the problem of distributed Web Crawler grasping. Log structure and Hash structure are combined and a large-scale web store structure is devised, which can meet not only the need of a large amount of random accesses, but also the need of newly added pages. Experiment results have shown that the distributed Web Crawler's performance, scalability, and load balance are better. © 2011 ACADEMY PUBLISHER.
More
Translated text
Key words
distributed system,grasping strategy,search engine,web crawler
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined