
A Framework of Petroleum Information Retrieval System Based on Web Scraping with Python

2018 15th International Conference on Service Systems and Service Management (ICSSSM) (2018)

Abstract
In an era of information explosion, customized retrieval systems are increasingly necessary. This paper presents a framework for a petroleum information retrieval system intended for researchers in petroleum exploration and development. First, we use the open-source framework Scrapy to build a crawler that collects the information business users care about. Then the k-means algorithm is used to cluster the crawled documents, so that key information can be extracted and presented in the system. Results from production use show that this customized retrieval system is efficient and agile, and that it improves the efficiency, accuracy, and automation level of the work.
Key words
web crawler, k-means, clustering, information retrieval, petroleum
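
The clustering step described in the abstract can be illustrated with a small sketch. The abstract does not say how documents are turned into vectors or how key information is picked out, so everything below is an assumption made for illustration: crawled pages are taken to be already reduced to plain text, TF-IDF is used as the document representation, and the highest-weighted terms of each cluster centroid stand in for "key information". The function names (tfidf_vectors, kmeans, top_terms) and the sample texts are hypothetical, and the crawl itself (Scrapy in the paper) is left out. Only the Python standard library is used.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn raw texts into L2-normalised TF-IDF vectors over a shared vocabulary."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    doc_freq = Counter(tok for toks in tokenised for tok in set(toks))
    n_docs = len(docs)
    vectors = []
    for toks in tokenised:
        term_freq = Counter(toks)
        # Smoothed IDF (an assumption) so that no term gets a zero weight.
        vec = [
            term_freq[t] * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
            for t in vocab
        ]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vocab, vectors


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=50):
    """Lloyd's k-means, seeded with the first k vectors so runs are repeatable."""
    centroids = [list(v) for v in vectors[:k]]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each document goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old centroid if a cluster empties out
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def top_terms(vocab, centroid, n=3):
    """Return the n vocabulary terms with the largest centroid weight."""
    ranked = sorted(zip(vocab, centroid), key=lambda pair: pair[1], reverse=True)
    return [term for term, _ in ranked[:n]]


if __name__ == "__main__":
    # Hypothetical stand-ins for crawled petroleum documents.
    sample_docs = [
        "drilling well rig drilling mud",
        "seismic survey seismic data imaging",
        "well drilling rig bit",
        "seismic imaging survey inversion",
    ]
    vocab, vectors = tfidf_vectors(sample_docs)
    labels, centroids = kmeans(vectors, k=2)
    for cluster_id, centroid in enumerate(centroids):
        print(cluster_id, top_terms(vocab, centroid))
    print(labels)
```

Seeding with the first k documents keeps the sketch deterministic; a production system would more likely use randomized or k-means++ seeding and a proper tokenizer, neither of which is described in the abstract.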