Keyword Extraction For Web News Documents Based On Lm-Bp Neural Network

2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC)(2015)

引用 25|浏览8
暂无评分
摘要
In view of the actual demand, the paper provides a new idea on keyword extraction for web news documents by adopting the improved LM algorithm based on BP artificial neural network. First, preprocess the web news documents which are of consistent HTML format. The preprocessed work includes noise filter, web content extraction, word segmentation, POS tagging, stop words removal, etc. Also, select effective features like TT, location of words based on the characteristics of news documents. Then the selected features will he considered in training and constructing the BP neural network. Finally, extract keywords with LM algorithm which has parameters adjustment and solves training too long and getting stuck in local minimum of BP so that improve network convergence speed and keyword classification performance. The results show that LM algorithm has better effect and convergence performance comparing with BP in the field of keyword extraction.
更多
查看译文
关键词
BP neural network, Levenberg-Marquardt Algorithm, Keyword Extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要