Weakly-Supervised Symptom Recognition for Rare Diseases in Biomedical Text.

ADVANCES IN INTELLIGENT DATA ANALYSIS XV(2016)

引用 8|浏览42
暂无评分
摘要
In this paper, we tackle the issue of symptom recognition for rare diseases in biomedical texts. Symptoms typically have more complex and ambiguous structure than other biomedical named entities. Furthermore, existing resources are scarce and incomplete. Therefore, we propose a weakly-supervised framework based on a combination of two approaches: sequential pattern mining under constraints and sequence labeling. We use unannotated biomedical paper abstracts with dictionaries of rare diseases and symptoms to create our training data. Our experiments show that both approaches outperform simple projection of the dictionaries on text, and their combination is beneficial. We also introduce a novel pattern mining constraint based on semantic similarity between words inside patterns.
更多
查看译文
关键词
Information extraction,Pattern mining,CRF,Symptoms recognition,Biomedical texts
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要