The Classification of Short Scientific Texts Using Pretrained BERT Model

PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021 (2021)

Abstract
Automated text classification is a natural language processing (NLP) technology that could significantly facilitate the selection of scientific literature. A topic-specific dataset of 630 article abstracts was obtained from the PubMed database. We proposed 27 parametrized variants of the PubMedBERT model and 4 ensemble models to solve a binary classification task on that dataset. For each classification approach, 300 tests with resampling were performed. The best PubMedBERT model achieved an F1-score of 0.857, while the best ensemble model reached an F1-score of 0.853. We concluded that the quality of short scientific text classification might be improved by using the latest state-of-the-art approaches.
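The evaluation scheme described in the abstract (repeated tests on resampled data, scored with the F1 measure) can be illustrated with a minimal sketch. The abstract does not specify how the resamples were drawn, so the random train/test split, the `test_fraction` parameter, and the names `resampled_f1` and `keyword_baseline` are assumptions made for illustration only. The keyword rule is a toy stand-in for a classifier, not the PubMedBERT or ensemble models used by the authors.

```python
import random
from statistics import mean


def f1_score(y_true, y_pred, positive=1):
    """F1 = 2 * precision * recall / (precision + recall) for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision or recall is zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def resampled_f1(texts, labels, fit_predict, n_tests=300, test_fraction=0.2, seed=0):
    """Average F1 over repeated random train/test splits.

    Assumption: each "test with resampling" is a fresh random split;
    the source does not state the exact resampling procedure.
    """
    rng = random.Random(seed)
    indices = list(range(len(texts)))
    n_test = max(1, int(len(indices) * test_fraction))
    scores = []
    for _ in range(n_tests):
        rng.shuffle(indices)
        test_idx, train_idx = indices[:n_test], indices[n_test:]
        predictions = fit_predict(
            [texts[i] for i in train_idx],
            [labels[i] for i in train_idx],
            [texts[i] for i in test_idx],
        )
        scores.append(f1_score([labels[i] for i in test_idx], predictions))
    return mean(scores)


def keyword_baseline(train_texts, train_labels, test_texts):
    """Hypothetical stand-in classifier: ignores training data, flags one keyword."""
    return [1 if "tumor" in text.lower() else 0 for text in test_texts]
```

In practice, `fit_predict` would wrap the fine-tuning and inference of a real model, and the mean score would be compared across the candidate configurations.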
Key words
Text classification, neurosurgery, machine learning, topic modeling, natural language processing, artificial intelligence