Imbalanced COVID-19 dataset classification with bidirectional sampling based on sample correlation

International Journal of Embedded Systems(2023)

引用 0|浏览0
暂无评分
摘要
Aiming at the problem that the classification hyperplane is inclined toward the positive class when the CNN model directly classifies the imbalanced dataset, resulting in a high misclassification rate, a bidirectional sampling method based on sample correlation is proposed. Firstly, the sampling ratio is designed according to the numbers of the two types of samples, and then, considering the influence of the positional correlation between the samples, the methods of under-sampling negative samples and oversampling of positive samples are proposed. Therefore, the balance of the numbers of positive and negative samples is achieved. Finally, after sampling the imbalanced dataset of Kaggle images, the deep learning model SSD is used to train and identify the COVID-19 samples. The experimental comparison results show that the method proposed in this paper can improve the evaluation indices such as F+-measure and G-means by more than 5% in the identification of COVID-19.
更多
查看译文
关键词
bidirectional sampling,sample correlation,classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要