Data Augmentation Meta-Classifier Scheme for imbalanced data sets.

SSCI(2022)

Cited 0|Views13
No score
Abstract
Categorical data obtained from real-world domains are commonly imbalanced, as they often present more number of samples belonging to one of the classes. Imbalanced data tends to be a problem for classifiers, as the majority class biased them and affects overall performance. Among the techniques used for dealing with imbalanced data sets, data augmentation (DA) constitutes an alternative, as it can improve the accuracy of prediction for the minority class (usually the relevant one), but usually at the cost of a loss regarding predictions of the majority one. To benefit from both behaviours, we introduce in this study a meta-classifier scheme that works as a mixture of two classifiers, one trained with the original data and the second one trained using augmented data. The experiments carried out with 12 imbalanced data sets, 5 of them obtained from the TCGA database related to cancer survival prediction, show an improvement in accuracy, area under the ROC curve and Matthews correlation coefficient values compared to the results obtained using the original data sets.
More
Translated text
Key words
Data augmentation,Imbalance learning,Data Mining,Meta-Classifier
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined