ECS-SC: Long-tailed classification via data augmentation based on easily confused sample selection and combination

Wenwei He, Junyan Xu, Jie Shi, Hong Zhao

Expert Systems with Applications (2024)

Abstract
Long-tailed data pose many challenges for machine learning because samples of the tail classes are extremely scarce. Long-tailed data augmentation is a powerful technique for enriching the diversity of tail classes. However, existing methods often treat each class independently, assuming that classes are isolated from one another. These approaches overlook the presence of easily confused tail classes, which makes it hard for models to distinguish between them accurately. In this paper, we propose a long-tailed classification method based on data augmentation that uses multi-granularity knowledge to select and combine easily confused tail samples, thereby improving classification performance on these samples. First, we use multi-granularity knowledge and semantic relation trees to build a class relation matrix. This matrix records the relationships between classes and helps the model search for easily confused classes from bilateral-branch samplers. Second, we crop and combine the easily confused head and tail class samples in a foreground–background manner to generate new samples for model training. Through this foreground–background combination, the extensive head class knowledge is transferred to the scarce tail class samples, and the discrimination and generalization abilities of the model are improved. The experimental results affirm the effectiveness of the proposed method.
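
The two steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the two-level tree, the class names, the binary "shares a coarse parent" relation, the fixed crop box, and the choice of which image serves as background are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np

# Hypothetical two-level semantic tree: fine class -> coarse parent.
PARENT = {"husky": "canine", "wolf": "canine", "tabby": "feline", "lynx": "feline"}
CLASSES = sorted(PARENT)  # fixed class order used to index the matrix


def class_relation_matrix(classes, parent):
    """Return an n x n matrix with 1.0 where two distinct classes share a parent.

    Assumption: sharing a coarse-grained parent is used here as a stand-in
    for "easily confused"; the paper's actual relation may be richer.
    """
    n = len(classes)
    rel = np.zeros((n, n), dtype=np.float32)
    for i, a in enumerate(classes):
        for j, b in enumerate(classes):
            if i != j and parent[a] == parent[b]:
                rel[i, j] = 1.0
    return rel


def confused_partner(rel, class_idx, rng):
    """Pick one class index related to class_idx, or None if there is none."""
    candidates = np.flatnonzero(rel[class_idx])
    if candidates.size == 0:
        return None
    return int(rng.choice(candidates))


def combine_fore_background(background, foreground, box):
    """Paste the box region of `foreground` onto a copy of `background`.

    box is (top, left, height, width). Assumption: both images have the same
    shape and the crop keeps its position, as in cut-and-paste augmentation.
    """
    top, left, height, width = box
    out = background.copy()
    out[top:top + height, left:left + width] = \
        foreground[top:top + height, left:left + width]
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rel = class_relation_matrix(CLASSES, PARENT)
    partner = confused_partner(rel, CLASSES.index("husky"), rng)
    print("partner of husky:", CLASSES[partner])

    head_img = np.zeros((8, 8, 3), dtype=np.float32)  # stand-in head-class image
    tail_img = np.ones((8, 8, 3), dtype=np.float32)   # stand-in tail-class image
    mixed = combine_fore_background(head_img, tail_img, (2, 2, 4, 4))
    print("mixed shape:", mixed.shape)
```

In this sketch the tail-class crop is treated as the foreground and the head-class image as the background; the abstract does not state the assignment, so the roles may be the reverse in the actual method.
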
Keywords
Long-tailed classification, Data augmentation, Multi-granularity, Easily confused sample