Geometric mean metric learning for partial label data.

Neurocomputing (2018)

Abstract
Partial label learning (PLL) is a new weakly supervised learning framework for classification problems in which the true label of each training sample is concealed in a set of candidate labels. To learn from such weakly supervised training data, the key is to disambiguate the ambiguous labeling information. Because disambiguation is difficult when only the label space is manipulated, researchers in recent years have also exploited the manifold structure among training data in the feature space to facilitate the process. However, the manifold structure is commonly analyzed under the assumption that samples close to each other in the feature space share identical labels in the label space, which may not hold in many real-world problems. In this paper, a geometric mean metric learning approach is employed to learn a distance metric for PLL problems under which this assumption holds in as many situations as possible. This setting is significantly more challenging than the conventional setup of distance metric learning because it is difficult to identify precisely whether a pair of training samples belongs to the same class. We propose an alternative approach in which each training sample and a neighbor that shares a candidate label with it are taken as a similarity pair, and each training sample and a neighbor that shares no candidate label with it are taken as a dissimilarity pair. Because two samples with a shared candidate label do not necessarily come from the same class, a weight is placed on each similarity pair. Experimental results on twenty-four controlled UCI data sets and six real-world PLL problems show that the proposed distance metric learning approach can serve as a front end both for PLL algorithms that exploit the manifold structure among training data and for other existing distance-based PLL algorithms, significantly improving their performance.
Keywords
Partial label learning, Metric learning, Weighted geometric mean, Weakly supervised data
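
The pair construction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. The function names (`build_pairs`, `spd_power`, `gmml_metric`), the neighborhood size `k`, the regularizer `reg`, and in particular the Jaccard-overlap weight on similarity pairs are all assumptions made for illustration, since the abstract does not specify the weighting scheme. The metric itself uses the commonly cited closed form of geometric mean metric learning, a point on the geodesic between the inverse of the similarity scatter matrix and the dissimilarity scatter matrix, which is likewise assumed here rather than taken from the abstract.

```python
import numpy as np

def build_pairs(X, candidates, k=3):
    """Pair each sample with its k nearest neighbours (Euclidean distance).

    A neighbour sharing at least one candidate label gives a weighted
    similarity pair; a neighbour sharing none gives a dissimilarity pair.
    The Jaccard weight is a hypothetical choice, not the paper's scheme.
    """
    n = X.shape[0]
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d2, np.inf)  # a sample is never its own neighbour
    sim, dis = [], []
    for i in range(n):
        for j in np.argsort(d2[i])[:k]:
            shared = candidates[i] & candidates[j]
            if shared:
                w = len(shared) / len(candidates[i] | candidates[j])
                sim.append((i, int(j), w))
            else:
                dis.append((i, int(j)))
    return sim, dis

def spd_power(M, p):
    """Raise a symmetric positive definite matrix to a real power."""
    vals, vecs = np.linalg.eigh(M)
    return (vecs * vals ** p) @ vecs.T

def gmml_metric(X, sim, dis, t=0.5, reg=1e-3):
    """Return A = S^{-1} #_t D, where S and D are the (weighted)
    scatter matrices of similarity and dissimilarity pairs.

    reg * I keeps both matrices positive definite (assumed regularizer).
    """
    d = X.shape[1]
    S = reg * np.eye(d)
    D = reg * np.eye(d)
    for i, j, w in sim:
        diff = X[i] - X[j]
        S += w * np.outer(diff, diff)
    for i, j in dis:
        diff = X[i] - X[j]
        D += np.outer(diff, diff)
    S_inv = np.linalg.inv(S)
    R = spd_power(S_inv, 0.5)       # S^{-1/2}
    R_inv = spd_power(S_inv, -0.5)  # S^{1/2}
    A = R @ spd_power(R_inv @ D @ R_inv, t) @ R
    return (A + A.T) / 2            # remove round-off asymmetry

# Toy data: two tight clusters with candidate label sets.
X = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
candidates = [{0, 1}, {1, 2}, {3}, {3, 4}]
sim, dis = build_pairs(X, candidates, k=2)
A = gmml_metric(X, sim, dis)
```

The learned matrix `A` defines a Mahalanobis-style distance `(x - y)^T A (x - y)`, which could then be passed to a distance-based PLL algorithm in place of the Euclidean distance, in the spirit of the "front end" usage the abstract mentions.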