Rethinking the Setting of Semi-supervised Learning on Graphs.

Ziang Li,Ming Ding,Weikai Li,Zihan Wang,Ziyu Zeng,Yukuo Cen,Jie Tang

European Conference on Artificial Intelligence（2022）

引用 0|浏览346

暂无评分

摘要

We argue that the present setting of semisupervised learning on graphs may result in unfair comparisons, due to its potential risk of over-tuning hyper-parameters for models. In this paper, we highlight the significant influence of tuning hyper-parameters, which leverages the label information in the validation set to improve the performance. To explore the limit of over-tuning hyperparameters, we propose ValidUtil, an approach to fully utilize the label information in the validation set through an extra group of hyper-parameters. With ValidUtil, even GCN can easily get high accuracy of 85.8% on Cora. To avoid over-tuning, we merge the training set and the validation set and construct an i.i.d. graph benchmark (IGB) consisting of 4 datasets. Each dataset contains 100 i.i.d. graphs sampled from a large graph to reduce the evaluation variance. Our experiments suggest that IGB is a more stable benchmark than previous datasets for semisupervised learning on graphs.

查看译文

关键词

Machine Learning: Semi-Supervised Learning,Machine Learning: Sequence and Graph Learning,Multidisciplinary Topics and Applications: Web and Social Networks

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要