Adversarial training in logit space against tiny perturbations

Multimedia Systems (2023)

Abstract
Adversarial training is widely regarded as one of the most effective defenses against adversarial examples. However, existing adversarial training methods are prohibitively slow because they must generate adversarial examples in a large input space. To speed up adversarial training, we propose a novel method that generates endogenous adversarial examples (EAEs) rather than real adversarial examples. EAEs are obtained by adding perturbations to the adversarial examples in the logit space, so the gradient calculation can be avoided. To validate our method, we conduct extensive experiments on CIFAR-10 and ImageNet. The results show that EAE adversarial training not only shortens training time but also enhances the robustness of the model and has less impact on clean-example accuracy than existing state-of-the-art methods.
Keywords
Adversarial training, Endogenous adversarial examples, Perturbations, Logit space
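
The abstract says only that perturbations are added in the logit space so that no input-gradient computation is needed; it does not give the perturbation rule. The following is therefore a minimal sketch under assumed choices, not the authors' method. The function names (`perturb_logits`, `eae_style_loss`), the uniform noise bounded by `epsilon`, and the `weight` that mixes clean and perturbed losses are all hypothetical. The sketch lowers the true-class logit and raises the others, which can only increase the cross-entropy loss, and it touches nothing but the model's output vector.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Cross-entropy loss of one example given its logits and true label."""
    return -math.log(softmax(logits)[label])


def perturb_logits(logits, label, epsilon, rng):
    """Assumed perturbation rule (not taken from the paper): shift each logit
    by a random amount in [0, epsilon], downward for the true class and
    upward for every other class. No gradient is computed."""
    perturbed = []
    for k, z in enumerate(logits):
        delta = rng.uniform(0.0, epsilon)
        perturbed.append(z - delta if k == label else z + delta)
    return perturbed


def eae_style_loss(logits, label, epsilon, rng, weight=0.5):
    """Hypothetical training objective: a weighted mix of the clean loss and
    the loss on the logit-space-perturbed output."""
    clean = cross_entropy(logits, label)
    adv = cross_entropy(perturb_logits(logits, label, epsilon, rng), label)
    return (1.0 - weight) * clean + weight * adv


if __name__ == "__main__":
    rng = random.Random(0)
    logits = [2.0, 0.5, -1.0]  # illustrative model output for one example
    print("clean loss:", cross_entropy(logits, 0))
    print("mixed loss:", eae_style_loss(logits, 0, epsilon=0.5, rng=rng))
```

In a real training loop the logits would come from the network's forward pass, and the mixed loss would be backpropagated through the weights as usual; the point of the sketch is only that building the perturbed example requires no backward pass to the input.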