CC-Cert: A Probabilistic Approach to Certify General Robustness of Neural Networks

AAAI Conference on Artificial Intelligence (2022)

Abstract
In safety-critical machine learning applications, it is crucial to defend models against adversarial attacks: small modifications of the input that change the model's predictions. Besides the rigorously studied $\ell_p$-bounded additive perturbations, semantic perturbations (e.g., rotation, translation) raise serious concerns about deploying ML systems in the real world. It is therefore important to provide provable guarantees for deep learning models against semantically meaningful input transformations. In this paper, we propose a new universal probabilistic certification approach based on Chernoff-Cramér bounds that can be used in general attack settings. We estimate the probability that a model fails if the attack is sampled from a certain distribution. Our theoretical findings are supported by experimental results on different datasets.
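To illustrate the general idea behind such certification (this is a minimal sketch, not the paper's actual algorithm), one can sample attacks from the perturbation distribution, measure how much each sample erodes the classifier's score margin, and upper-bound the failure probability with the Chernoff-Cramér inequality $P(X \ge t) \le \inf_{\lambda > 0} \mathbb{E}[e^{\lambda X}] e^{-\lambda t}$, with the expectation replaced by an empirical mean. The `model` callable, the use of random rotations as the attack, and all helper names below are illustrative assumptions.

```python
# Sketch of Chernoff-Cramer-style probabilistic certification under random
# rotations. Assumes `model(x)` returns class scores for a 2D image array;
# these names are illustrative, not the paper's implementation.
import numpy as np
from scipy.ndimage import rotate

def chernoff_bound(samples, t, lams=np.linspace(0.01, 10.0, 200)):
    """Upper-bound P(X >= t) by inf over lam of E[exp(lam*X)] * exp(-lam*t),
    with the expectation replaced by an empirical mean over `samples`."""
    samples = np.asarray(samples, dtype=float)
    mgf = np.array([np.exp(lam * samples).mean() for lam in lams])
    bounds = mgf * np.exp(-lams * t)
    return float(min(bounds.min(), 1.0))  # a probability is at most 1

def score_margin(scores, y):
    """Margin of the true class score over the best competing class."""
    scores = np.asarray(scores, dtype=float)
    return scores[y] - np.delete(scores, y).max()

def certify_rotations(model, x, y, n_samples=1000, max_angle=30.0, seed=0):
    """Estimate an upper bound on the probability that a rotation drawn
    uniformly from [-max_angle, max_angle] degrees flips the prediction."""
    rng = np.random.default_rng(seed)
    clean_margin = score_margin(model(x), y)
    # X = drop in margin caused by a random rotation; the prediction flips
    # exactly when the drop exceeds the clean margin.
    drops = []
    for _ in range(n_samples):
        angle = rng.uniform(-max_angle, max_angle)
        x_rot = rotate(x, angle, reshape=False, mode="nearest")
        drops.append(clean_margin - score_margin(model(x_rot), y))
    return chernoff_bound(drops, t=clean_margin)
```

Note that the empirical moment-generating function only approximates the true expectation, so a rigorous certificate would additionally need to account for the estimation error introduced by finite sampling.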
Keywords
Machine Learning (ML)