Improving DNN Robustness to Adversarial Attacks using Jacobian Regularization

Daniel Jakubovitz, Raja Giryes; The European Conference on Computer Vision (ECCV), 2018, pp. 514-529


Deep neural networks have lately shown tremendous performance in various applications including vision and speech processing tasks. However, alongside their ability to perform these tasks with such high accuracy, it has been shown that they are highly susceptible to adversarial attacks: a small change in the input would cause the network to err with high confidence. This phenomenon exposes an inherent fault in these networks and their ability to generalize well. For this reason, providing robustness to adversarial attacks is an important challenge in networks training, which has led to extensive research. In this work, we suggest a theoretically inspired novel approach to improve the networks' robustness. Our method applies regularization using the Frobenius norm of the Jacobian of the network, which is applied as post-processing, after regular training has finished. We demonstrate empirically that it leads to enhanced robustness results with a minimal change in the original network's accuracy.

Related Material

author = {Jakubovitz, Daniel and Giryes, Raja},
title = {Improving DNN Robustness to Adversarial Attacks using Jacobian Regularization},
booktitle = {The European Conference on Computer Vision (ECCV)},
month = {September},
year = {2018}