Towards Fast and Robust Adversarial Training for Image Classification

Erh-Chung Chen, Che-Rung Lee; Proceedings of the Asian Conference on Computer Vision (ACCV), 2020


The adversarial training, which augments the training data with adversarial examples, is one of the most effective methods to defend adversarial attacks. However, its robustness degrades for complex models, and the producing of strong adversarial examples is a time-consuming task. In this paper, we proposed methods to improve the robustness and efficiency of the adversarial training. First, we utilized a re-constructor to enforce the classifier to learn the important features under perturbations. Second, we employed the enhanced FGSM to generate adversarial examples effectively. It can detect overfitting and stop training earlier without extra cost. Experiments are conducted on MNIST and CIFAR10 to validate the effectiveness of our methods. We also compared our algorithm with the state-of-the-art defense methods. The results show that our algorithm is 4-5 times faster than the previously fastest training method. For CIFAR-10, our method can achieve above 46% robust accuracy, which is better than most of other methods.

Related Material

@InProceedings{Chen_2020_ACCV, author = {Chen, Erh-Chung and Lee, Che-Rung}, title = {Towards Fast and Robust Adversarial Training for Image Classification}, booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)}, month = {November}, year = {2020} }