Adversarial Example Detection Using Latent Neighborhood Graph

Ahmed Abusnaina, Yuhang Wu, Sunpreet Arora, Yizhen Wang, Fei Wang, Hao Yang, David Mohaisen; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 7687-7696


Detection of adversarial examples with high accuracy is critical for the security of deployed deep neural network-based models. We present the first graph-based adversarial detection method that constructs a Latent Neighborhood Graph (LNG) around an input example to determine if the input example is adversarial. Given an input example, selected reference adversarial and benign examples are used to capture the local manifold in the vicinity of the input example. The LNG node connectivity parameters are optimized jointly with the parameters of a graph attention network in an end-to-end manner to determine the optimal graph topology for adversarial example detection. The graph attention network is used to determine if the LNG is derived from an adversarial or benign input example. Experimental evaluations on CIFAR-10, STL-10, and ImageNet datasets, using six adversarial attack methods, demonstrate that the proposed method outperforms state-of-the-art adversarial detection methods in white-box and gray-box settings. The proposed method is able to successfully detect adversarial examples crafted with small perturbations using unseen attacks.

Related Material

@InProceedings{Abusnaina_2021_ICCV, author = {Abusnaina, Ahmed and Wu, Yuhang and Arora, Sunpreet and Wang, Yizhen and Wang, Fei and Yang, Hao and Mohaisen, David}, title = {Adversarial Example Detection Using Latent Neighborhood Graph}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {7687-7696} }