TransForensics: Image Forgery Localization With Dense Self-Attention

Jing Hao, Zhixin Zhang, Shicai Yang, Di Xie, Shiliang Pu; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 15055-15064

Abstract


Nowadays advanced image editing tools and technical skills produce tampered images more realistically, which can easily evade image forensic systems and make authenticity verification of images more difficult. To tackle this challenging problem, we introduce TransForensics, a novel image forgery localization method inspired by Transformers. The two major components in our framework are dense self-attention encoders and dense correction modules. The former is to model global context and all pairwise interactions between local patches at different scales, while the latter is used for improving the transparency of the hidden layers and correcting the outputs from different branches. Compared to previous traditional and deep learning methods, TransForensics not only can capture discriminative representations and obtain high-quality mask predictions but is also not limited by tampering types and patch sequence orders. By conducting experiments on main benchmarks, we show that TransForensics outperforms the state-of-the-art methods by a large margin.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Hao_2021_ICCV, author = {Hao, Jing and Zhang, Zhixin and Yang, Shicai and Xie, Di and Pu, Shiliang}, title = {TransForensics: Image Forgery Localization With Dense Self-Attention}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {15055-15064} }