Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection

Lalith Bharadwaj Baru, Rohit Boddeda, Shilhora Akshay Patel, Sai Mohan Gajapaka; Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops, 2025, pp. 1661-1669

Abstract


The evolution of digital image manipulation particularly with the advancement of deep generative models significantly challenges existing deepfake detection methods especially when the origin of the deepfake is obscure. To tackle the increasing complexity of these forgeries we propose Wavelet-CLIP a deepfake detection framework that integrates wavelet transforms with features derived from the ViT-L/14 architecture pre-trained in the CLIP fashion. Wavelet-CLIP utilizes Wavelet Transforms to deeply analyze both spatial and frequency features from images thus enhancing the model's capability to detect sophisticated deepfakes. To verify the effectiveness of our approach we conducted extensive evaluations against existing state-of-the-art methods for cross-dataset generalization and detection of unseen images generated by standard diffusion models. Our method showcases outstanding performance achieving an average AUC of 0.749 for cross-data generalization and 0.893 for robustness against unseen deepfakes outperforming all compared methods. The code can be reproduced from the repo:https://github.com/lalithbharadwajbaru/wavelet-clip.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Baru_2025_WACV, author = {Baru, Lalith Bharadwaj and Boddeda, Rohit and Patel, Shilhora Akshay and Gajapaka, Sai Mohan}, title = {Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops}, month = {February}, year = {2025}, pages = {1661-1669} }