Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images

Ji, Yikun; Hong, Yan; Deng, Bowen; Lan, Jun; Zhu, Huijia; Wang, Weiqiang; Zhang, Liqing; Zhang, Jianfu

Yikun Ji, Yan Hong, Bowen Deng, Jun Lan, Huijia Zhu, Weiqiang Wang, Liqing Zhang, Jianfu Zhang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026, pp. 19165-19175

Abstract

The rapid growth of AI-generated imagery has blurred the boundary between real and synthetic content, raising practical concerns for digital integrity. Vision-language models (VLMs) can provide natural language explanations, but standard one-pass classifiers often miss subtle artifacts in high-quality synthetic images and offer limited grounding in the pixels. We propose Locate-Then-Examine (LTE), a two-stage VLM-based forensic framework that first localizes suspicious regions and then re-examines these crops together with the full image to refine the real vs. AI-generated verdict and its explanation. LTE explicitly links each decision to localized visual evidence through region proposals and region-aware reasoning. To support training and evaluation, we introduce TRACE, a dataset of 20,000 real and high-quality synthetic images with region-level annotations and automatically generated forensic explanations, constructed by a VLM-based pipeline with additional consistency checks and quality control. Across TRACE and multiple external benchmarks, LTE achieves competitive accuracy and improved robustness while providing human-understandable, region-grounded explanations suitable for forensic deployment.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Ji_2026_CVPR, author = {Ji, Yikun and Hong, Yan and Deng, Bowen and Lan, Jun and Zhu, Huijia and Wang, Weiqiang and Zhang, Liqing and Zhang, Jianfu}, title = {Locate-Then-Examine: Grounded Region Reasoning Improves Detection of AI-Generated Images}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2026}, pages = {19165-19175} }