Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images

Krishnakant Singh, Thanush Navaratnam, Jannik Holmer, Simone Schaub-Meyer, Stefan Roth; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 2505-2515

Abstract


A long-standing challenge in developing machine learning approaches has been the lack of high-quality labeled data. Recently models trained with purely synthetic data here termed synthetic clones generated using large-scale pre-trained diffusion models have shown promising results in overcoming this annotation bottleneck. As these synthetic clone models progress they are likely to be deployed in challenging real-world settings yet their suitability remains understudied. Our work addresses this gap by providing the first benchmark for three classes of synthetic clone models namely supervised self-supervised and multi-modal ones across a range of robustness measures. We show that existing synthetic self-supervised and multi-modal clones are comparable to or outperform state-of-the-art real-image baselines for a range of robustness metrics -- shape bias background bias calibration etc. However we also find that synthetic clones are much more susceptible to adversarial and real-world noise than models trained with real data. To address this we find that combining both real and synthetic data further increases the robustness and that the choice of prompt used for generating synthetic images plays an important part in the robustness of synthetic clones.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Singh_2024_CVPR, author = {Singh, Krishnakant and Navaratnam, Thanush and Holmer, Jannik and Schaub-Meyer, Simone and Roth, Stefan}, title = {Is Synthetic Data all We Need? Benchmarking the Robustness of Models Trained with Synthetic Images}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {2505-2515} }