Exploring the Sim2Real Gap Using Digital Twins

Sruthi Sudhakar, Jon Hanzelka, Josh Bobillot, Tanmay Randhavane, Neel Joshi, Vibhav Vineet; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 20418-20427

Abstract


It is very time consuming to create datasets for training computer vision models. An emerging alternative is to use synthetic data, but if the synthetic data is not similar enough to the real data, the performance is typically below that of training with real data. Thus using synthetic data still requires a large amount of time, money, and skill as one needs to author the data carefully. In this paper, we seek to understand which aspects of this authoring process are most critical. We present an analysis of which factors of variation between simulated and real data are most important. We capture images of YCB objects to create a novel YCB-Real dataset. We then create a novel synthetic "digital twin" dataset, YCB-Synthetic, which matches the YCB-Real dataset and includes variety of artifacts added to the synthetic data. We study the affects of these artifacts on our dataset and two existing published datasets on two different computer vision tasks: object detection and instance segmentation. We provide an analysis of the cost-benefit trade-offs between artist time for fixing artifacts and trained model accuracy. We plan to release this dataset (images and 3D assets) so they can be further used by the community.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Sudhakar_2023_ICCV, author = {Sudhakar, Sruthi and Hanzelka, Jon and Bobillot, Josh and Randhavane, Tanmay and Joshi, Neel and Vineet, Vibhav}, title = {Exploring the Sim2Real Gap Using Digital Twins}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2023}, pages = {20418-20427} }