MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain

Rui Yi Yong, Samuel Picosson, Arnold Wiliem; Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops, 2025, pp. 1579-1587

Abstract


This work tackles 3D scene reconstruction for a video fly-over perspective problem in the maritime domain with a specific emphasis on geometrically and visually sound reconstructions. This will allow for downstream tasks such as segmentation navigation and localization. To our knowledge there is no dataset available in this domain. As such we propose a novel maritime 3D scene reconstruction benchmarking dataset named as MTReD (Maritime Three Dimensional Reconstruction Dataset). The MTReD comprises 19 fly-over videos curated from the Internet containing ships islands and coastlines. As the task is aimed towards geometrical consistency and visual completeness the dataset uses two metrics: (1) Reprojection error; and (2) Perception based metrics. We find that existing perception based metrics such as Learned Perceptual Image Patch Similarity (LPIPS) do not appropriately measure the completeness of a reconstructed image. Thus we propose a novel semantic similarity metric utilizing DINOv2 features. We perform initial evaluation on two baselines: (1) Structured from Motion (SfM) through Colmap; and (2) the recent SOTA MASt3R model. We find that the reconstructed scenes by MASt3R have higher reprojection errors but superior perception based metric scores. To this end some pre-processing methods are explored and we find a pre-processing method which improves both the reprojection error and the semantic similarity score. We envisage our proposed MTReD to stimulate further research in these directions. The dataset and all the code will be made available in this https://github.com/RuiYiYong/MTReD.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Yong_2025_WACV, author = {Yong, Rui Yi and Picosson, Samuel and Wiliem, Arnold}, title = {MTReD: 3D Reconstruction Dataset for Fly-over Videos of Maritime Domain}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV) Workshops}, month = {February}, year = {2025}, pages = {1579-1587} }