Seeing the Sound: A New Multimodal Imaging Device for Computer Vision

Andrea Zunino, Marco Crocco, Samuele Martelli, Andrea Trucco, Alessio Del Bue, Vittorio Murino; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2015, pp. 6-14

Abstract


Audio imaging can play a fundamental role in computer vision, in particular in automated surveillance, boosting the accuracy of current systems based on standard optical cameras. We present here a new hybrid device for acoustic-optic imaging, whose characteristics are tailored to automated surveillance. In particular, the device allows realtime, high frame rate generation of an acoustic map, overlaid over a standard optical image using a geometric calibration of audio and video streams. We demonstrate the potentialities of the device for target tracking on three challenging setup showing the advantages of using acoustic images against baseline algorithms on image tracking. In particular, the proposed approach is able to overcome, often dramatically, visual tracking with state-of-art algorithms, dealing efficiently with occlusions, abrupt variations in visual appearence and camouflage. These results pave the way to a widespread use of acoustic imaging in application scenarios such as in surveillance and security.

Related Material


[pdf]
[bibtex]
@InProceedings{Zunino_2015_ICCV_Workshops,
author = {Zunino, Andrea and Crocco, Marco and Martelli, Samuele and Trucco, Andrea and Del Bue, Alessio and Murino, Vittorio},
title = {Seeing the Sound: A New Multimodal Imaging Device for Computer Vision},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {December},
year = {2015}
}