Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies

Amir Sadeghian, Alexandre Alahi, Silvio Savarese; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 300-311

Abstract


The majority of existing solutions to the Multi-Target Tracking (MTT) problem do not combine cues over a long period of time in a coherent fashion. In this paper, we present an online method that encodes long-term temporal dependencies across multiple cues. One key challenge of tracking methods is to accurately track occluded targets or those which share similar appearance properties with surrounding objects. To address this challenge, we present a structure of Recurrent Neural Networks (RNN) that jointly reasons on multiple cues over a temporal window. Our method allows to correct data association errors and recover observations from occluded states. We demonstrate the robustness of our data-driven approach by tracking multiple targets using their appearance, motion, and even interactions. Our method outperforms previous works on multiple publicly available datasets including the challenging MOT benchmark.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Sadeghian_2017_ICCV,
author = {Sadeghian, Amir and Alahi, Alexandre and Savarese, Silvio},
title = {Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}