Real-Time Segmentation Networks should be Latency Aware

Evann Courdier, Francois Fleuret; Proceedings of the Asian Conference on Computer Vision (ACCV), 2020


As scene segmentation systems reach visually accurate results, many recent papers focus on making these network architectures faster, smaller and more efficient. In particular, studies often aim at designing "real-time" systems. Achieving this goal is particularly relevant in the context of real-time video understanding for autonomous vehicles and robots. In this paper, we argue that the commonly used performance metric of mean Intersection over Union (mIoU) does not fully capture the information required to estimate the true performance of these networks when they operate in "real-time". We propose a change of objective in the segmentation task, together with an associated metric that captures this missing information: the network should predict the future segmentation map, namely the one that matches the input frame arriving at the time the network finishes processing. We introduce the associated latency-aware metric, from which we can determine a ranking. We perform latency timing experiments on some recent networks on different hardware and assess the performance of these networks on our proposed task. We propose improvements to scene segmentation networks so that they perform better on our task, using multi-frame input and increased capacity in the initial convolutional layers.
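The idea of scoring a prediction against the frame that arrives once processing ends can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`mean_iou`, `frame_offset`, `latency_aware_miou`), the rounding of latency up to a whole number of frames, and the per-frame averaging are all assumptions made for illustration only.

```python
import math
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes present in either map."""
    ious = []
    for c in range(num_classes):
        p = pred == c
        t = target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            continue  # class absent from both maps: skip it
        ious.append(np.logical_and(p, t).sum() / union)
    return float(np.mean(ious)) if ious else float("nan")

def frame_offset(latency_s, fps):
    """Frames that elapse while one input is processed.

    Assumption: latency is rounded up to a whole number of frames.
    """
    return math.ceil(latency_s * fps)

def latency_aware_miou(preds, labels, latency_s, fps, num_classes):
    """Score the prediction computed from frame t against the label of frame t + k.

    Hypothetical helper: k is the frame offset induced by the latency,
    and the per-frame mIoU values are simply averaged.
    """
    k = frame_offset(latency_s, fps)
    scores = [
        mean_iou(preds[t], labels[t + k], num_classes)
        for t in range(len(labels) - k)
    ]
    return float(np.mean(scores))

# Toy sequence: a one-pixel object (class 1) moving right by one pixel per frame.
labels = list(np.eye(5, 6, dtype=int))

# A network that perfectly segments the *current* frame.
current_frame_preds = labels
# A network that perfectly predicts the *next* frame.
next_frame_preds = labels[1:]

print(latency_aware_miou(current_frame_preds, labels, 0.0, 4, 2))
print(latency_aware_miou(current_frame_preds, labels, 0.25, 4, 2))
print(latency_aware_miou(next_frame_preds, labels, 0.25, 4, 2))
```

In this toy setting, a network that is perfect on the current frame keeps a perfect score only when its latency is zero. With a latency of one frame, its score drops, while a network that anticipates the next frame is scored as perfect. This is the kind of gap that plain mIoU would not reveal.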

Related Material

@InProceedings{Courdier_2020_ACCV,
    author    = {Courdier, Evann and Fleuret, Francois},
    title     = {Real-Time Segmentation Networks should be Latency Aware},
    booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)},
    month     = {November},
    year      = {2020}
}