Large Scale Labelled Video Data Augmentation for Semantic Segmentation in Driving Scenarios

Ignas Budvytis, Patrick Sauer, Thomas Roddick, Kesar Breen, Roberto Cipolla; Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops, 2017, pp. 230-237

Abstract


In this paper we present an analysis of the effect of large scale video data augmentation for semantic segmentation in driving scenarios. Our work is motivated by a strong correlation between the high performance of most recent deep learning based methods and the availability of large volumes of ground truth labels. To generate additional labelled data, we make use of an occlusion-aware and uncertainty-enabled label propagation algorithm. As a result we increase the availability of high-resolution labelled frames by a factor of 20, yielding a 6.8% to 10.8% rise in average classification accuracy and/or IoU scores for several semantic segmentation networks. Our key contributions include: (a) augmented CityScapes and CamVid datasets providing 56.2K and 6.5K additional labelled frames of object classes respectively, (b) a detailed empirical analysis of the effect of the use of augmented data, and (c) an extension of the proposed framework to instance segmentation.

Related Material


@InProceedings{Budvytis_2017_ICCV,
author = {Budvytis, Ignas and Sauer, Patrick and Roddick, Thomas and Breen, Kesar and Cipolla, Roberto},
title = {Large Scale Labelled Video Data Augmentation for Semantic Segmentation in Driving Scenarios},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2017}
}