Semantic Segmentation With Multi Scale Spatial Attention for Self Driving Cars

Abhinav Sagar, RajKumar Soundrapandiyan; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021, pp. 2650-2656

Abstract


In this paper, we present a novel neural network that fuses features at multiple scales for accurate and efficient semantic image segmentation. We use a ResNet-based feature extractor, dilated convolutional layers in the downsampling part, and atrous convolutional layers in the upsampling part, and merge their outputs with a concatenation operation. We propose a new attention module to encode more contextual information and enlarge the receptive field of the network. We present an in-depth theoretical analysis of our network, together with training and optimization details. The network was trained and tested on the CamVid and Cityscapes datasets, using mean accuracy per class and Intersection over Union (IoU) as the evaluation metrics. Our model outperforms previous state-of-the-art methods on semantic segmentation, achieving a mean IoU of 74.12 while running at more than 100 FPS.
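
The abstract names the two evaluation metrics but does not spell out how they are computed. The sketch below follows the standard confusion-matrix definitions, on the assumption that the paper uses them: per-class IoU is TP / (TP + FP + FN) and per-class accuracy is TP / (TP + FN), each averaged over the classes that occur. The function names `confusion_matrix` and `mean_iou_and_accuracy` are hypothetical and are not taken from the paper or its code.

```python
from typing import List, Sequence, Tuple


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count pixels: rows index the ground-truth class, columns the prediction."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def mean_iou_and_accuracy(matrix: List[List[int]]) -> Tuple[float, float]:
    """Return (mean IoU, mean per-class accuracy) from a confusion matrix.

    Assumption: classes that never appear (empty union or no ground-truth
    pixels) are skipped rather than counted as zero.
    """
    ious, accs = [], []
    for c in range(len(matrix)):
        tp = matrix[c][c]
        gt = sum(matrix[c])                    # pixels whose true label is c
        pred = sum(row[c] for row in matrix)   # pixels predicted as c
        union = gt + pred - tp                 # TP + FP + FN
        if union > 0:
            ious.append(tp / union)
        if gt > 0:
            accs.append(tp / gt)
    mean_iou = sum(ious) / len(ious) if ious else 0.0
    mean_acc = sum(accs) / len(accs) if accs else 0.0
    return mean_iou, mean_acc


# Toy example with flattened label maps (4 pixels, 2 classes).
labels = [0, 0, 1, 1]
preds = [0, 1, 1, 1]
miou, macc = mean_iou_and_accuracy(confusion_matrix(labels, preds, 2))
```

In practice the label maps would be flattened per image and the confusion matrix accumulated over the whole test set before the averages are taken.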

Related Material


@InProceedings{Sagar_2021_ICCV,
    author    = {Sagar, Abhinav and Soundrapandiyan, RajKumar},
    title     = {Semantic Segmentation With Multi Scale Spatial Attention for Self Driving Cars},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
    month     = {October},
    year      = {2021},
    pages     = {2650-2656}
}