Learning Adaptive Receptive Fields for Deep Image Parsing Network

Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 2434-2442

Abstract


In this paper, we introduce a novel approach to regulate receptive field in deep image parsing network automatically. Unlike previous works which have stressed much importance on obtaining better receptive fields using manually selected dilated convolutional kernels, our approach uses two affine transformation layers in the network's backbone and operates on feature maps. Feature maps will be inflated/shrinked by the new layer and therefore receptive fields in following layers are changed accordingly. By end-to-end training, the whole framework is data-driven without laborious manual intervention. The proposed method is generic across dataset and different tasks. We conduct extensive experiments on both general parsing task and face parsing task as concrete examples to demonstrate the method's superior regulation ability over manual designs.

Related Material


[pdf] [supp] [poster]
[bibtex]
@InProceedings{Wei_2017_CVPR,
author = {Wei, Zhen and Sun, Yao and Wang, Jinqiao and Lai, Hanjiang and Liu, Si},
title = {Learning Adaptive Receptive Fields for Deep Image Parsing Network},
booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {July},
year = {2017}
}