Embedding 3D Geometric Features for Rigid Object Part Segmentation

Yafei Song, Xiaowu Chen, Jia Li, Qinping Zhao; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 580-588

Abstract


Object part segmentation is a challenging and fundamental problem in computer vision. Its difficulties may be caused by the varying viewpoints, poses, and topological structures, which can be attributed to an essential reason, i.e., a specific object is a 3D model rather than a 2D figure. Therefore, we conjecture that not only 2D appearance features but also 3D geometric features could be helpful. With this in mind, we propose a 2-stream FCN. One stream, named AppNet, is to extract 2D appearance features from the input image. The other stream, named GeoNet, is to extract 3D geometric features. However, the problem is that the input is just an image. To this end, we design a 2D-convolution based CNN structure to extract 3D geometric features from 3D volume, which is named VolNet. Then a teacher-student strategy is adopted and VolNet teaches GeoNet how to extract 3D geometric features from an image. To perform this teaching process, we synthesize training data using 3D models. Each training sample consists of an image and its corresponding volume. A perspective voxelization algorithm is further proposed to align them. Experimental results verify our conjecture and the effectiveness of both the proposed 2-stream CNN and VolNet.

Related Material


[pdf]
[bibtex]
@InProceedings{Song_2017_ICCV,
author = {Song, Yafei and Chen, Xiaowu and Li, Jia and Zhao, Qinping},
title = {Embedding 3D Geometric Features for Rigid Object Part Segmentation},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}