FASON: First and Second Order Information Fusion Network for Texture Recognition

Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 7352-7360

Abstract


Deep networks have shown impressive performance on many computer vision tasks. Recently, deep convolutional neural networks (CNNs) have been used to learn discriminative texture representations. One of the most successful approaches is the Bilinear CNN model, which explicitly captures the second order statistics within deep features. However, such networks cut off the first order information flow in the deep network, which makes gradient back-propagation difficult. We propose an effective fusion architecture, FASON, that combines the first order and second order information flows. Our method allows gradients to back-propagate freely through both flows, so the network can be trained effectively. We then build a multi-level deep architecture to exploit the first and second order information within different convolutional layers. Experiments show that our method achieves improvements over state-of-the-art methods on several benchmark datasets.
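To make the fusion idea concrete, the following is a minimal PyTorch sketch (not the authors' released implementation) of a layer that extracts a first order descriptor via global average pooling and a second order descriptor via bilinear (outer-product) pooling from the same convolutional feature map, then concatenates them so gradients reach the backbone through both streams. Module and variable names here are illustrative assumptions.

import torch
import torch.nn as nn
import torch.nn.functional as F

class FirstSecondOrderFusion(nn.Module):
    """Illustrative sketch of first/second order fusion over a conv feature map.

    First order branch: global average pooling over spatial locations.
    Second order branch: bilinear (outer-product) pooling of the same features.
    Both descriptors are signed-sqrt and L2 normalized, then concatenated,
    so back-propagation flows through both branches.
    """

    def forward(self, x):
        # x: (batch, channels, height, width) feature map from a conv layer
        b, c, h, w = x.shape
        feats = x.view(b, c, h * w)                      # flatten spatial dims

        # First order stream: mean over spatial locations -> (b, c)
        first = feats.mean(dim=2)

        # Second order stream: bilinear pooling X X^T / (h*w) -> (b, c*c)
        second = torch.bmm(feats, feats.transpose(1, 2)) / (h * w)
        second = second.view(b, c * c)

        # Signed square-root and L2 normalization, common for bilinear features
        def normalize(v):
            v = torch.sign(v) * torch.sqrt(torch.abs(v) + 1e-12)
            return F.normalize(v, dim=1)

        return torch.cat([normalize(first), normalize(second)], dim=1)

if __name__ == "__main__":
    fusion = FirstSecondOrderFusion()
    dummy = torch.randn(2, 64, 7, 7, requires_grad=True)
    out = fusion(dummy)                  # shape: (2, 64 + 64*64)
    out.sum().backward()                 # gradients reach the input via both streams
    print(out.shape, dummy.grad is not None)

In a multi-level variant of this sketch, the same fusion could be applied to feature maps from several convolutional layers and the resulting descriptors concatenated before the classifier.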

Related Material


[bibtex]
@InProceedings{Dai_2017_CVPR,
    author    = {Dai, Xiyang and Yue-Hei Ng, Joe and Davis, Larry S.},
    title     = {FASON: First and Second Order Information Fusion Network for Texture Recognition},
    booktitle = {Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {July},
    year      = {2017}
}