Performance Guaranteed Network Acceleration via High-Order Residual Quantization

Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao; Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2017, pp. 2584-2592

Abstract


Input binarization has shown to be an effective way for network acceleration. However, previous binarization scheme could be regarded as simple pixel-wise thresholding operations (i.e., order-one approximation) and suffers a big accuracy loss. In this paper, we propose a high-order binarization scheme, which achieves more accurate approximation while still possesses the advantage of binary operation. In particular, the proposed scheme recursively performs residual quantization and yields a series of binary input images with decreasing magnitude scales. Accordingly, we propose high-order binary filtering and gradient propagation operations for both forward and backward computations. Theoretical analysis shows approximation error guarantee property of proposed method. Extensive experimental results demonstrate that the proposed scheme yields great recognition accuracy while being accelerated.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Li_2017_ICCV,
author = {Li, Zefan and Ni, Bingbing and Zhang, Wenjun and Yang, Xiaokang and Gao, Wen},
title = {Performance Guaranteed Network Acceleration via High-Order Residual Quantization},
booktitle = {Proceedings of the IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}