Clustering Convolutional Kernels to Compress Deep Neural Networks

Sanghyun Son, Seungjun Nah, Kyoung Mu Lee; Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 216-232

Abstract


In this paper, we propose a novel method to compress CNNs by reconstructing the network from a small set of spatial convolution kernels. Starting from a pre-trained model, we extract representative 2D kernel centroids using k-means clustering. Each centroid replaces the corresponding kernels of the same cluster, and we use indexed representations instead of saving whole kernels. Kernels in the same cluster share their weights, and we fine-tune the model while keeping the compressed state. Furthermore, we also suggest an efficient way of removing redundant calculations in the compressed convolutional layers. We experimentally show that our technique works well without harming the accuracy of widely-used CNNs. Also, our ResNet-18 even outperforms its uncompressed counterpart at ILSVRC2012 classification task with over 10x compression ratio.

Related Material


[pdf]
[bibtex]
@InProceedings{Son_2018_ECCV,
author = {Son, Sanghyun and Nah, Seungjun and Lee, Kyoung Mu},
title = {Clustering Convolutional Kernels to Compress Deep Neural Networks},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
month = {September},
year = {2018}
}