Layer-Wise Invertibility for Extreme Memory Cost Reduction of CNN Training

Tristan Hascoet, Quentin Febvre, Weihao Zhuang, Yasuo Ariki, Tetsuya Takiguchi; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2019

Abstract


Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art results on various computer vision problems. However, training CNNs requires specialized GPUs with large memory. GPU memory has been a major bottleneck of the CNN training procedure, limiting the size of both inputs and model architectures. Given the ubiquity of CNNs in computer vision, optimizing the memory consumption of CNN training would have widespread practical benefits. Recently, reversible neural networks have been proposed to alleviate this memory bottleneck by recomputing hidden activations through inverse operations during the backward pass of the backpropagation algorithm. In this paper, we push this idea to the extreme and design a reversible neural network with minimal training memory consumption. Our results demonstrate that we can train a CNN on the CIFAR-10 dataset using an Nvidia GTX750 GPU with only 1GB of memory, reaching 93% accuracy within 67 minutes.
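For intuition, the sketch below shows the general recompute-via-inversion idea the abstract refers to: an additive coupling block in the style of reversible residual networks (RevNets), whose input can be recovered exactly from its output, so hidden activations need not be stored for the backward pass. This is a generic PyTorch illustration under assumed names (ReversibleBlock, F, G); it is not the architecture proposed in the paper.

# A minimal sketch of a reversible block (additive coupling, RevNet style).
# NOT the paper's exact architecture: the input of each block is recomputed
# from its output instead of being stored, so activation memory does not
# grow with network depth.
import torch
import torch.nn as nn

class ReversibleBlock(nn.Module):
    """y1 = x1 + F(x2); y2 = x2 + G(y1). Invertible by construction."""
    def __init__(self, channels):
        super().__init__()
        # F and G may be arbitrary sub-networks; plain convs for illustration.
        self.F = nn.Conv2d(channels, channels, 3, padding=1)
        self.G = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x1, x2):
        y1 = x1 + self.F(x2)
        y2 = x2 + self.G(y1)
        return y1, y2

    def inverse(self, y1, y2):
        # Recompute the inputs from the outputs: x1, x2 need not be stored.
        x2 = y2 - self.G(y1)
        x1 = y1 - self.F(x2)
        return x1, x2

# Sanity check: the inverse recovers the block's input.
block = ReversibleBlock(8)
x1, x2 = torch.randn(2, 8, 32, 32), torch.randn(2, 8, 32, 32)
with torch.no_grad():
    y1, y2 = block(x1, x2)
    r1, r2 = block.inverse(y1, y2)
print(torch.allclose(x1, r1, atol=1e-5), torch.allclose(x2, r2, atol=1e-5))

During backpropagation, a memory-efficient implementation would call inverse() to rebuild each block's input on the fly before computing its gradients, trading extra computation for a roughly depth-independent activation footprint.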

Related Material


[bibtex]
@InProceedings{Hascoet_2019_ICCV,
author = {Hascoet, Tristan and Febvre, Quentin and Zhuang, Weihao and Ariki, Yasuo and Takiguchi, Tetsuya},
title = {Layer-Wise Invertibility for Extreme Memory Cost Reduction of CNN Training},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
month = {Oct},
year = {2019}
}