Analyzing Deep Neural Network's Transferability via Fréchet Distance
Transfer learning has become the de facto practice for reusing a deep neural network (DNN) pre-trained with abundant training data on a source task to improve model training on target tasks with smaller-scale training data. In this paper, we first investigate the correlation between a DNN's pre-training performance on the source task and its transfer results on downstream tasks. We find that high performance of a pre-trained model does not necessarily imply high transferability. We then propose a metric, named Fréchet Pre-train Distance, to estimate the transferability of a deep neural network. By applying the proposed Fréchet Pre-train Distance, we are able to identify the optimal pre-trained checkpoint and thereby achieve high transferability on downstream tasks. Finally, we investigate several factors that impact a DNN's transferability, including normalization, network architectures, and learning rates. The results consistently support our conclusions.
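The abstract does not define the proposed metric in detail, but the Fréchet distance between two Gaussians N(μ₁, Σ₁) and N(μ₂, Σ₂) has the closed form d² = ‖μ₁ − μ₂‖² + Tr(Σ₁ + Σ₂ − 2(Σ₁Σ₂)^{1/2}), the same quantity used in the Fréchet Inception Distance. The sketch below computes this closed form between two sets of feature vectors summarized as Gaussians; treating checkpoint features this way is an illustrative assumption, not the paper's exact procedure, and all names here (`frechet_distance`, the toy feature arrays) are hypothetical.

```python
import numpy as np
from scipy.linalg import sqrtm

def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Squared Fréchet distance between N(mu1, sigma1) and N(mu2, sigma2)."""
    diff = mu1 - mu2
    covmean = sqrtm(sigma1 @ sigma2)
    if np.iscomplexobj(covmean):
        # sqrtm can return tiny imaginary components due to numerical noise
        covmean = covmean.real
    return float(diff @ diff + np.trace(sigma1 + sigma2 - 2.0 * covmean))

# Toy example: features extracted by two hypothetical checkpoints,
# each summarized by an empirical mean and covariance.
rng = np.random.default_rng(0)
feats_a = rng.normal(0.0, 1.0, size=(1000, 8))
feats_b = rng.normal(0.5, 1.0, size=(1000, 8))
mu_a, cov_a = feats_a.mean(axis=0), np.cov(feats_a, rowvar=False)
mu_b, cov_b = feats_b.mean(axis=0), np.cov(feats_b, rowvar=False)
d = frechet_distance(mu_a, cov_a, mu_b, cov_b)
```

Under this reading, a checkpoint whose feature distribution is closer (in Fréchet distance) to the target-task distribution would be preferred for transfer, which is how one could rank pre-trained checkpoints without fine-tuning each of them.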