Can Weight Sharing Outperform Random Architecture Search? An Investigation With TuNAS

Bender, Gabriel; Liu, Hanxiao; Chen, Bo; Chu, Grace; Cheng, Shuyang; Kindermans, Pieter-Jan; Le, Quoc V.

Gabriel Bender, Hanxiao Liu, Bo Chen, Grace Chu, Shuyang Cheng, Pieter-Jan Kindermans, Quoc V. Le; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 14323-14332

Abstract

Efficient Neural Architecture Search methods based on weight sharing have shown good promise in democratizing Neural Architecture Search for computer vision models. There is, however, an ongoing debate whether these efficient methods are significantly better than random search. Here we perform a thorough comparison between efficient and random search methods on a family of progressively larger and more challenging search spaces for image classification and detection on ImageNet and COCO. While the efficacies of both methods are problem-dependent, our experiments demonstrate that there are large, realistic tasks where efficient search methods can provide substantial gains over random search. In addition, we propose and evaluate techniques which improve the quality of searched architectures and reduce the need for manual hyper-parameter tuning.

Related Material

[pdf] [supp] [arXiv] [video]

[bibtex]

@InProceedings{Bender_2020_CVPR,
author = {Bender, Gabriel and Liu, Hanxiao and Chen, Bo and Chu, Grace and Cheng, Shuyang and Kindermans, Pieter-Jan and Le, Quoc V.},
title = {Can Weight Sharing Outperform Random Architecture Search? An Investigation With TuNAS},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2020}
}