On Offline Evaluation of Vision-based Driving Models

Felipe Codevilla, Antonio M. Lopez, Vladlen Koltun, Alexey Dosovitskiy; Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 236-251


Autonomous driving models should ideally be evaluated by deploying them on a fleet of physical vehicles in the real world. Unfortunately, this approach is not practical for the vast majority of researchers. An attractive alternative is to evaluate models offline, on a pre-collected validation dataset with ground truth annotation. In this paper, we investigate the relation between various online and offline metrics for evaluation of autonomous driving models. We find that generally offline prediction no necessarily correlated with the driving quality, and two models with identical prediction error can differ dramatically in their driving performance. We show that the correlation of offline evaluation with the driving quality can be significantly improved by selecting appropriate validation dataset and suitable offline metrics.

Related Material

author = {Codevilla, Felipe and Lopez, Antonio M. and Koltun, Vladlen and Dosovitskiy, Alexey},
title = {On Offline Evaluation of Vision-based Driving Models},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
month = {September},
year = {2018}