Learning to Drive from a World Model

Goff, Mitchell; Hogan, Greg; Hotz, George; du Parc Locmaria, Armand; Raczy, Kacper; Schäfer, Harald; Shihadeh, Adeeb; Zhang, Weixing; Yousfi, Yassine

Mitchell Goff, Greg Hogan, George Hotz, Armand du Parc Locmaria, Kacper Raczy, Harald Schäfer, Adeeb Shihadeh, Weixing Zhang, Yassine Yousfi; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2025, pp. 1964-1973

Abstract

Most self-driving systems rely on hand-coded perception outputs and engineered driving rules. Learning directly from human driving data with an end-to-end method can allow for a training architecture that is simpler and scales well with compute and data. In this work, we propose an end-to-end training architecture that uses real driving data to train a driving policy in an on-policy simulator. We show two different methods of simulation, one with reprojective simulation and one with a learned world model. We show that both methods can be used to train a policy that learns driving behavior without any hand-coded driving rules. We evaluate the performance of these policies in a closed-loop simulation and when deployed in a real-world advanced driver-assistance system.

Related Material

[pdf] [arXiv]

[bibtex]

@InProceedings{Goff_2025_CVPR, author = {Goff, Mitchell and Hogan, Greg and Hotz, George and du Parc Locmaria, Armand and Raczy, Kacper and Sch\"afer, Harald and Shihadeh, Adeeb and Zhang, Weixing and Yousfi, Yassine}, title = {Learning to Drive from a World Model}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2025}, pages = {1964-1973} }