Taskology: Utilizing Task Relations at Scale

Lu, Yao; Pirk, Soren; Dlabal, Jan; Brohan, Anthony; Pasad, Ankita; Chen, Zhao; Casser, Vincent; Angelova, Anelia; Gordon, Ariel

Yao Lu, Soren Pirk, Jan Dlabal, Anthony Brohan, Ankita Pasad, Zhao Chen, Vincent Casser, Anelia Angelova, Ariel Gordon; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 8700-8709

Abstract

Many computer vision tasks address the problem of scene understanding and are naturally interrelated, e.g., object classification, detection, scene segmentation, and depth estimation. We show that we can leverage the inherent relationships among collections of tasks by training them jointly, so that they supervise each other through their known relationships via consistency losses. Furthermore, explicitly utilizing the relationships between tasks improves their performance while dramatically reducing the need for labeled data, and allows training with additional unsupervised or simulated data. We demonstrate a distributed joint training algorithm with task-level parallelism, which affords a high degree of asynchronicity and robustness. This allows learning across multiple tasks, or with large amounts of input data, at scale. We demonstrate our framework on subsets of the following collection of tasks: depth and normal prediction, semantic segmentation, 3D motion and ego-motion estimation, and object tracking and 3D detection in point clouds. We observe improved performance across these tasks, especially in the low-label regime.
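To make the idea of a consistency loss concrete, the following is a minimal sketch for one task pair named in the abstract, depth and normal prediction. It is not the paper's formulation: the function names `normals_from_depth` and `consistency_loss` are hypothetical, and the geometry assumes an orthographic camera with unit pixel spacing, where a real system would use the camera intrinsics. The loss needs no labels, since it only compares the two predictions with each other.

```python
import math

def normals_from_depth(depth):
    """Derive unit surface normals from a depth map via forward differences.

    Simplifying assumption (not from the paper): orthographic camera with
    unit pixel spacing. The output is one row and one column smaller than
    the input because forward differences need a right and a lower neighbor.
    """
    h, w = len(depth), len(depth[0])
    normals = []
    for y in range(h - 1):
        row = []
        for x in range(w - 1):
            dzdx = depth[y][x + 1] - depth[y][x]
            dzdy = depth[y + 1][x] - depth[y][x]
            # The surface z = d(x, y) has unnormalized normal (-dz/dx, -dz/dy, 1).
            n = (-dzdx, -dzdy, 1.0)
            length = math.sqrt(sum(c * c for c in n))
            row.append(tuple(c / length for c in n))
        normals.append(row)
    return normals

def consistency_loss(pred_depth, pred_normals):
    """Mean (1 - cosine similarity) between the normals predicted directly
    and the normals derived from the predicted depth.

    pred_normals is assumed to hold unit-length 3-tuples and to cover at
    least the (h - 1) x (w - 1) region produced by normals_from_depth.
    """
    derived = normals_from_depth(pred_depth)
    total, count = 0.0, 0
    for y, row in enumerate(derived):
        for x, n in enumerate(row):
            p = pred_normals[y][x]
            total += 1.0 - sum(a * b for a, b in zip(n, p))
            count += 1
    return total / count

# Usage: a flat depth map agrees with normals that point straight at the camera.
flat_depth = [[2.0] * 3 for _ in range(3)]
facing_camera = [[(0.0, 0.0, 1.0)] * 2 for _ in range(2)]
loss = consistency_loss(flat_depth, facing_camera)
```

In joint training, a term of this kind would be added to each task's objective, so that a disagreement between the depth and normal predictions yields a training signal for both models even on unlabeled images.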

Related Material

@InProceedings{Lu_2021_CVPR,
    author    = {Lu, Yao and Pirk, Soren and Dlabal, Jan and Brohan, Anthony and Pasad, Ankita and Chen, Zhao and Casser, Vincent and Angelova, Anelia and Gordon, Ariel},
    title     = {Taskology: Utilizing Task Relations at Scale},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {8700-8709}
}