Panoptic Segmentation of Satellite Image Time Series With Convolutional Temporal Attention Networks

Vivien Sainte Fare Garnot, Loic Landrieu; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 4872-4881

Abstract


Unprecedented access to multi-temporal satellite imagery has opened new perspectives for a variety of Earth observation tasks. Among them, pixel-precise panoptic segmentation of agricultural parcels has major economic and environmental implications. While researchers have explored this problem for single images, we argue that the complex temporal patterns of crop phenology are better addressed with temporal sequences of images. In this paper, we present the first end-to-end, single-stage method for panoptic segmentation of Satellite Image Time Series (SITS). This module can be combined with our novel image sequence encoding network which relies on temporal self-attention to extract rich and adaptive multi-scale spatio-temporal features. We also introduce PASTIS, the first open-access SITS dataset with panoptic annotations. We demonstrate the superiority of our encoder for semantic segmentation against multiple competing network architectures, and set up the first state-of-the-art of panoptic segmentation of SITS. Our implementation and the PASTIS dataset are publicly available at (link-upon-publication).

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Garnot_2021_ICCV, author = {Garnot, Vivien Sainte Fare and Landrieu, Loic}, title = {Panoptic Segmentation of Satellite Image Time Series With Convolutional Temporal Attention Networks}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {4872-4881} }