FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization

Lukas Picek, Klara Janouskova, Vojtech Cermak, Jiri Matas; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2025, pp. 2046-2056

Abstract


We introduce a new, challenging benchmark and a dataset, FungiTastic, based on fungal records continuously collected over a twenty-year span. The dataset is labelled and curated by experts and consists of about 350k multimodal observations of 5k fine-grained categories (species). The fungi observations include photographs and additional data, e.g., meteorological and climatic data, satellite images, and body part segmentation masks. FungiTastic is one of the few benchmarks that include a test set with DNA-sequenced ground truth of unprecedented label reliability. The benchmark is designed to support (i) standard closed-set classification, (ii) open-set classification, (iii) multi-modal classification, (iv) few-shot learning, (v) domain shift, and many more. We provide tailored baselines for many use cases, a multitude of ready-to-use pre-trained models on HuggingFace, and a framework for model training. The documentation and the baselines are available at GitHub and Kaggle.

Related Material


[pdf] [supp] [arXiv]
[bibtex]
@InProceedings{Picek_2025_CVPR, author = {Picek, Lukas and Janouskova, Klara and Cermak, Vojtech and Matas, Jiri}, title = {FungiTastic: A Multi-Modal Dataset and Benchmark for Image Categorization}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2025}, pages = {2046-2056} }