LOFI: LOng-tailed FIne-Grained Network for Food Recognition

Jesús M. Rodríguez-De-Vera, Imanol G. Estepa, Marc Bolaños, Bhalaji Nagarajan, Petia Radeva; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 3750-3760

Abstract


Food recognition plays a crucial role in several healthcare applications. Nevertheless it presents significant computer vision challenges such as long-tailed and fine-grained distributions that hinder its progress. In this work we propose LOFI a Long-tailed Fine-grained Network aimed specifically at tackling these food recognition challenges by improving the feature learning capabilities of food recognition models. Specifically we improve vanilla R-CNN architecture by tailoring it for food recognition. We design an efficient multi-task framework for fine-grained food recognition which exploits the lexical similarity of dishes during training to improve the discriminative ability of the network. Secondly we include a Graph Confidence Propagation module based on graph neural networks to aggregate the information of overlapping detections and refine the final prediction of the network. Extensive analysis and ablations of different components of LOFI highlight that it successfully addresses the targeted problems and leads to noticeable gains in performance. Remarkably the proposed method achieves competitive results and outperforms the current state-of-the-art methods in three public food benchmarks: UECFood-256 AiCrowd Food Challenge 2022 and UECFood-100 segmented.

Related Material


[pdf]
[bibtex]
@InProceedings{Rodriguez-De-Vera_2024_CVPR, author = {Rodr{\'\i}guez-De-Vera, Jes\'us M. and Estepa, Imanol G. and Bola\~nos, Marc and Nagarajan, Bhalaji and Radeva, Petia}, title = {LOFI: LOng-tailed FIne-Grained Network for Food Recognition}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {3750-3760} }