SoyStageNet: Balancing Accuracy and Efficiency for Real-Time Soybean Growth Stage Detection

Abdellah Lakhssassi, Toqi Tahamid Sarker, Khaled Ahmed, Naoufal Lakhssassi, Khalid Meksem; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2025, pp. 5533-5542

Abstract


We present SoyStageNet, a novel transformer-based architecture for accurate detection of soybean growth stages. Precise growth stage identification is critical for optimizing agricultural interventions, yet existing solutions rely on conventional CNN architectures with limited accuracy and efficiency. Our approach integrates a lightweight Mix Vision Transformer backbone with a Neural Architecture Search-based Feature Pyramid Network and task-aligned detection head to effectively capture the subtle morphological differences between growth stages. Comprehensive experiments on our specially collected and annotated soybean dataset demonstrate that SoyStageNet achieves 83.2% AP with only 17.3M parameters and real-time inference speed of 28.5 frames per second, an 87% reduction in model size compared to DINO while maintaining competitive accuracy. This efficiency makes SoyStageNet particularly suitable for resource-constrained agricultural applications requiring real-time growth stage monitoring.

Related Material


[pdf]
[bibtex]
@InProceedings{Lakhssassi_2025_CVPR, author = {Lakhssassi, Abdellah and Sarker, Toqi Tahamid and Ahmed, Khaled and Lakhssassi, Naoufal and Meksem, Khalid}, title = {SoyStageNet: Balancing Accuracy and Efficiency for Real-Time Soybean Growth Stage Detection}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2025}, pages = {5533-5542} }