-
[pdf]
[bibtex]@InProceedings{Yi_2025_ICCV, author = {Yi, Jingjun and Bi, Qi and Zheng, Hao and Zhan, Haolan and Ji, Wei and Huang, Huimin and Li, Yuexiang and Wu, Xian and Zheng, Yefeng}, title = {Learning Generalizable Diabetic Retinopathy Grading by Decoupled State Space Decoding}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops}, month = {October}, year = {2025}, pages = {4461-4471} }
Learning Generalizable Diabetic Retinopathy Grading by Decoupled State Space Decoding
Abstract
Diabetic retinopathy (DR) is a leading cause of vision loss. The rapid advancement of deep learning has significantly propelled the development of automated DR grading methods. However, retinal images are typically collected from diverse clinical centers using equipments from various vendors, leading to domain shifts. As a result, the pre-trained model for DR diagnosis has to handle the unseen retinal images when doing inference. In this paper, we propose Decoupled State Space Decoding (DSSD), a selective state space-based model designed to enhance the generalization capability of DR grading methods to unseen domains. Leveraging the selective scan mechanism to encode local patches relevant to lesions and pathological regions and the recurrent modeling to capture long-range dependencies, DSSD aims to mitigate cross-domain style variations through both channel-wise and sample-wise decoupling. Additionally, it introduces an implicit constraint between shallower and deeper state embeddings to ensure stable recurrent modeling for grade-level predictions. Experiments conducted under both leave-one-domain-out and extreme-single-domain-generalization settings demonstrate its state-of-the-art performance.
Related Material
