Enhancing Road Object Detection in Fisheye Cameras: An Effective Framework Integrating SAHI and Hybrid Inference

Bao Tran Gia, Tuong Bui Cong Khanh, Hien Ho Trong, Thuyen Tran Doan, Tien Do, Duy-Dinh Le, Thanh Duc Ngo; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 7227-7235

Abstract


Fisheye cameras are extensively employed in surveillance systems because they provide a broad viewing angle enhancing visibility. The reception of an image from a wide perspective can result in distortion posing challenges for recognition systems mainly when dealing with moving objects as observed in traffic systems. This work presents an effective framework comprising multiple modules to address the issue of small objects and rapidly changing viewing perspectives in fisheye camera data. First we use Slicing Aided Hyper Inference (SAHI) an algorithm that uses generic slicing-aided inference to deal with small objects. Second we integrate the outcomes of CNN (YOLO) and state-of-the-art Transformer (Co-DERT) detection methods to utilize the respective strengths of each strategy for handling data limitations. This approach has demonstrated promising performance achieving an F1 score of 0.6077 and achieving the 4^ th in Track 4 of the AI City Challenge 2024.

Related Material


[pdf]
[bibtex]
@InProceedings{Gia_2024_CVPR, author = {Gia, Bao Tran and Khanh, Tuong Bui Cong and Trong, Hien Ho and Doan, Thuyen Tran and Do, Tien and Le, Duy-Dinh and Ngo, Thanh Duc}, title = {Enhancing Road Object Detection in Fisheye Cameras: An Effective Framework Integrating SAHI and Hybrid Inference}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {7227-7235} }