Enhancing Retail Checkout Through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking

Arpita Vats, David C. Anastasiu; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2023, pp. 5530-5537

Abstract


The retail industry has witnessed a remarkable upswing in the utilization of cutting-edge artificial intelligence and computer vision techniques. Among the prominent challenges in this domain is the development of an automated checkout system that can address the multifaceted issues that arise in real-world checkout scenarios, including object occlusion, motion blur, and similarity in scanned items. In this paper, we propose a sophisticated deep learning-based framework that can effectively recognize, localize, track, and count products as they traverse in front of a camera. Our approach, which we call RetailCounter, is founded on a detect-then-track paradigm, wherein we apply tracking on the bounding box of the detected objects. Furthermore, we have incorporated an automatic identification of the detection region of interest (ROI) and efficient removal of unwanted objects from the ROI. The performance of our proposed framework is competitive, as evidenced by our F1 score of 0.8177 and the fourth-place ranking that we achieved in track 4 of the 2023 AI City Challenge.

Related Material


[pdf]
[bibtex]
@InProceedings{Vats_2023_CVPR, author = {Vats, Arpita and Anastasiu, David C.}, title = {Enhancing Retail Checkout Through Video Inpainting, YOLOv8 Detection, and DeepSort Tracking}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2023}, pages = {5530-5537} }