-
[pdf]
[bibtex]@InProceedings{Hoang_2024_ACCV, author = {Hoang, Tuan and Tran, Hung and Rana, Santu and Gupta, Sunil and Venkatesh, Svetha}, title = {Revisiting sample weights based method for noisy-label detection and classification}, booktitle = {Proceedings of the Asian Conference on Computer Vision (ACCV)}, month = {December}, year = {2024}, pages = {4189-4204} }
Revisiting sample weights based method for noisy-label detection and classification
Abstract
The remarkable success of Convolutional Neural Networks (CNNs) in image classification can be attributed large clean training datasets. However, real-world data is often far from noise-free, impacting the performance of resulting deep neural network (DNN) models. Existing literature focuses on noisy label detection, often drawing a clear line between noisy and clean label samples. Nevertheless, each sample contributes differently to the final model performance; some noisy-label samples may still be valuable to a certain level, while certain clean-label samples might not significantly enhance the model. In this work, assuming that a small clean-label dataset may be available, we aim to learn a sample weight for each training sample. This weight is gradually updated as the model is training to indicate the usefulness of a particular sample in minimizing loss with respect to the clean-label dataset. Consequently, our method prioritizes high-quality data samples, minimizing the impact of harmful or unhelpful ones by assigning close-to-zero weights in a weighted loss function. We empirically demonstrate that our method is not dependent on noise type and can work well for both real-world and synthetic noise. Our method can achieve state-of-the-art performance in terms of the classification accuracy on clean test sets.
Related Material