Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach

Changkun Ye, Russell Tsuchida, Lars Petersson, Nick Barnes; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 1073-1082

Abstract


As a type of distribution shift, label shift occurs when the source and target domains have different label distributions P(Y) but identical conditional distributions of data given labels P(X | Y). Under a Bayesian framework, we propose a novel Maximum A Posteriori (MAP) model and a novel posterior sampling model for the label shift problem. We prove the MAP objective admits a unique optimum and derive an EM algorithm that converges to the global optimum. We propose a novel Adaptive Prior Learning (APL) model to adaptively select prior parameters given data. We use the Markov Chain Monte Carlo (MCMC) method in our posterior sampling model to estimate and correct for label shift. Our methods can effectively resolve class imbalance problems on large-scale datasets without fine-tuning the classifier. Experiments show that our model outperforms existing methods on a variety of label shift settings. Our code is available at https://github.com/ChangkunYe/MAPLS/

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Ye_2024_WACV, author = {Ye, Changkun and Tsuchida, Russell and Petersson, Lars and Barnes, Nick}, title = {Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2024}, pages = {1073-1082} }