Classification of Facial Expression In-the-Wild Based on Ensemble of Multi-Head Cross Attention Networks

Jae-Yeop Jeong, Yeong-Gi Hong, Daun Kim, Jin-Woo Jeong, Yuchul Jung, Sang-Ho Kim; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022, pp. 2353-2358

Abstract


How to build a system for robust classification and recognition of facial expressions has been one of the most important research issues for successful interactive computing applications. However, previous datasets and studies mainly focused on facial expression recognition in a controlled/lab setting, therefore, could hardly be generalized in a more practical and real-life environment. The Affective Behavior Analysis in-the-wild (ABAW) 2022 competition released a dataset consisting of various video clips of facial expressions in-the-wild. In this paper, we propose a method based on the ensemble of multi-head cross attention networks to address the facial expression classification task introduced in the ABAW 2022 competition. We built a uni-task approach for this task, achieving the average F1-score of 34.60 on the validation set and 33.77 on the test set, ranking second place on the final leaderboard.

Related Material


[pdf]
[bibtex]
@InProceedings{Jeong_2022_CVPR, author = {Jeong, Jae-Yeop and Hong, Yeong-Gi and Kim, Daun and Jeong, Jin-Woo and Jung, Yuchul and Kim, Sang-Ho}, title = {Classification of Facial Expression In-the-Wild Based on Ensemble of Multi-Head Cross Attention Networks}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2022}, pages = {2353-2358} }