Breaking Spurious Correlations: Uncertainty-Driven Causal Transformers for AU Detection

Yuru Wang, Yue Zhou; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026, pp. 7165-7174

Abstract


Facial Action Unit (AU) detection suffers from limited annotated data, severe class imbalance, label noise, and confounding biases, which often lead to overfitting and degraded performance. We propose an Uncertainty-Driven Causal Transformer (UDCT) framework for robust AU detection by jointly modeling uncertainty and causal intervention. Specifically, we parameterize Transformer attention weights as Gaussian distributions to capture robust AU dependencies while explicitly modeling uncertainty in attention. We further introduce an uncertainty-aware loss reweighting strategy to alleviate the effects of class imbalance and label noise during training. In addition, we incorporate a causal intervention module to suppress confounder-dependent AU associations and encourage the model to focus on more stable and less biased AU relationships. Experiments on BP4D and DISFA demonstrate that UDCT achieves competitive performance with stronger robustness under noisy, imbalanced, and distribution-shifted settings.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Wang_2026_CVPR, author = {Wang, Yuru and Zhou, Yue}, title = {Breaking Spurious Correlations: Uncertainty-Driven Causal Transformers for AU Detection}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2026}, pages = {7165-7174} }