BAPose: Bottom-Up Pose Estimation With Disentangled Waterfall Representations

Bruno Artacho, Andreas Savakis; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2023, pp. 528-537

Abstract


We propose BAPose, a novel bottom-up approach that achieves state-of-the-art results for multi-person pose estimation. Our end-to-end trainable framework leverages a disentangled multi-scale waterfall architecture and incorporates adaptive convolutions to infer keypoints more precisely in crowded scenes with occlusions. The multi-scale representations, obtained by the disentangled waterfall module in BAPose, leverage the efficiency of progressive filtering in the cascade architecture, while maintaining multi-scale fields-of-view comparable to spatial pyramid configurations. Our results on the challenging COCO and CrowdPose datasets demonstrate that BAPose is an efficient and robust framework for multi-person pose estimation, significantly improving state-of-the-art accuracy.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Artacho_2023_WACV, author = {Artacho, Bruno and Savakis, Andreas}, title = {BAPose: Bottom-Up Pose Estimation With Disentangled Waterfall Representations}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops}, month = {January}, year = {2023}, pages = {528-537} }