PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement

Dong, Haoye; Lee, Gim Hee

Haoye Dong, Gim Hee Lee; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 8568-8578

Abstract

Human pose sequence refinement plays a crucial role in improving the temporal coherence of pose estimation across the sequence of frames. Despite its importance in real-world applications, human pose sequence refinement has received less attention than human pose estimation. In this paper, we propose PS-Mamba, a novel framework that refines human pose sequences by integrating spatial-temporal graph learning with state space modeling. Specifically, we introduce the Spatial-Temporal Graph State Space (ST-GSS) block, which captures spatial and temporal dependencies across joints to smooth pose sequences while preserving structural integrity. The spatial-temporal graph learns intricate joint interactions, while the state space component effectively manages temporal dynamics, reducing both short- and long-term pose instability. Besides, we incorporate a dynamic graph weight matrix to adaptively model the relative influence of joint interactions, further mitigating pose ambiguity. Experiments on challenging benchmarks show that our PS-Mamba outperforms SOTAs, achieving -14.21 mm MPJPE (+18.5%\uparrow), -13.59 mm PA-MPJPE (+22.1%\uparrow), and -0.42 mm/s2 ACCEL (+9.7%\uparrow) compared to SynSP on AIST++, significantly reducing jitters and enhancing pose stability.

Related Material

[pdf] [supp]

[bibtex]

@InProceedings{Dong_2025_ICCV, author = {Dong, Haoye and Lee, Gim Hee}, title = {PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {8568-8578} }