-
[pdf]
[supp]
[bibtex]@InProceedings{Dong_2025_ICCV, author = {Dong, Haoye and Lee, Gim Hee}, title = {PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {8568-8578} }
PS-Mamba: Spatial-Temporal Graph Mamba for Pose Sequence Refinement
Abstract
Human pose sequence refinement plays a crucial role in improving the temporal coherence of pose estimation across the sequence of frames. Despite its importance in real-world applications, human pose sequence refinement has received less attention than human pose estimation. In this paper, we propose PS-Mamba, a novel framework that refines human pose sequences by integrating spatial-temporal graph learning with state space modeling. Specifically, we introduce the Spatial-Temporal Graph State Space (ST-GSS) block, which captures spatial and temporal dependencies across joints to smooth pose sequences while preserving structural integrity. The spatial-temporal graph learns intricate joint interactions, while the state space component effectively manages temporal dynamics, reducing both short- and long-term pose instability. Besides, we incorporate a dynamic graph weight matrix to adaptively model the relative influence of joint interactions, further mitigating pose ambiguity. Experiments on challenging benchmarks show that our PS-Mamba outperforms SOTAs, achieving -14.21 mm MPJPE (+18.5%\uparrow), -13.59 mm PA-MPJPE (+22.1%\uparrow), and -0.42 mm/s2 ACCEL (+9.7%\uparrow) compared to SynSP on AIST++, significantly reducing jitters and enhancing pose stability.
Related Material
