BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction

German Barquero, Sergio Escalera, Cristina Palmero; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 2317-2327

Abstract


Stochastic human motion prediction (HMP) has generally been tackled with generative adversarial networks and variational autoencoders. Most prior works aim at predicting highly diverse motion in terms of the skeleton joints' dispersion. This has led to methods predicting fast and divergent movements, which are often unrealistic and incoherent with past motion. Such methods also neglect scenarios where anticipating diverse short-range behaviors with subtle joint displacements is important. To address these issues, we present BeLFusion, a model that, for the first time, leverages latent diffusion models in HMP to sample from a behavioral latent space where behavior is disentangled from pose and motion. Thanks to our behavior coupler, which transfers sampled behavior to ongoing motion, BeLFusion's predictions display a variety of behaviors that are significantly more realistic and more coherent with past motion than those of the state of the art. To support this claim, we introduce two metrics, the Area of the Cumulative Motion Distribution and the Average Pairwise Distance Error, which are correlated with realism according to a qualitative study (126 participants). Finally, we demonstrate BeLFusion's generalization power in a new cross-dataset scenario for stochastic HMP.
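The abstract names the Average Pairwise Distance Error but does not define it. As a rough illustration only, the sketch below assumes it is the absolute gap between the average pairwise distance (a common dispersion measure over sampled predictions) of a set of predictions and that of a reference set. The function names, the flattened-list representation of a motion, and the definition itself are assumptions, not taken from the paper.

```python
import itertools
import math


def average_pairwise_distance(samples):
    """Mean Euclidean distance over all unordered pairs of samples.

    Each sample is a flat list of floats, e.g. one predicted motion
    flattened over frames, joints, and coordinates (assumed layout).
    """
    pairs = list(itertools.combinations(samples, 2))
    if not pairs:
        # Fewer than two samples: no pair exists, so dispersion is zero.
        return 0.0
    return sum(math.dist(a, b) for a, b in pairs) / len(pairs)


def apd_error(predicted_samples, reference_samples):
    """Absolute gap between the dispersion of predictions and of a
    reference set (assumed definition, for illustration only)."""
    return abs(
        average_pairwise_distance(predicted_samples)
        - average_pairwise_distance(reference_samples)
    )
```

Under this assumed definition, a low error means the predictions are about as dispersed as the reference motions, so the measure penalizes both overly similar and overly divergent samples rather than rewarding dispersion alone.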

Related Material


[bibtex]
@InProceedings{Barquero_2023_ICCV,
    author    = {Barquero, German and Escalera, Sergio and Palmero, Cristina},
    title     = {BeLFusion: Latent Diffusion for Behavior-Driven Human Motion Prediction},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {2317-2327}
}