A Stroke of Genius: Predicting the Next Move in Badminton

Magnus Ibh, Stella Graßhof, Dan Witzner Hansen; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 3376-3385

Abstract


This paper presents a transformer encoder-decoder model for predicting future badminton strokes based on previous rally actions. The model uses court position skeleton poses and player-specific embeddings to learn stroke and player-specific latent representations in a spatiotemporal encoder module. The representations are then used to condition the subsequent strokes in a decoder module through rally-aware fusion blocks which provide additional relevant strategic and technical considerations to make more informed predictions. RallyTemPose shows improved forecasting accuracy compared to traditional sequential methods on two real-world badminton datasets. The performance boost can also be attributed to the inclusion of improved stroke embeddings extracted from the latent representation of a pre-trained large-language model subjected to detailed text descriptions of stroke descriptions. In the discussion the latent representations learned by the encoder module show useful properties regarding player analysis and comparisons.

Related Material


[pdf]
[bibtex]
@InProceedings{Ibh_2024_CVPR, author = {Ibh, Magnus and Gra{\ss}hof, Stella and Hansen, Dan Witzner}, title = {A Stroke of Genius: Predicting the Next Move in Badminton}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops}, month = {June}, year = {2024}, pages = {3376-3385} }