ScanDMM: A Deep Markov Model of Scanpath Prediction for 360deg Images

Xiangjie Sui, Yuming Fang, Hanwei Zhu, Shiqi Wang, Zhou Wang; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 6989-6999

Abstract


Scanpath prediction for 360deg images aims to produce dynamic gaze behaviors based on the human visual perception mechanism. Most existing scanpath prediction methods for 360deg images do not give a complete treatment of the time-dependency when predicting human scanpath, resulting in inferior performance and poor generalizability. In this paper, we present a scanpath prediction method for 360deg images by designing a novel Deep Markov Model (DMM) architecture, namely ScanDMM. We propose a semantics-guided transition function to learn the nonlinear dynamics of time-dependent attentional landscape. Moreover, a state initialization strategy is proposed by considering the starting point of viewing, enabling the model to learn the dynamics with the correct "launcher". We further demonstrate that our model achieves state-of-the-art performance on four 360deg image databases, and exhibit its generalizability by presenting two applications of applying scanpath prediction models to other visual tasks - saliency detection and image quality assessment, expecting to provide profound insights into these fields.

Related Material


[pdf]
[bibtex]
@InProceedings{Sui_2023_CVPR, author = {Sui, Xiangjie and Fang, Yuming and Zhu, Hanwei and Wang, Shiqi and Wang, Zhou}, title = {ScanDMM: A Deep Markov Model of Scanpath Prediction for 360deg Images}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2023}, pages = {6989-6999} }