Multiple Pairwise Ranking Networks for Personalized Video Summarization

Yassir Saquil, Da Chen, Yuan He, Chuan Li, Yong-Liang Yang; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 1718-1727

Abstract


In this paper, we investigate video summarization in the supervised setting. Since a good summary depends on the preferences of the end user, a single model that serves every user is inherently limited. We propose a model that provides personalized video summaries by conditioning the summarization process on predefined categorical user labels, referred to as preferences. The underlying method is based on multiple pairwise rankers (called Multi-ranker), which are trained jointly to provide local summaries as well as a global summarization of a given video. To demonstrate the relevance and applications of our method in contrast with a classical global summarizer, we conduct experiments on multiple benchmark datasets, notably through a user study and comparisons with state-of-the-art methods in the global video summarization task.
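
The abstract does not spell out the architecture or the loss, so the following is only a minimal sketch of the general idea of jointly trained pairwise rankers, not the authors' implementation. It assumes linear scorers, a hinge-style pairwise loss, and a global score computed as the mean of the per-preference scores; the names `score`, `pairwise_hinge`, `global_score`, and `joint_loss` are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]
Pair = Tuple[Vector, Vector]  # (preferred segment features, other segment features)


def score(weights: Vector, features: Vector) -> float:
    """Linear score of one segment under one ranker (assumed scorer form)."""
    return sum(w * x for w, x in zip(weights, features))


def pairwise_hinge(s_pos: float, s_neg: float, margin: float = 1.0) -> float:
    """Penalty when the preferred segment does not outscore the other by `margin`."""
    return max(0.0, margin - (s_pos - s_neg))


def global_score(rankers: Dict[str, Vector], features: Vector) -> float:
    """Assumed aggregation: the global score is the mean of the local scores."""
    return sum(score(w, features) for w in rankers.values()) / len(rankers)


def joint_loss(
    rankers: Dict[str, Vector],
    local_pairs: Dict[str, List[Pair]],
    global_pairs: List[Pair],
    margin: float = 1.0,
) -> float:
    """Sum of per-preference pairwise losses plus a global pairwise term."""
    total = 0.0
    for pref, pairs in local_pairs.items():
        for pos, neg in pairs:
            total += pairwise_hinge(
                score(rankers[pref], pos), score(rankers[pref], neg), margin
            )
    for pos, neg in global_pairs:
        total += pairwise_hinge(
            global_score(rankers, pos), global_score(rankers, neg), margin
        )
    return total


# Toy example with two hypothetical preferences and 2-D segment features.
rankers = {"a": [1.0, 0.0], "b": [0.0, 1.0]}
local_pairs = {
    "a": [([2.0, 0.0], [0.0, 0.0])],
    "b": [([0.0, 0.5], [0.0, 0.0])],
}
global_pairs = [([2.0, 2.0], [0.0, 0.0])]
loss = joint_loss(rankers, local_pairs, global_pairs)
```

In this toy setup, the pair for preference "a" is already separated by more than the margin, so only the pair for preference "b" contributes to the loss. In practice the scorers would be learned networks and the loss would be minimized by gradient descent.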

Related Material


@InProceedings{Saquil_2021_ICCV,
    author    = {Saquil, Yassir and Chen, Da and He, Yuan and Li, Chuan and Yang, Yong-Liang},
    title     = {Multiple Pairwise Ranking Networks for Personalized Video Summarization},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1718-1727}
}