Efficient Video Compression via Content-Adaptive Super-Resolution

Mehrdad Khani, Vibhaalakshmi Sivaraman, Mohammad Alizadeh; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 4521-4530

Abstract


Video compression is a critical component of Internet video delivery. Recent work has shown that deep learning techniques can rival or outperform human-designed algorithms, but these methods are significantly less compute and power-efficient than existing codecs. This paper presents a new approach that augments existing codecs with a small, content-adaptive super-resolution model that significantly boosts video quality. Our method, SRVC, encodes video into two bitstreams: (i) a content stream, produced by compressing downsampled low-resolution video with the existing codec, (ii) a model stream, which encodes periodic updates to a lightweight super-resolution neural network customized for short segments of the video. SRVC decodes the video by passing the decompressed low-resolution video frames through the (time-varying) super-resolution model to reconstruct high-resolution video frames. Our results show that to achieve the same PSNR, SRVC requires 20% of the bits-per-pixel of H.265 in slow mode, and 3% of the bits-per-pixel of DVC, a recent deep learning-based video compression scheme. SRVC runs at 90 frames per second on an NVIDIA V100 GPU.

Related Material


[pdf] [arXiv]
[bibtex]
@InProceedings{Khani_2021_ICCV, author = {Khani, Mehrdad and Sivaraman, Vibhaalakshmi and Alizadeh, Mohammad}, title = {Efficient Video Compression via Content-Adaptive Super-Resolution}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2021}, pages = {4521-4530} }