Tree-NeRV: Efficient Non-Uniform Sampling for Neural Video Representation via Tree-Structured Feature Grids

Zhao, Jiancheng; Zhan, Yifan; Zhu, Qingtian; Ma, Mingze; Niu, Muyao; Wan, Zunian; Ji, Xiang; Zheng, Yinqiang

Jiancheng Zhao, Yifan Zhan, Qingtian Zhu, Mingze Ma, Muyao Niu, Zunian Wan, Xiang Ji, Yinqiang Zheng; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 15076-15085

Abstract

Implicit Neural Representations for Videos (NeRV) have emerged as a powerful paradigm for video representation, enabling direct mappings from frame indices to video frames. However, existing NeRV-based methods do not fully exploit temporal redundancy, as they rely on uniform sampling along the temporal axis, leading to suboptimal Rate-Distortion (RD) performance.To address this limitation, we propose Tree-NeRV, a novel tree-structured feature representation for efficient and adaptive video encoding. Unlike conventional approaches, Tree-NeRV organizes feature representations within a Binary Search Tree (BST), enabling non-uniform sampling along the temporal axis. Additionally, we introduce an optimization-driven sampling strategy, dynamically allocating higher sampling density to regions with greater temporal variation. Extensive experiments demonstrate that Tree-NeRV achieves superior compression efficiency and reconstruction quality, outperforming prior uniform sampling-based methods. Our code is publicly available at https://github.com/zhaojiancheng007/Tree-NeRV.git.

Related Material

[pdf] [supp]

[bibtex]

@InProceedings{Zhao_2025_ICCV, author = {Zhao, Jiancheng and Zhan, Yifan and Zhu, Qingtian and Ma, Mingze and Niu, Muyao and Wan, Zunian and Ji, Xiang and Zheng, Yinqiang}, title = {Tree-NeRV: Efficient Non-Uniform Sampling for Neural Video Representation via Tree-Structured Feature Grids}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {15076-15085} }