PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features

Miao, Zhenwei; Chen, Jikai; Pan, Hongyu; Zhang, Ruiwen; Liu, Kaixuan; Hao, Peihan; Zhu, Jun; Wang, Yang; Zhan, Xin

Zhenwei Miao, Jikai Chen, Hongyu Pan, Ruiwen Zhang, Kaixuan Liu, Peihan Hao, Jun Zhu, Yang Wang, Xin Zhan; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, pp. 3279-3288

Abstract

Quantization-based methods are widely used in LiDAR points 3D object detection for its efficiency in extracting context information. Unlike image where the context information is distributed evenly over the object, most LiDAR points are distributed along the object boundary, which means the boundary features are more critical in LiDAR points 3D detection. However, quantization inevitably introduces ambiguity during both the training and inference stages. To alleviate this problem, we propose a one-stage and voting-based 3D detector, named Point-Voxel-Grid Network (PVGNet). In particular, PVGNet extracts point, voxel and grid-level features in a unified backbone architecture and produces point-wise fusion features. It segments LiDAR points into foreground and background, predicts a 3D bounding box for each foreground point, and performs group voting to get the final detection results. Moreover, we observe that instance-level point imbalance due to occlusion and observation distance also degrades the detection performance. A novel instance-aware focal loss is proposed to alleviate this problem and further improve the detection ability. We conduct experiments on the KITTI and Waymo datasets. Our proposed PVGNet outperforms previous state-of-the-art methods and ranks at the top of KITTI 3D/BEV detection leaderboards.

Related Material

[pdf]

[bibtex]

@InProceedings{Miao_2021_CVPR, author = {Miao, Zhenwei and Chen, Jikai and Pan, Hongyu and Zhang, Ruiwen and Liu, Kaixuan and Hao, Peihan and Zhu, Jun and Wang, Yang and Zhan, Xin}, title = {PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2021}, pages = {3279-3288} }