PV-RCNN++
Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection
مقال من تأليف: Li, Hongsheng ; Wang, Xiaogang ; Deng, Jiajun ; Wang, Zhe ; Jiang, Li ; Guo, Chaoxu ; Shi, Jianping ; Shi, Shaoshuai ;
ملخص: 3D object detection is receiving increasing attention from both industry and academia thanks to its wide applications in various fields. In this paper, we propose Point-Voxel Region-based Convolution Neural Networks (PV-RCNNs) for 3D object detection on point clouds. First, we propose a novel 3D detector, PV-RCNN, which boosts the 3D detection performance by deeply integrating the feature learning of both point-based set abstraction and voxel-based sparse convolution through two novel steps, i.e. , the voxel-to-keypoint scene encoding and the keypoint-to-grid RoI feature abstraction. Second, we propose an advanced framework, PV-RCNN++, for more efficient and accurate 3D object detection. It consists of two major improvements: sectorized proposal-centric sampling for efficiently producing more representative keypoints, and VectorPool aggregation for better aggregating local point features with much less resource consumption. With these two strategies, our PV-RCNN++ is about
لغة:
إنجليزية
الموضوع
الإعلام الآلي
الكلمات الدالة:
Lidar
autonomous driving
3D object detection
Point clouds
Sparse convolution