SPBA-Net point cloud object detection with sparse attention and box aligning

Haojie Sha; Qingrui Gao; Hao Zeng; Kai Li; Wang Li; Xuande Zhang; Xiaohui Wang

doi:10.1038/s41598-024-77097-z

SPBA-Net point cloud object detection with sparse attention and box aligning

Sci Rep. 2024 Nov 18;14(1):28420. doi: 10.1038/s41598-024-77097-z.

Authors

Haojie Sha¹, Qingrui Gao^#¹, Hao Zeng^#², Kai Li¹, Wang Li¹, Xuande Zhang¹, Xiaohui Wang³

Affiliations

¹ College of Applied Technology, Qingdao University, Qingdao, 266100, China.
² Institute of Software Chinese Academy of Sciences, Beijing, 100000, China.
³ College of Applied Technology, Qingdao University, Qingdao, 266100, China. Wangxhqdu@163.com.

^# Contributed equally.

Abstract

Object detection in point clouds is essential for various applications, including autonomous navigation, household robots, and augmented/virtual reality. However, during voxelization and Bird's Eye View transformation, local point cloud data often remains sparse due to non-target areas and noise points, posing a significant challenge for feature extraction. In this paper, we propose a novel mechanism named Keypoint Guided Sparse Attention (KGSA) to enhance the semantic information of point clouds by calculating Euclidean distances between selected keypoints and others. Additionally, we introduce Instance-wise Box Aligning, a method for expanding predicted boxes and clustering the points within them to achieve precise alignment between predicted bounding boxes and ground-truth targets. Experimental results demonstrate the superiority of our proposed SPBA-Net in 3D object detection on point clouds compared to other state-of-the-art methods.The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

Keywords: 3D object detection; Instance-wise box aligning; Keypoint guided sparse attention.