Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images

@article{Tian2022FullyCO,
  title={Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images},
  author={Zhi Tian and Xiangxiang Chu and Xiaoming Wang and Xiaolin Wei and Chunhua Shen},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.13764}
}
We present a simple yet effective fully convolutional one-stage 3D object detector for LiDAR point clouds of autonomous driving scenes, termed FCOS-LiDAR. Unlike the dominant methods that use the bird-eye view (BEV), our proposed detector detects objects from the range view (RV, a.k.a. range image) of the LiDAR points. Due to the range view’s compactness and compatibility with the LiDAR sensors’ sampling process on self-driving cars, the range view-based object detector can be realized by… 
1 Citations

Figures and Tables from this paper

A Robust and Lightweight Detector for Ship Target with Complex Background in SAR Image

Compared with the current mainstream algorithms, 3S-YOLO has achieved good results in all aspects to meet the real-time ship target detection in SAR images and its reasoning time can be reduced to less than 3 ms.

References

SHOWING 1-10 OF 36 REFERENCES

RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection

  • Pei SunWeiyue Wang Drago Anguelov
  • Computer Science, Environmental Science
    2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
  • 2021
Range Sparse Net (RSN) is a simple, efficient, and accurate 3D object detector that runs at more than 60 frames per second on a 150m × 150m detection region on Waymo Open Dataset (WOD) while being more accurate than previously published detectors.

RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection

This paper proposes an anchor-free single-stage LiDAR-based 3D object detector – RangeDet, and designs three components to address two issues overlooked by previous works: the scale variation between nearby and far away objects and the inconsistency between the 2D range image coordinates used in feature extraction and the 3D Cartesian coordinate used in output.

LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving

benchmark results show that this approach has significantly lower runtime than other recent detectors and that it achieves state-of-the-art performance when compared on a large dataset that has enough data to overcome the challenges of training on the range view.

Embracing Single Stride 3D Object Detector with Sparse Transformer

This paper proposes Single-stride Sparse Transformer (SST) to maintain the original resolution from the beginning to the end of the network, and addresses the problem of insufficient receptive field in single-strides architectures.

PointAugmenting: Cross-Modal Augmentation for 3D Object Detection

PointAugmenting decorates point clouds with corresponding point-wise CNN features extracted by pretrained 2D detection models, and then performs 3D object detection over the decorated point clouds and achieves the new state-of-the-art results on the nuScenes leaderboard to date.

Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection

A range-conditioned dilated block (RCD) is proposed to dynamically adjust a continuous dilation rate as a function of the measured range, achieving scale invariance and soft range gating helps mitigate the effect of occlusion.

Vehicle Detection from 3D Lidar Using Fully Convolutional Network

This paper proposes to present the data in a 2D point map and use a single 2D end-to-end fully convolutional network to predict the objectness confidence and the bounding boxes simultaneously, and shows the state-of-the-art performance of the proposed method.

SECOND: Sparsely Embedded Convolutional Detection

An improved sparse convolution method for Voxel-based 3D convolutional networks is investigated, which significantly increases the speed of both training and inference and introduces a new form of angle loss regression to improve the orientation estimation performance.

PointPillars: Fast Encoders for Object Detection From Point Clouds

benchmarks suggest that PointPillars is an appropriate encoding for object detection in point clouds, and proposes a lean downstream network.

nuScenes: A Multimodal Dataset for Autonomous Driving

Robust detection and tracking of objects is crucial for the deployment of autonomous vehicle technology. Image based benchmark datasets have driven development in computer vision tasks such as object