2021
DOI: 10.1016/j.inffus.2020.11.002
|View full text |Cite
|
Sign up to set email alerts
|

Point-cloud based 3D object detection and classification methods for self-driving applications: A survey and taxonomy

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
97
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 182 publications
(97 citation statements)
references
References 113 publications
0
97
0
Order By: Relevance
“…Semantic segmentation was inspired by the success of Deep Learning methods in producing an accurate result [ 10 , 13 , 14 ], but these techniques require an extremely large amount of data to train the network. Such large datasets may be difficult to obtain, or not provide adequate information, such as the case of man-made structures captured by sensors that only provide colourless point clouds.…”
Section: Introductionmentioning
confidence: 99%
“…Semantic segmentation was inspired by the success of Deep Learning methods in producing an accurate result [ 10 , 13 , 14 ], but these techniques require an extremely large amount of data to train the network. Such large datasets may be difficult to obtain, or not provide adequate information, such as the case of man-made structures captured by sensors that only provide colourless point clouds.…”
Section: Introductionmentioning
confidence: 99%
“…Regarding the object detection in point cloud data, models addressed in the literature have increasingly improved their detection capabilities. Generally, models in the literature have been positioned in two categories: 3D and 2D CNN-based approaches, where different data representation, backbone networks, and multi-scale feature learning techniques might be adopted [ 4 ].…”
Section: Related Workmentioning
confidence: 99%
“…They have a deeper pipeline and process a larger amount of data. For instance, typically a point cloud comprises between 100 k–120 k points [ 4 ], where each point contains data related to euclidean distance and signal reflection, i.e., 128 bits for translating each point information. Although solutions in the literature differ in the design of its pipeline, they share the same type of backbone, i.e., a stage responsible for extracting features from input data, which are based on CNN, and therefore due to the amount of data and arithmetic operations is the most computational demanding stage of the pipeline.…”
Section: Introductionmentioning
confidence: 99%
“…Firstly, it reduces the total processing required, facilitating the real-time analysis necessary for e.g, self-driving vehicles and machinery (cf. [109] for a detailed review). Secondly, it prevents the smoothing and distortion that inevitably occurs when data is projected onto a planar image grid, so retains the original data quality and resolution.…”
Section: From 2-d Raster To 3-d Point Cloudmentioning
confidence: 99%