2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.605
|View full text |Cite
|
Sign up to set email alerts
|

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds

Abstract: Semantic parsing of large-scale 3D point clouds is an important research topic in computer vision and remote sensing fields. Most existing approaches utilize hand-crafted features for each modality independently and combine them in a heuristic manner. They often fail to consider the consistency and complementary information among features adequately, which makes them difficult to capture high-level semantic structures. The features learned by most of the current deep learning methods can obtain high-quality im… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
51
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 94 publications
(52 citation statements)
references
References 56 publications
0
51
0
1
Order By: Relevance
“…Specifically, Hu and Yuan [44] suggest that DL-based algorithms can outperform the current methods that are most commonly used for ground return classification. Others have investigated the classification of features in 3D space represented as point clouds [45][46][47].…”
Section: Deep Learningmentioning
confidence: 99%
“…Specifically, Hu and Yuan [44] suggest that DL-based algorithms can outperform the current methods that are most commonly used for ground return classification. Others have investigated the classification of features in 3D space represented as point clouds [45][46][47].…”
Section: Deep Learningmentioning
confidence: 99%
“…Deep RL [36], [38] uses deep artificial neural networks as estimators. Most popularly convoluted neural networks (CNN) or recurrent neural network (RNN), mainly Long short-term memory (LSTM) [39] are used in deep RL. In order to use Qvalue in deep RL, equation 1 is updated to include the network parameters as presented in equation 3, where Φ is the preprocessed equivalent to state s t and θ stands for the parameters in the neural network (weights).…”
Section: B Reinforcement Learningmentioning
confidence: 99%
“…3D semantic segmentation [48,2,29,34] and Shape Completion [45,7,4,15] are both active areas in computer vision. 3D segmentation gives semantic labels to observed voxels, while shape completion completes missing voxels.…”
Section: D Semantic Segmentation and Shape Completionmentioning
confidence: 99%