H-CNN: Spatial Hashing Based CNN for 3D Shape Analysis

Shao, Tianjia; Yang, Yin; Weng, Yanlin; Hou, Qingwen; Zhou, Kun

doi:10.1109/tvcg.2018.2887262

Cited by 23 publications

(10 citation statements)

References 57 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Sparse-Voxel-based CNNs. The drawback of volumetric CNNs is overcome by sparse-voxel-based CNNs, which adopt spatially adaptive data structures such as octrees [Wang et al 2017] and hash tables [Choy et al 2019;Graham et al 2018;Shao et al 2018] to index non-empty voxels efficiently and constrain the convolution over these sparse voxels. A set of sparse-voxel-based CNNs have been proposed for shape reconstruction and generation, where an encoder-decoder network is learned to map input point cloud to octrees with occupancy values [Häne et al 2017;Tatarchenko et al 2017], adaptive planar patches [Wang et al 2018b], or moving-least-squares points .…”

Section: Related Workmentioning

confidence: 99%

“…Although these approaches can easily adapt deep neural networks developed for 2D images to 3D learning, their memory and computational costs grow cubically as the volumetric resolution increases, making them difficult to model 3D shape details. A set of methods [Choy et al 2019;Graham et al 2018;Shao et al 2018;Wang et al 2017] represent 3D shapes with sparse non-empty voxels and design neural networks that operate only on sparse voxels. Although these sparse-voxel-based methods significantly reduce computational and memory cost, the features in empty voxels are ignored or simply set to zero, and predicting the locations of sparse voxels for shape generation and reconstruction is a difficult task, especially for incomplete inputs.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Dual Octree Graph Networks for Learning Adaptive Volumetric Shape Representations

Wang,

Liu,

Tong

2022

Preprint

View full text Add to dashboard Cite

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Dual Octree Graph Networks for Learning Adaptive Volumetric Shape Representations

Wang,

Liu,

Tong

2022

Preprint

View full text Add to dashboard Cite

“…Methods like OctNet [22] and O-CNN [31] save computation time by using octrees to avoid processing empty spaces. [10] and [23] use Kd-tree and Hash structures instead. [26] uses sparse 3D convolutions rather than efficient data structures.…”

Section: Related Workmentioning

confidence: 99%

“…Some approaches project the 3D raw data into a regular structure (e.g. voxels) where 3D convolutions can be used [10,17,22,23,31,39]. Other approaches use multilayer perceptrons (MLP) to process point clouds directly [19,20,29].…”

Section: Introductionmentioning

confidence: 99%

Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation

Cuevas-Velasquez¹,

Gallego²,

Fisher³

2021

Preprint

View full text Add to dashboard Cite

We present an innovative two-headed attention layer that combines geometric and latent features to segment a 3D scene into semantically meaningful subsets. Each head combines local and global information, using either the geometric or latent features, of a neighborhood of points and uses this information to learn better local relationships. This Geometric-Latent attention layer (Ge-Latto) is combined with a sub-sampling strategy to capture global features. Our method is invariant to permutation thanks to the use of shared-MLP layers, and it can also be used with point clouds with varying densities because the local attention layer does not depend on the neighbor order. Our proposal is simple yet robust, which allows it to achieve competitive results in the ShapeNetPart and ModelNet40 datasets, and the state-of-the-art when segmenting the complex dataset S3DIS, with 69.2% IoU on Area 5, and 89.7% overall accuracy using K-fold crossvalidation on the 6 areas.

show abstract

“…Methods based on the 3D voxel grid data are given as follows: this method works by meshing or voxelizing various 3D data and then designing the corresponding 3D convolutional neural network for feature extraction and recognition. References [1,[4][5][6][7] is a series of convolutional neural network algorithms whose input data is a voxel grid, but these algorithms all consume a lot of computational costs because of the sparseness of the data and the features of convolution in 3D. e requirements for resolution are high.…”

Section: Related Workmentioning

confidence: 99%

Recognition of Point Sets Objects in Realistic Scenes

Gao

Zhang

2020

Mobile Information Systems

View full text Add to dashboard Cite

With the emergence of new intelligent sensing technologies such as 3D scanners and stereo vision, high-quality point clouds have become very convenient and lower cost. The research of 3D object recognition based on point clouds has also received widespread attention. Point clouds are an important type of geometric data structure. Because of its irregular format, many researchers convert this data into regular three-dimensional voxel grids or image collections. However, this can lead to unnecessary bulk of data and cause problems. In this paper, we consider the problem of recognizing objects in realistic senses. We first use Euclidean distance clustering method to segment objects in realistic scenes. Then we use a deep learning network structure to directly extract features of the point cloud data to recognize the objects. Theoretically, this network structure shows strong performance. In experiment, there is an accuracy rate of 98.8% on the training set, and the accuracy rate in the experimental test set can reach 89.7%. The experimental results show that the network structure in this paper can accurately identify and classify point cloud objects in realistic scenes and maintain a certain accuracy when the number of point clouds is small, which is very robust.

show abstract

H-CNN: Spatial Hashing Based CNN for 3D Shape Analysis

Cited by 23 publications

References 57 publications

Dual Octree Graph Networks for Learning Adaptive Volumetric Shape Representations

Dual Octree Graph Networks for Learning Adaptive Volumetric Shape Representations

Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation

Recognition of Point Sets Objects in Realistic Scenes

Contact Info

Product

Resources

About