Fully-Convolutional Point Networks for Large-Scale Point Clouds

Rethage, Dario; Wald, Johanna; Sturm, Jürgen; Navab, Nassir; Tombari, Federico

doi:10.1007/978-3-030-01225-0_37

Cited by 182 publications

(92 citation statements)

References 19 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Intuitively, such multiscale skip-connections are useful for point-based deep learning as well. A few recent works have exploited the power of multiscale representation [12,24,28,37,49] and skipconnection [8,43] in 3D learning. In this paper, we focus on point cloud upsampling and propose intra-level and interlevel point-based skip-connections.…”

Section: Related Workmentioning

confidence: 99%

Patch-Based Progressive 3D Point Set Upsampling

Wang

Huang

et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

243

185

View full text Add to dashboard Cite

Figure 1: We develop a deep neural network for 3D point set upsampling. Intuitively, our network learns different levels of detail in multiple steps, where each step focuses on a local patch from the output of the previous step. By progressively training our patch-based network end-to-end, we successfully upsample a sparse set of input points, step by step, to a dense point set with rich geometric details. Here we use circle plates for points rendering, which are color-coded by point normals. AbstractWe present a detail-driven deep neural network for point set upsampling. A high-resolution point set is essential for point-based rendering and surface reconstruction. Inspired by the recent success of neural image super-resolution techniques, we progressively train a cascade of patch-based upsampling networks on different levels of detail end-to-end. We propose a series of architectural design contributions that lead to a substantial performance boost. The effect of each technical contribution is demonstrated in an ablation study. Qualitative and quantitative experiments show that our method significantly outperforms the state-of-theart learning-based [58,59], and optimazation-based [23] approaches, both in terms of handling low-resolution inputs and revealing high-fidelity details. The data and code are at https://github.com/yifita/3pu.

show abstract

Section: Related Workmentioning

confidence: 99%

Patch-Based Progressive 3D Point Set Upsampling

Wang

Huang

et al. 2019

2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

243

185

View full text Add to dashboard Cite

show abstract

“…Ins. mIOU mIOU PointNet [25] 80.4% 83.7% PointNet++ [27] 81.9% 85.1% FCPN [29] -84.0% SyncSpecCNN [51] 82.0% 84.7% SSCN [10] 83.3% 86.0% SPLATNet [36] 83.7% 85.4% SpiderCNN [49] 81.7% 85.3% SO-Net [19] 81.0% 84.9% PCNN [2] 81.8% 85.1% KCNet [34] 82.2% 83.7% ShapeContextNet [47] -84.6% SpecGCN [41] -85.4% 3DmFV [3] 81.0% 84.3% RSNet [12] 81.4% 84.9% PointCNN [20] 84.6% 86.1% DGCNN [45] 82.3% 85.1% SGPN [44] 82.8% 85.8% PointConv [46] 82.8% 85.7% Point2Seq [23] -85.2% InterpCNN (ours) 84.0% 86.3% work in Figure 2(b). During training we randomly sample 2,048 points from each object and use the original point clouds for testing.…”

Section: Catmentioning

confidence: 99%

Interpolated Convolutional Networks for 3D Point Cloud Understanding

Mao

Wang

2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

236

156

View full text Add to dashboard Cite

Point cloud is an important type of 3D representation. However, directly applying convolutions on point clouds is challenging due to the sparse, irregular and unordered data structure. In this paper, we propose a novel Interpolated Convolution operation, InterpConv, to tackle the point cloud feature learning and understanding problem. The key idea is to utilize a set of discrete kernel weights and interpolate point features to neighboring kernel-weight coordinates by an interpolation function for convolution. A normalization term is introduced to handle neighborhoods of different sparsity levels. Our InterpConv is shown to be permutation and sparsity invariant, and can directly handle irregular inputs. We further design Interpolated Convolutional Neural Networks (InterpCNNs) based on Inter-pConv layers to handle point cloud recognition tasks including shape classification, object part segmentation and indoor scene semantic parsing. Experiments show that the networks can capture both fine-grained local structures and global shape context information effectively. The proposed approach achieves state-of-the-art performance on public benchmarks including ModelNet40, ShapeNet Parts and S3DIS.

show abstract

“…Other works such as SLAM++ [24] or Fusion++ [12] operate on an object level and create semantic scene graphs for SLAM and loop closure. Non-incremental scene understanding methods, in contrast, process a 3D scan directly to obtain semantic, instance or part segmentation [19,20,21,5,10]. Independently from the approach, all these methods rely on the assumption that…”

Section: Rgb-d Scenementioning

confidence: 99%

RIO: 3D Object Instance Re-Localization in Changing Indoor Environments

Wald

Avetisyan

Navab

et al. 2019

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Self Cite

View full text Add to dashboard Cite

Figure 1: 3D object instance re-localization benchmark: we want to robustly estimate the 6DoF pose (T1, T2, ...Tn) of changed rigid object instances from a segmented source to a target scan taken at a later point in time. AbstractIn this work, we introduce the task of 3D object instance re-localization (RIO): given one or multiple objects in an RGB-D scan, we want to estimate their corresponding 6DoF poses in another 3D scan of the same environment taken at a later point in time. We consider RIO a particularly important task in 3D vision since it enables a wide range of practical applications, including AI-assistants or robots that are asked to find a specific object in a 3D scene. To address this problem, we first introduce 3RScan, a novel dataset and benchmark, which features 1482 RGB-D scans of 478 environments across multiple time steps. Each scene includes several objects whose positions change over time, together with ground truth annotations of object instances and their respective 6DoF mappings among re-scans. Automatically finding 6DoF object poses leads to a particular challenging feature matching task due to varying partial observations and changes in the surrounding context. To this end, we introduce a new data-driven approach that efficiently finds matching features using a fully-convolutional 3D correspondence network operating on multiple spatial scales. Combined with a 6DoF pose optimization, our method outperforms state-of-the-art baselines on our newly-established benchmark, achieving an accuracy of 30.58%.

show abstract

Fully-Convolutional Point Networks for Large-Scale Point Clouds

Cited by 182 publications

References 19 publications

Patch-Based Progressive 3D Point Set Upsampling

Patch-Based Progressive 3D Point Set Upsampling

Interpolated Convolutional Networks for 3D Point Cloud Understanding

RIO: 3D Object Instance Re-Localization in Changing Indoor Environments

Contact Info

Product

Resources

About