2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00743
|View full text |Cite
|
Sign up to set email alerts
|

SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences

Abstract: Scene graphs are a compact and explicit representation successfully used in a variety of 2D scene understanding tasks. This work proposes a method to incrementally build up semantic scene graphs from a 3D environment given a sequence of RGB-D frames. To this end, we aggregate PointNet features from primitive scene components by means of a graph neural network. We also propose a novel attention mechanism well suited for partial and missing graph data present in such an incremental reconstruction scenario. Altho… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
68
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 92 publications
(68 citation statements)
references
References 51 publications
0
68
0
Order By: Relevance
“…The approaches in [4,49,50] are designed for offline operation. Other papers focus on reconstructing a graph of objects and their relations [26,63,67]. Wu et al [67] predict objects and relations in real-time using a graph-neural network.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…The approaches in [4,49,50] are designed for offline operation. Other papers focus on reconstructing a graph of objects and their relations [26,63,67]. Wu et al [67] predict objects and relations in real-time using a graph-neural network.…”
Section: Related Workmentioning
confidence: 99%
“…3D Scene Graphs [4,26,49,50,63,67] have recently emerged as powerful high-level representations of 3D environments. A 3D scene graph (Fig.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, their focus is on efficient annotation and collection of such graphs from 3D sensors. Similarly, more recent efforts such as (Zhang et al 2021;Wu et al 2021) are also targeted at improvising the efficiency of constructing a 3D scene graph from RGBD scans, while our focus is on constructing pseudo-3D graphs leveraging recent advancements in 2D-to-3D methods. We note that while precise 3D scene graphs may be important for several tasks such as robot navigation or manipulation, they need not be required for reasoning tasks such as what we consider in this paper, and for such tasks approximate 3D reasoning may be sufficient.…”
Section: Related Workmentioning
confidence: 99%
“…An important step in semantic segmentation is feature extraction. Based on the methods used for feature extraction, semantic segmentation approaches can be roughly categorized into three groups: handcrafted-feature-based [8,14,39,46,48,[50][51][52], learning-based [16,24,37,38,47], and hybrid methods [23,53]. Handcrafted features are often effective when with limited training data.…”
Section: Semantic Segmentation Of 3d Datamentioning
confidence: 99%