2022
DOI: 10.1007/s11263-021-01546-9
|View full text |Cite
|
Sign up to set email alerts
|

Learning 3D Semantic Scene Graphs with Instance Embeddings

Abstract: A 3D scene is more than the geometry and classes of the objects it comprises. An essential aspect beyond object-level perception is the scene context, described as a dense semantic network of interconnected nodes. Scene graphs have become a common representation to encode the semantic richness of images, where nodes in the graph are object entities connected by edges, so-called relationships. Such graphs have been shown to be useful in achieving state-of-the-art performance in image captioning, visual question… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(5 citation statements)
references
References 82 publications
0
5
0
Order By: Relevance
“…The model investigated here is combined with transformer for generating the scene graph by inputting video data and exploring the relationship between humans, the environment(places), and objects. As the novel proposed research, a scene graph mentioned the relationship between objects in the 3D environment [142]. It built the 3DSSG on top of 3RScan, aiming to extract the 3D geometry and depth information of 3D space.…”
Section: Graph-like Methodologiesmentioning
confidence: 99%
See 1 more Smart Citation
“…The model investigated here is combined with transformer for generating the scene graph by inputting video data and exploring the relationship between humans, the environment(places), and objects. As the novel proposed research, a scene graph mentioned the relationship between objects in the 3D environment [142]. It built the 3DSSG on top of 3RScan, aiming to extract the 3D geometry and depth information of 3D space.…”
Section: Graph-like Methodologiesmentioning
confidence: 99%
“…Rather than the 2D scene graph, the involvement of a diagram is more densely connected and informative in the 3D scene graph. As mentioned in the work [142] of scene graph generation, it allows spatialtemporal analysis of moving objects and human behaviors in a stereoscopic environment. D. RNNs: Gated Recurrent Units(GRUs) and LSTM GRU are utilized for predicting the time sequential movements with multiple variants, including GRU-SVM and GRU-D models.…”
Section: Graph-like Methodologiesmentioning
confidence: 99%
“…At present, it is a hot topic to consider the real dynamic situation in time and space dimensions. Ji et al [27] proposed to capture spatial and temporal information from video and Wald et al [28] presented a new neural network architecture of 3D data to explore the relationship between entities, and returned semantics from a given 3D scene through learning. Fernando et al [29] combined motion dynamics into the image and feed the image into any standard CNN for end-to-end learning.…”
Section: Dynamic Scenario Research In Time and Space Dimensionsmentioning
confidence: 99%
“…The model investigated here is combined with a transformer for generating the scene graph by inputting video data and exploring the relationship between humans, the environment(places), and objects. As the novel proposed research, a scene graph mentioned the relationship between objects in the 3D environment [130]. It built the 3DSSG on top of 3RScan, aiming to extract the 3D geometry and depth information of 3D space.…”
Section: Multi-modality Behavior Recognitionmentioning
confidence: 99%
“…Rather than the 2D scene graph, the involvement of a diagram is more densely connected and informative in the 3D scene graph. As mentioned in the work [130] of scene graph generation, it allows video analysis of moving objects and human behaviors in a stereoscopic environment.…”
Section: Multi-modality Behavior Recognitionmentioning
confidence: 99%