2022
DOI: 10.48550/arxiv.2209.13232
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective

Abstract: Graph Neural Networks (GNNs) have gained momentum in graph representation learning and boosted the state of the art in a variety of areas, such as data mining (e.g., social network analysis and recommender systems), computer vision (e.g., object detection and point cloud learning), and natural language processing (e.g., relation extraction and sequence learning), to name a few. With the emergence of Transformers in natural language processing and computer vision, graph Transformers embed a graph structure into… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
9
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 312 publications
0
9
0
Order By: Relevance
“…Image segmentation aims to separate an image into several semantic meaningful regions by labeling each pixel, depending on the object's appearance [15]. Deep neural networks have significantly improved this field.…”
Section: Image Segmentationmentioning
confidence: 99%
See 1 more Smart Citation
“…Image segmentation aims to separate an image into several semantic meaningful regions by labeling each pixel, depending on the object's appearance [15]. Deep neural networks have significantly improved this field.…”
Section: Image Segmentationmentioning
confidence: 99%
“…Finally, GNN is a fast-growing field, and this paper can only discuss limited aspects of GNNs. In addition, a more comprehensive review of GNN is given by Chen et al [15].…”
Section: Image Segmentationmentioning
confidence: 99%
“…For a hierarchical data structure such as a graph, graph neural networks (GNN) are well suited to perform learning [14,15]. GNNs have been shown to be effective at tasks such as node classification, link prediction, and graph classification, and have been applied to a wide range of domains including computer vision, natural language processing, electrical engineering, and bioinformatics [16][17][18][19].…”
Section: Introductionmentioning
confidence: 99%
“…In such cases, a simple graph structure is imposed a-priori (e.g., based on distances) [12] or is automatically infererred by the neural network [13]. Few works investigated the application of GNNs to the vision domain for different tasks, mainly related to point clouds [14], with the Vision GNN architecture [15] (ViG) being the most successful architecture in image classification, achieving higher performances in the image classification task compared to the ViT architecture [10].…”
Section: Introductionmentioning
confidence: 99%