A Survey on Graph Neural Networks and Graph Transformers in Computer Vision: A Task-Oriented Perspective

Chen, Chaoqi; Wu, Yushuang; Dai, Qing; Zhou, Haoming; Xu, Mutian; Yang, Sibei; Han, Xiaodong; Yu, Yizhou

doi:10.48550/arxiv.2209.13232

Cited by 7 publications

(9 citation statements)

References 312 publications

“…Image segmentation aims to separate an image into several semantic meaningful regions by labeling each pixel, depending on the object's appearance [15]. Deep neural networks have significantly improved this field.…”

Section: Image Segmentation (mentioning)

confidence: 99%

Explainer on GNN-based segmentation networks

2024

J. Phys.: Conf. Ser.

Graph neural networks (GNNs) are powerful tools for deep learning. Like other neural networks, GNNs are complex models whose decision-making procedures humans cannot readily understand, which creates a need for GNN explainability. Explainability is critical for deep learning because it supports a model's predictions. In this paper, we investigate the Grad-CAM and Integrated Gradients explanation methods. Grad-CAM applies global average pooling over the feature activation maps, followed by a ReLU activation, to obtain an attribution. Integrated Gradients explains a model by taking a line integral between a baseline image (a black image) and the source image. We demonstrate how Grad-CAM and Integrated Gradients explain a graph-based deep model in semantic segmentation tasks on the Cityscapes dataset. FCN and LRASPP-MobileNet are compared with DualGCN in the experiments to show the explanation effect.
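
The line integral that the abstract attributes to Integrated Gradients can be sketched on a toy problem. The following is only an illustrative sketch, not the cited paper's implementation: `toy_model`, `toy_grad`, and `integrated_gradients` are hypothetical names, the "image" is a two-element vector, the baseline is all zeros (standing in for a black image), and the integral is approximated with a midpoint Riemann sum.

```python
from typing import Callable, List


def integrated_gradients(
    grad_fn: Callable[[List[float]], List[float]],
    x: List[float],
    baseline: List[float],
    steps: int = 50,
) -> List[float]:
    """Approximate Integrated Gradients with a midpoint Riemann sum.

    Attribution for feature i is
    (x_i - baseline_i) * integral over alpha in [0, 1] of dF/dx_i,
    evaluated along the straight line from the baseline to x.
    """
    diffs = [xi - bi for xi, bi in zip(x, baseline)]
    grad_sums = [0.0] * len(x)
    for k in range(steps):
        alpha = (k + 0.5) / steps  # midpoint of the k-th sub-interval
        point = [bi + alpha * d for bi, d in zip(baseline, diffs)]
        for i, g in enumerate(grad_fn(point)):
            grad_sums[i] += g
    # Average gradient along the path, scaled by the input difference.
    return [d * s / steps for d, s in zip(diffs, grad_sums)]


def toy_model(x: List[float]) -> float:
    """Hypothetical differentiable model: F(x) = x0^2 + 3 * x1."""
    return x[0] ** 2 + 3.0 * x[1]


def toy_grad(x: List[float]) -> List[float]:
    """Analytic gradient of toy_model."""
    return [2.0 * x[0], 3.0]


x = [2.0, 1.0]
baseline = [0.0, 0.0]  # stands in for the black baseline image
attributions = integrated_gradients(toy_grad, x, baseline)
```

In this toy setting the attributions sum to `toy_model(x) - toy_model(baseline)`, which is the completeness property usually cited for Integrated Gradients. A real segmentation model would obtain the gradients from automatic differentiation instead of an analytic formula.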

Section: Image Segmentation (mentioning)

confidence: 99%

“…Finally, GNN is a fast-growing field, and this paper can only discuss limited aspects of GNNs. In addition, a more comprehensive review of GNN is given by Chen et al [15].…”

Section: Image Segmentation (mentioning)

confidence: 99%

Explainer on GNN-based segmentation networks

2024

J. Phys.: Conf. Ser.

“…For a hierarchical data structure such as a graph, graph neural networks (GNN) are well suited to perform learning [14,15]. GNNs have been shown to be effective at tasks such as node classification, link prediction, and graph classification, and have been applied to a wide range of domains including computer vision, natural language processing, electrical engineering, and bioinformatics [16][17][18][19].…”

Section: Introduction (mentioning)

confidence: 99%

Attention-Based Graph Neural Network for Label Propagation in Single-Cell Omics

Bhadani

Chen

2023

Genes

Single-cell data analysis has been at the forefront of development in biology and medicine since sequencing data became available. An important challenge in single-cell data analysis is the identification of cell types. Several methods have been proposed for cell-type identification. However, these methods do not capture the higher-order topological relationships between different samples. In this work, we propose an attention-based graph neural network that captures the higher-order topological relationships between different samples and performs transductive learning to predict cell types. Evaluation on both simulated and publicly available datasets demonstrates the superiority of our method, scAGN, in terms of prediction accuracy. In addition, our method works best on highly sparse datasets in terms of F1 score, precision, recall, and Matthews correlation coefficient. Further, our method's runtime is consistently lower than that of other methods.

“…In such cases, a simple graph structure is imposed a priori (e.g., based on distances) [12] or is automatically inferred by the neural network [13]. Few works investigated the application of GNNs to the vision domain for different tasks, mainly related to point clouds [14], with the Vision GNN architecture [15] (ViG) being the most successful architecture in image classification, achieving higher performances in the image classification task compared to the ViT architecture [10].…”

Section: Introduction (mentioning)

confidence: 99%