Fully Attentional Network for Semantic Segmentation

Song, Qi; Li, Jie; Li, Chenghong; Guo, Hao; Huang, Rui

doi:10.1609/aaai.v36i2.20126

Cited by 38 publications

(16 citation statements)

References 24 publications

“…In addition, choices of the pooling size combination are investigated. It is observed in Table 4 that the pooling size combination (4,8,12) outperforms (1,2,4) and (2,4,8), which makes it our settlement in MFA.…”

Section: Ablation Study For Hyperparameters

mentioning

confidence: 90%

“…EMANet [11] iterates a set of compact bases using expectation maximization algorithm to represent the whole image and runs attention calculation on this set of bases, so that the computational complexity can be reduced significantly. FLANet [12] encodes both channel and spatial attentions in one single attention map, which not only considers all the information but also reduces the computational consumption. Even though these modules have reduced the computing cost and have improved the segmentation performance, they ignore the fusion of multi-resolution features.…”

Section: Introduction

mentioning

confidence: 99%
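
The complexity reduction mentioned in the statement above can be illustrated with a rough count of attention-map entries. This is a minimal sketch, not code from EMANet or FLANet: the function name `attention_map_entries`, the 64×64 feature map, and the choice of 64 bases are illustrative assumptions. It only compares the size of a full pixel-to-pixel attention map with the size of a map computed against a small set of bases.

```python
def attention_map_entries(height, width, num_bases=None):
    """Count the entries of an attention map over an H x W feature map.

    With num_bases=None every pixel attends to every pixel (N x N entries).
    With num_bases=K every pixel attends to K compact bases (N x K entries).
    """
    num_pixels = height * width
    if num_bases is None:
        return num_pixels * num_pixels
    return num_pixels * num_bases


# Hypothetical sizes chosen only for illustration.
full = attention_map_entries(64, 64)
compact = attention_map_entries(64, 64, num_bases=64)
print(full, compact, full // compact)  # 16777216 262144 64
```

Under these assumed sizes the map computed against bases is 64 times smaller, which is the kind of saving the statement refers to.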

Attention-guided and resource-saving modules for semantic segmentation

Cao et al. 2023, J. Electron. Imag.

Self-attention has been proven to be a quite powerful yet calculation-intensive method for scene semantic segmentation. Even though many efforts have been made to explore more effective and resource-saving ways to apply self-attention, there is still space in reducing the calculation consumption. Meanwhile, since self-attention is good at fusing information, its application should be extended to multi-scale-feature-fusion, which is barely researched while the information exchange paths between features in different resolutions are mostly addition and concatenation. A special partition method decreasing the computational complexity of self-attention is investigated, and a multi-scale-feature-attention (MFA) module fusing low-resolution features containing semantic information with high-resolution features having detailed information is presented at the same time. To be specific, the proposed multi-scale-partition-attention (MPA) module and MFA module are inserted into the backbone in sequence to fuse information among all the pixels in one highly extracted feature and the pixels from features with different resolutions, respectively. Extensive experiments are carried out on semantic segmentation benchmarks including PASCAL-Context and Cityscapes to demonstrate that these two improved modules can improve the performance of the backbone in scene semantic segmentation tasks that contain multiple classes and objects in both big and small sizes.

“…Inspired by the Fully Attentional Block proposed in the literature [15] and Progressive Sampling proposed in the literature [29], the FAPS module is proposed, shown in Figures 2 and 3. The basic idea of this module is to save the feature response from the global background under the same spatial position of horizontal and vertical coordinates and then use the self‐attention mechanism to capture the fully attention similarity between the two channel maps and their spatial positions.…”

Section: Methods

mentioning

confidence: 99%

“…Similarly, in non‐local spatial attention, a situation arises in which the action between each channel dimension is missing. Based on this, the literature [15] proposed Fully Attention Block (FLA), which uses global contextual features to preserve spatial response features when computing the channel attention map, which enables full attention in a single attention and improves computational efficiency. First, the feature response of the global context is captured at each spatial location.…”

Section: Introduction

mentioning

confidence: 99%
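
The limitation that the statement above attributes to single-dimension attention can be seen in a small sketch of plain channel attention. This is not the Fully Attentional Block itself; it is a generic channel attention map (channel-by-channel dot products followed by a softmax), written in plain Python with hypothetical function names and toy values. It shows that such a map does not change when the spatial positions are rearranged, which is the spatial information the cited block is said to preserve.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def channel_attention_weights(features):
    """features: C channels, each a flat list of N spatial responses.

    Returns a C x C map. Each entry is a dot product over all N positions,
    so the map records how similar two channels are but not where.
    """
    affinity = [
        [sum(a * b for a, b in zip(ch_i, ch_j)) for ch_j in features]
        for ch_i in features
    ]
    return [softmax(row) for row in affinity]


def apply_channel_attention(features):
    """Mix channels with the C x C weights; output keeps the C x N shape."""
    weights = channel_attention_weights(features)
    num_positions = len(features[0])
    return [
        [sum(w * ch[p] for w, ch in zip(row, features)) for p in range(num_positions)]
        for row in weights
    ]


# Toy input: 3 channels, 4 spatial positions (hypothetical values).
features = [
    [1.0, 0.0, 2.0, 1.0],
    [0.0, 1.0, 0.0, 1.0],
    [1.0, 1.0, 1.0, 1.0],
]
# Reorder the spatial positions identically in every channel.
shuffled = [[ch[p] for p in (2, 0, 3, 1)] for ch in features]

# Compare the two channel maps: they agree despite the rearrangement.
print(channel_attention_weights(features) == channel_attention_weights(shuffled))
```

Because the two maps agree, a channel-only attention of this kind cannot distinguish spatial layouts on its own.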

TACT: Text attention based CNN‐Transformer network for polyp segmentation

Zhao, Li, Hua 2023, Int J Imaging Syst Tech

Colorectal cancer (CRC) has been one of the top three diseases in the world in terms of incidence for many years. Therefore, how to prevent and treat CRC has become a topic of concern for an increasing number of people, and colonoscopy is the most effective detection method in polyp examination. According to studies, 90% of CRC is caused by adenomatous polyps of the large intestine. In clinical practice, the diversity of polyps' size, number, and shape and the unclear boundary between polyps and colon folds can reduce the operator's polyp segmentation accuracy and lead to a higher rate of missed diagnosis. To better address the inaccurate segmentation or high miss rate due to the above factors, we propose a text attention‐based CNN‐Transformer network for polyp segmentation (TACT) to process the images in a way that minimizes operator subjectivity and miss rate. The network is based on the CNN‐Transformer structure, and on this basis, a fully attention progressive sampling module is added to more accurately divide the polyp boundary. Moreover, an auxiliary text classification task was added to focus on polyp size and number features in the form of text attention, which more effectively copes with the segmentation tasks of different sizes and different numbers of polyps. After comparing with multiple state‐of‐the‐art segmentation methods in four challenging datasets, our proposed TACT improves segmentation accuracy for polyps of different sizes in different datasets.

“…EVALUATION OF SEGMENTATION RESULTS (%). To test the generality of the proposed feature consistency constraints, experiments on different network structures are conducted. In this experiment, four network structures, namely FCN, DeepLab V3+, PSPNet, and FLANet [56], are employed…”

mentioning

confidence: 99%

Feature Consistency Constraints-Based CNN for Landsat Land Cover Mapping

Zhao, Luo, et al. 2023, IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

The cascade of convolution layers and the end-to-end training process facilitate CNN feature extraction and transmission, and promote the success of CNN in image processing. However, the drawback of heavily relying on large-scale high-quality training samples restricts its applications. To avoid costly and unrealistic manual annotations for large-scale remote sensing images, existing land cover maps are considered as an alternative to manual annotations, in which noisy labels are inevitable. To alleviate the impact of noisy labels, this paper proposes to improve the consistency feature learning ability of CNNs as a feasible solution in practical land cover mapping. Firstly, an intraclass feature consistency constraint is introduced to maintain the consistency of CNN feature maps for the same class. Then, an inter-iteration feature consistency constraint is employed to guide the network to learn features that are consistent with the whole underlying distribution inside a mini-batch. These two feature consistency constraints work in a cooperative and complementary manner with the traditional cross-entropy, and together improve the consistency feature learning ability of the proposed Feature Consistency Network (FCNet). Experimental results demonstrate the effectiveness of the proposed FCNet. Extensive experiments on different network structures validate the generalization of the proposed feature consistency constraints.
