2022
DOI: 10.1177/20552076221136362
|View full text |Cite
|
Sign up to set email alerts
|

RAFF-Net: An improved tongue segmentation algorithm based on residual attention network and multiscale feature fusion

Abstract: Objective Due to the complexity of face images, tongue segmentation is susceptible to interference from uneven tongue texture, lips and face, resulting in traditional methods failing to segment the tongue accurately. To address this problem, RAFF-Net, an automatic tongue region segmentation network based on residual attention network and multiscale feature fusion, was proposed. It aims to improve tongue segmentation accuracy and achieve end-to-end automated segmentation. Methods Based on the UNet backbone netw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…By sharing parameters, they simultaneously completed the tasks of tongue image segmentation and tongue image classification. Song et al [40] optimized the encoder of the UNet network by fusing the squeeze-and-excitation (SE) blocks between the residual blocks of ResNet, enhancing the feature extraction capability of the encoder. Then, they completed the optimization by introducing a weighted cross-entropy loss function.…”
Section: Deep Learning Segmentationmentioning
confidence: 99%
“…By sharing parameters, they simultaneously completed the tasks of tongue image segmentation and tongue image classification. Song et al [40] optimized the encoder of the UNet network by fusing the squeeze-and-excitation (SE) blocks between the residual blocks of ResNet, enhancing the feature extraction capability of the encoder. Then, they completed the optimization by introducing a weighted cross-entropy loss function.…”
Section: Deep Learning Segmentationmentioning
confidence: 99%
“…Therefore, in recent years, many scholars have conducted research on tongue image segmentation based on deep learning models. The research on tongue image segmentation is mainly divided into fine-tuning, optimization, and improvement based on the classical semantic segmentation model UNet, 4 7 which can achieve good tongue image segmentation effect; Some researchers also optimized and improved DeepLabV3 or DeepLabV3+ models 8 in order to obtain better tongue image segmentation effect. To improve the segmentation precision and accuracy of the model, the attention mechanism is introduced to solve the problem of inaccurate tongue segmentation and edge segmentation, and the edge information is filtered by the attention feature map, and important features are selected.…”
Section: Introductionmentioning
confidence: 99%
“…Ruan et al constructed an efficient tongue image segmentation model by optimizing the UNet network and designed a new network to specifically handle tongue edge segmentation [9]. Haibei Song et al proposed RAFF-NET for tongue region segmentation [10]。 The above-mentioned study achieved good results, but did not investigate tongue fissures. Existing methods that combine deep learning and tongue diagnosis primarily fall into two categories: object detection [11] and instance segmentation.…”
Section: Introductionmentioning
confidence: 99%
“…For example, Yushan Xue et al used crack and non-crack regions to train AlexNet, extracting deep features of the crack region, and finally performed classification using Support Vector Machines (SVM) [9]. Jianjun Yan et al proposed the Segmentation-Based Deep Learning (SBDL) model for cracked tongue image extraction and recognition [10]. Meng-Yi Li et al improved the partial encoder of the Unet architecture by introducing a global convolutional network module to address the encoder's inability to extract relatively abstract high-level semantic features, thereby achieving cracked tongue extraction.…”
Section: Introductionmentioning
confidence: 99%