2020
DOI: 10.1007/s10462-020-09854-1
|View full text |Cite
|
Sign up to set email alerts
|

Deep semantic segmentation of natural and medical images: a review

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
311
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 614 publications
(312 citation statements)
references
References 183 publications
0
311
0
1
Order By: Relevance
“…The softmax operation is performed to ensure that the prediction result is finally mapped into the (0,1) interval, which is used to represent the probability that the pixels are the background or the disc. As the most commonly used loss function, cross-entropy loss examines each pixel independently and compares the class prediction vector with ground-truth [ 15 ]. Then, cross entropy (CE) can be defined as: where is the groundtruth class, and [0, 1] is the prediction class.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The softmax operation is performed to ensure that the prediction result is finally mapped into the (0,1) interval, which is used to represent the probability that the pixels are the background or the disc. As the most commonly used loss function, cross-entropy loss examines each pixel independently and compares the class prediction vector with ground-truth [ 15 ]. Then, cross entropy (CE) can be defined as: where is the groundtruth class, and [0, 1] is the prediction class.…”
Section: Methodsmentioning
confidence: 99%
“…Attention mechanism is gradually gaining popularity in medical segmentation. The attention mechanism can be viewed as using feature map information to select and locate the most significant part of the input signal [ 15 ]. Hu et al [ 16 ] used global average pooling to aggregate feature map information, then reduced it to a single channel feature map, and finally used an activation gate to highlight salient features.…”
Section: Introductionmentioning
confidence: 99%
“…In recent years, deep learning has developed rapidly in the field of computer vision. It has made great progress in image classification [12][13][14][15][16][17][18], object detection [19,20] and image segmentation [22][23][24][25][26][27]. Compared with traditional methods, deep neural networks can automatically extract features from the input data and achieve higher accuracy.…”
Section: Related Workmentioning
confidence: 99%
“…Compared to using a single modality, multi-modalities significantly improve the performance of learning models [13,14,15,16,17]. Several relevant surveys already exist, such as deep learning-based semantic segmentation [2,3,18,19], indoor scene understanding [20,21], multimodal perception for autonomous driving [22], multimodal human motion recognition [23], multimodal medical image segmentation [24], and multimodal learning study [25,26]. However, these review works are mostly focused on unimodal image segmentation, multimodal fusion for specific domains, or multimedia analysis across video, audio, and text.…”
Section: Introductionmentioning
confidence: 99%