2019
DOI: 10.1088/1742-6596/1213/2/022003
|View full text |Cite
|
Sign up to set email alerts
|

Exploring An Easy Way for Imbalanced Data Sets in Semantic Image Segmentation

Abstract: In recent years, deep convolutional neural networks have gradually become the preferred method for image processing. After the development of Classification, Detection and Segmentation, a large variety of state-of-the-art models and algorithms have emerged in the field. However, for some specific data sets or tasks, not all methods are applicable, which is inconvenient to researchers. This paper took the data set provided in the airbus ship detection challenge in Kaggle as an example to explore an easy and eff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 5 publications
0
2
0
1
Order By: Relevance
“…For semantic segmentation (i.e., pixel-wise classification) of the catheter balloon, the commonly used U-Net architecture ( 22 ) was employed for both its high-resolution localization of objects and global context understanding, owing to its encoder-decoder structure. Here a modified U-Net model using a ResNet-34 ( 23 ) encoder (i.e., a 34 layered Residual Neural Network) was employed as it was shown to provide higher recall when compared to the standard U-Net architecture with imbalanced datasets ( 24 ), as is the case in this study (i.e., the balloon represents <1% of the image). The U-Net consisted of pre-trained weights using the ImageNet dataset ( 25 ).…”
Section: Methodsmentioning
confidence: 99%
“…For semantic segmentation (i.e., pixel-wise classification) of the catheter balloon, the commonly used U-Net architecture ( 22 ) was employed for both its high-resolution localization of objects and global context understanding, owing to its encoder-decoder structure. Here a modified U-Net model using a ResNet-34 ( 23 ) encoder (i.e., a 34 layered Residual Neural Network) was employed as it was shown to provide higher recall when compared to the standard U-Net architecture with imbalanced datasets ( 24 ), as is the case in this study (i.e., the balloon represents <1% of the image). The U-Net consisted of pre-trained weights using the ImageNet dataset ( 25 ).…”
Section: Methodsmentioning
confidence: 99%
“…4, Agustus 2023, hlm. 899-908 kenyamanan bagi para peneliti di bidang terkait dengan metode yang diusulkan block konvolusi berdasarkan Fibonacci dan menghasilkan metode sementasi yang efektif dengan perbedaan yang signifikan antara gambar background dan foregroundnya (Xia et al, 2019).…”
Section: 2unclassified
“…The first method implies using a weighted loss function during training (e.g., Weighted Binary Cross Entropy) that assigns a larger weight to samples containing buildings and, therefore, induces stronger changes in the net parameters when a building is being processed. The second method [33] suggests training the model only on positive examples, i.e., patches containing more than a pre-set number or percentage of building pixels in our case. This second approach was selected because it is expected not to affect the generalization capabilities of the network.…”
Section: Training Procedures 431 Data Processingmentioning
confidence: 99%