2020
DOI: 10.1016/j.media.2020.101759
|View full text |Cite
|
Sign up to set email alerts
|

Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis

Abstract: Supervised training of deep learning models requires large labeled datasets. There is a growing interest in obtaining such datasets for medical image analysis applications. However, the impact of label noise has not received sufficient attention. Recent studies have shown that label noise can significantly impact the performance of deep learning models in many machine learning and computer vision applications. This is especially concerning for medical applications, where datasets are typically small, labeling … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
278
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 477 publications
(282 citation statements)
references
References 110 publications
(141 reference statements)
4
278
0
Order By: Relevance
“…Therefore, it can be assumed, that there is a certain amount of noise in the training data, which might affect the accuracy of the models trained on it. Implementing a human-in-the loop approach for partially correcting the label noise could further improve performance of networks trained on the CheXpert dataset 21 . Our findings differ from applied techniques used in previous literature, where deeper network architectures, mainly a DenseNet-121, were used to classify the CheXpert data set 6,9,22 .…”
Section: Discussionmentioning
confidence: 99%
“…Therefore, it can be assumed, that there is a certain amount of noise in the training data, which might affect the accuracy of the models trained on it. Implementing a human-in-the loop approach for partially correcting the label noise could further improve performance of networks trained on the CheXpert dataset 21 . Our findings differ from applied techniques used in previous literature, where deeper network architectures, mainly a DenseNet-121, were used to classify the CheXpert data set 6,9,22 .…”
Section: Discussionmentioning
confidence: 99%
“…Labelled-image databases are usually used for the training and testing of deep neural networks and were also applied in our research. The research community has recognized the importance of the impact of the label errors (label noise) in training datasets on the model accuracy and have introduced works attempting to understand noisy training labels [25].…”
Section: Related Workmentioning
confidence: 99%
“…Dealing with label noise in the context of supervised machine learning is a well-known issue that challenges researchers since the early developments of classifiers, as detailed in [1]. A vast corpus of techniques has been developed, most of which are listed in a recent exhaustive survey [7], where even the latest methods related to general deep learning algorithms are indexed. Interestingly, in [7], authors propose to classify techniques for handling label noise into six (possibly overlapping) categories.…”
Section: Related Workmentioning
confidence: 99%
“…None of the reconstructions based on the feature vector of the ''true class 0'' matches the initial image. Then, the corresponding reconstructions from feature vector ''5'' (5,6,7). When ghost feature vectors are allowed, the network is able to reconstruct the initial image from the feature vector ''5'', which corresponds to a non-true class, while it is not the case when no ghost feature vector is allowed.…”
Section: B Proof Of Conceptmentioning
confidence: 99%