2020
DOI: 10.1371/journal.pone.0238908
|View full text |Cite
|
Sign up to set email alerts
|

Inconsistency in the use of the term “validation” in studies reporting the performance of deep learning algorithms in providing diagnosis from medical imaging

Abstract: Background The development of deep learning (DL) algorithms is a three-step process-training, tuning, and testing. Studies are inconsistent in the use of the term "validation", with some using it to refer to tuning and others testing, which hinders accurate delivery of information and may inadvertently exaggerate the performance of DL algorithms. We investigated the extent of inconsistency in usage of the term "validation" in studies on the accuracy of DL algorithms in providing diagnosis from medical imaging.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(7 citation statements)
references
References 34 publications
0
7
0
Order By: Relevance
“…Internal validation is that in the validation phase, the testing set is separated from the original dataset, whereas external validation is that using a completely independent dataset out of the original one. 86 Though DL performed inferiorly in the external group, we still appeal to further studies to apply external instead of internal validation. One of the major limitations of including studies is that the majority of them didn’t implement external validation, which made them hard to be generalized and reproduced.…”
Section: Discussionmentioning
confidence: 97%
See 1 more Smart Citation
“…Internal validation is that in the validation phase, the testing set is separated from the original dataset, whereas external validation is that using a completely independent dataset out of the original one. 86 Though DL performed inferiorly in the external group, we still appeal to further studies to apply external instead of internal validation. One of the major limitations of including studies is that the majority of them didn’t implement external validation, which made them hard to be generalized and reproduced.…”
Section: Discussionmentioning
confidence: 97%
“…Xian et al. used DL in near-infrared fluorescence imaging to help intraoperative diagnosis, 86 which requested not only accuracy but also celerity. Besides, as for the application of AI technology in low-resource areas, the acceptance ability of healthcare workers also needs to be considered.…”
Section: Discussionmentioning
confidence: 99%
“…In recent years, the success of deep learning in natural language processing, computer vision and other fields has made the recommendation field begin to pay attention to this powerful tool, and scholars have begun to explore the use of deep learning methods to improve some insurmountable weaknesses of current recommendation systems, such as data sparseness, cold start, poor interpretability and other problems [19,20]. In particular, the emergence of CNN and RNN [21][22][23][24][25][26] has achieved great success in many natural language processing (NLP) tasks. So everyone began to try to use deep learning methods, such as DeepCoNN, D-Attn [12], etc., to mine user preferences and product characteristics in review texts, and then directly apply them to predictive scoring.…”
Section: Relate Workmentioning
confidence: 99%
“…Before explaining these classifications, it is necessary to understand the term ‘validation’ clearly. In addition to its ordinary meaning (i.e., verification or confirmation), as used in this article, validation is also a technical term in the machine learning field, referring to the process of adjusting hyperparameters when making AI algorithms [ 31 ]. The process of adjusting hyperparameters is also called tuning to avoid confusion; however, validation is more widely used to indicate the procedure in AI-related literature [ 31 ].…”
Section: Evaluating Ai Algorithm Performance: Classification Accordinmentioning
confidence: 99%
“…In addition to its ordinary meaning (i.e., verification or confirmation), as used in this article, validation is also a technical term in the machine learning field, referring to the process of adjusting hyperparameters when making AI algorithms [ 31 ]. The process of adjusting hyperparameters is also called tuning to avoid confusion; however, validation is more widely used to indicate the procedure in AI-related literature [ 31 ]. On the other hand, validation test or test is used instead of validation to indicate verification of algorithm performance and distinguish it from the process of adjusting hyperparameters [ 32 33 ].…”
Section: Evaluating Ai Algorithm Performance: Classification Accordinmentioning
confidence: 99%