ObjectivesTo evaluate the application value of a generally trained artificial intelligence (AI) automatic diagnosis system in the malignancy diagnosis of follicular-patterned thyroid lesions (FPTL), including follicular thyroid carcinoma (FTC), adenomatoid hyperplasia nodule (AHN) and follicular thyroid adenoma (FTA) and compare the diagnostic performance with radiologists of different experience levels.MethodsWe retrospectively reviewed 607 patients with 699 thyroid nodules that included 168 malignant nodules by using postoperative pathology as the gold standard, and compared the diagnostic performances of three radiologists (one junior, two senior) and that of AI automatic diagnosis system in malignancy diagnosis of FPTL in terms of sensitivity, specificity and accuracy, respectively. Pairwise t-test was used to evaluate the statistically significant difference.ResultsThe accuracy of the AI system in malignancy diagnosis was 0.71, which was higher than the best radiologist in this study by a margin of 0.09 with a p-value of 2.08×10-5. Two radiologists had higher sensitivity (0.84 and 0.78) than that of the AI system (0.69) at the cost of having much lower specificity (0.35, 0.57 versus 0.71). One senior radiologist showed balanced sensitivity and specificity (0.62 and 0.54) but both were lower than that of the AI system.ConclusionsThe generally trained AI automatic diagnosis system can potentially assist radiologists for distinguishing FTC from other FPTL cases that share poorly distinguishable ultrasonographical features.
Purpose: To evaluate the application value of a generally trained artificial intelligence (AI) automatic diagnosis system in the malignancy diagnosis of rare thyroid carcinomas, such as follicular thyroid carcinoma, medullary thyroid carcinoma, primary thyroid lymphoma and anaplastic thyroid carcinoma and compare the diagnostic performance with radiologists of different experience levels.
Methods: We retrospectively studied 342 patients with 378 thyroid nodules that included 196 rare malignant nodules by using postoperative pathology as the gold standard, and compared the diagnostic performances of three radiologists (one junior, one mid-level, one senior) and that of AI automatic diagnosis system.
Results: The accuracy of the AI system in malignancy diagnosis was 0.825, which was significantly higher than that of all three radiologists and higher than the best radiologist in this study by a margin of 0.097 with P-value of 2.252×10 -16 . The mid-level radiologist and senior radiologist had higher sensitivity (0.857 and 0.959) than that of the AI system (0.847) at the cost of having much lower specificity (0.533, 0.478 versus 0.802). The junior radiologist showed relatively balanced sensitivity and specificity (0.816 and 0.549) but both were lower than that of the AI system.
Conclusions: The generally trained AI automatic diagnosis system showed high accuracy in the differential diagnosis of begin nodules and rare malignancy nodules. It may assist radiologists for screening of 3 rare malignancy nodules that even senior radiologists are not acquainted with.
To evaluate the application value of a generally trained artificial intelligence (AI) automatic diagnosis system in the malignancy diagnosis of rare thyroid carcinomas, such as follicular thyroid carcinoma, medullary thyroid carcinoma, primary thyroid lymphoma and anaplastic thyroid carcinoma and compare the diagnostic performance with radiologists of different experience levels.
MethodsWe retrospectively studied 342 patients with 378 thyroid nodules that included 196 rare malignant nodules by using postoperative pathology as the gold standard, and compared the diagnostic performances of three radiologists (one junior, one mid-level, one senior) and that of AI automatic diagnosis system.
ResultsThe accuracy of the AI system in malignancy diagnosis was 0.825, which was significantly higher than that of all three radiologists and higher than the best radiologist in this study by a margin of 0.097 with P-value of 2.252 × 10 -16 . The mid-level radiologist and senior radiologist had higher sensitivity (0.857 and 0.959) than that of the AI system (0.847) at the cost of having much lower specificity (0.533, 0.478 versus 0.802). The junior radiologist showed relatively balanced sensitivity and specificity (0.816 and 0.549) but both were lower than that of the AI system.
ConclusionsThe generally trained AI automatic diagnosis system showed high accuracy in the differential diagnosis of begin nodules and rare malignancy nodules. It may assist radiologists for screening of 3 rare malignancy nodules that even senior radiologists are not acquainted with.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.