Using a Dual-Input Convolutional Neural Network for Automated Detection of Pediatric Supracondylar Fracture on Conventional Radiography

Choi, Jae Won; Cho, Yeon Jin; Lee, Seowoo; Lee, Ji Hyuk; Lee, Seung–Hyun; Choi, Young Hun; Cheon, Jung Eun; Ha, Ji Young

doi:10.1097/rli.0000000000000615

Cited by 69 publications

(59 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They have successfully been used for fracture detection and localization on radiographs [3][4][5][6][7][8][9][10][11][12]. Training data for automated fracture detection have been heterogeneously labeled by orthopedic surgeons [5], orthopedic specialists [6], radiology [10,11,13,14] or orthopedic [15] residents and general radiologists [4] or specialized musculoskeletal radiologists [7,8]. Cheng et al [8] used registry data to label hip fractures on radiographs and only Olczak et al [12] used key phrases of radiology reports to label radiographs for the training set.…”

Section: Introductionmentioning

confidence: 99%

AI-based detection and classification of distal radius fractures using low-effort data labeling: evaluation of applicability and effect of training set size

et al. 2021

View full text Add to dashboard Cite

Objectives To evaluate the performance of a deep convolutional neural network (DCNN) in detecting and classifying distal radius fractures, metal, and cast on radiographs using labels based on radiology reports. The secondary aim was to evaluate the effect of the training set size on the algorithm’s performance. Methods A total of 15,775 frontal and lateral radiographs, corresponding radiology reports, and a ResNet18 DCNN were used. Fracture detection and classification models were developed per view and merged. Incrementally sized subsets served to evaluate effects of the training set size. Two musculoskeletal radiologists set the standard of reference on radiographs (test set A). A subset (B) was rated by three radiology residents. For a per-study-based comparison with the radiology residents, the results of the best models were merged. Statistics used were ROC and AUC, Youden’s J statistic (J), and Spearman’s correlation coefficient (ρ). Results The models’ AUC/J on (A) for metal and cast were 0.99/0.98 and 1.0/1.0. The models’ and residents’ AUC/J on (B) were similar on fracture (0.98/0.91; 0.98/0.92) and multiple fragments (0.85/0.58; 0.91/0.70). Training set size and AUC correlated on metal (ρ = 0.740), cast (ρ = 0.722), fracture (frontal ρ = 0.947, lateral ρ = 0.946), multiple fragments (frontal ρ = 0.856), and fragment displacement (frontal ρ = 0.595). Conclusions The models trained on a DCNN with report-based labels to detect distal radius fractures on radiographs are suitable to aid as a secondary reading tool; models for fracture classification are not ready for clinical use. Bigger training sets lead to better models in all categories except joint affection. Key Points • Detection of metal and cast on radiographs is excellent using AI and labels extracted from radiology reports. • Automatic detection of distal radius fractures on radiographs is feasible and the performance approximates radiology residents. • Automatic classification of the type of distal radius fracture varies in accuracy and is inferior for joint involvement and fragment displacement.

show abstract

Section: Introductionmentioning

confidence: 99%

AI-based detection and classification of distal radius fractures using low-effort data labeling: evaluation of applicability and effect of training set size

et al. 2021

View full text Add to dashboard Cite

show abstract

“…Three studies 6 , 24 , 25 reported the area under the receiving operating characteristics curve (AUC-ROC) to evaluate IV and EV performance. The AUC is a common metric to report CNN performance, 26 where a value of 1.0 indicates perfect discriminatory performance, whereas 0.5 indicates a prediction equal to that of chance.…”

Section: Methodsmentioning

confidence: 99%

“… 6 Zhou et al 27 addressed both fracture detection and classification. The CNNs detected fractures on a single anatomical location like the wrist, 6 , 24 elbow, 25 or ribs. 27 Input features of three studies 6 , 24 , 25 were conventional radiographs; one study used CT scans.…”

Section: Methodsmentioning

confidence: 99%

See 1 more Smart Citation

An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics

et al. 2021

View full text Add to dashboard Cite

Aims The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing. External validation of a CNN on a temporally separate (separated by time) or geographically separate (separated by location) dataset is crucial to assess generalizability of the CNN before application to clinical practice in other institutions. We aimed to answer the following questions: are current CNNs for fracture recognition externally valid?; which methods are applied for external validation (EV)?; and, what are reported performances of the EV sets compared to the internal validation (IV) sets of these CNNs? Methods The PubMed and Embase databases were systematically searched from January 2010 to October 2020 according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement. The type of EV, characteristics of the external dataset, and diagnostic performance characteristics on the IV and EV datasets were collected and compared. Quality assessment was conducted using a seven-item checklist based on a modified Methodologic Index for NOn-Randomized Studies instrument (MINORS). Results Out of 1,349 studies, 36 reported development of a CNN for fracture detection and/or classification. Of these, only four (11%) reported a form of EV. One study used temporal EV, one conducted both temporal and geographical EV, and two used geographical EV. When comparing the CNN’s performance on the IV set versus the EV set, the following were found: AUCs of 0.967 (IV) versus 0.975 (EV), 0.976 (IV) versus 0.985 to 0.992 (EV), 0.93 to 0.96 (IV) versus 0.80 to 0.89 (EV), and F1-scores of 0.856 to 0.863 (IV) versus 0.757 to 0.840 (EV). Conclusion The number of externally validated CNNs in orthopaedic trauma for fracture recognition is still scarce. This greatly limits the potential for transfer of these CNNs from the developing institute to another hospital to achieve similar diagnostic performance. We recommend the use of geographical EV and statements such as the Consolidated Standards of Reporting Trials–Artificial Intelligence (CONSORT-AI), the Standard Protocol Items: Recommendations for Interventional Trials–Artificial Intelligence (SPIRIT-AI) and the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis–Machine Learning (TRIPOD-ML) to critically appraise performance of CNNs and improve methodological rigor, quality of future models, and facilitate eventual implementation in clinical practice. Cite this article: Bone Jt Open 2021;2(10):879–885.

show abstract

“…Our interdisciplinary research group started working in the field of pediatric trauma computer vision applications in 2018 [52]. As described [90,91], AI can be helpful in the domain of automated fracture detection. One of the main hurdles in establishing AI algorithms is the lack of annotated training data sets and data quality.…”

Section: Personal Experiencesmentioning

confidence: 99%

The augmented radiologist: artificial intelligence in the practice of radiology

et al. 2021

View full text Add to dashboard Cite

In medicine, particularly in radiology, there are great expectations in artificial intelligence (AI), which can “see” more than human radiologists in regard to, for example, tumor size, shape, morphology, texture and kinetics — thus enabling better care by earlier detection or more precise reports. Another point is that AI can handle large data sets in high-dimensional spaces. But it should not be forgotten that AI is only as good as the training samples available, which should ideally be numerous enough to cover all variants. On the other hand, the main feature of human intelligence is content knowledge and the ability to find near-optimal solutions. The purpose of this paper is to review the current complexity of radiology working places, to describe their advantages and shortcomings. Further, we give an AI overview of the different types and features as used so far. We also touch on the differences between AI and human intelligence in problem-solving. We present a new AI type, labeled “explainable AI,” which should enable a balance/cooperation between AI and human intelligence — thus bringing both worlds in compliance with legal requirements. For support of (pediatric) radiologists, we propose the creation of an AI assistant that augments radiologists and keeps their brain free for generic tasks.

show abstract

Using a Dual-Input Convolutional Neural Network for Automated Detection of Pediatric Supracondylar Fracture on Conventional Radiography

Cited by 69 publications

References 26 publications

AI-based detection and classification of distal radius fractures using low-effort data labeling: evaluation of applicability and effect of training set size

AI-based detection and classification of distal radius fractures using low-effort data labeling: evaluation of applicability and effect of training set size

An increasing number of convolutional neural networks for fracture recognition and classification in orthopaedics

The augmented radiologist: artificial intelligence in the practice of radiology

Contact Info

Product

Resources

About