Multiclass datasets expand neural network utility: an example on ankle radiographs

Kim, Suam; Rebmann, Philipp; Tran, Phuong Hien; Kellner, Elias; Reisert, Marco; Steybe, David; Bayer, Jörg; Bamberg, Fabian; Kotter, Elmar; Russe, Maximilian

doi:10.1007/s11548-023-02839-9

Cited by 3 publications

(1 citation statement)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For our classification model, we modified a headless imagenet-pretrained Xception-network based on the keras implementation 19 by adding a global average pooling and a drop-out layer (10% drop-out during training) as well as 2 dense layers (128 and 2 units, respectively). Empirically, 20 model performances could be improved by replacing the superfluous two input layers in the third dimension of imagenet-pretrained networks with filtered copies of the original two-dimensional image after application of a brightness-inversion and an adaptive mean thresholding edge-enhancing filter, respectively. We conjecture generally facilitated fracture delineation by the corresponding and contrasting input information as the reason for improvement, similar to the common practice of performing brightness inversion on radiographs when looking for fracture or pleural lines.…”

Section: Methodsmentioning

confidence: 99%

AI-based X-ray fracture analysis of the distal radius: accuracy between representative classification, detection and segmentation deep learning models for clinical practice

Russe,

Rebmann,

Tran

et al. 2024

BMJ Open

Self Cite

View full text Add to dashboard Cite

ObjectivesTo aid in selecting the optimal artificial intelligence (AI) solution for clinical application, we directly compared performances of selected representative custom-trained or commercial classification, detection and segmentation models for fracture detection on musculoskeletal radiographs of the distal radius by aligning their outputs.Design and settingThis single-centre retrospective study was conducted on a random subset of emergency department radiographs from 2008 to 2018 of the distal radius in Germany.Materials and methodsAn image set was created to be compatible with training and testing classification and segmentation models by annotating examinations for fractures and overlaying fracture masks, if applicable. Representative classification and segmentation models were trained on 80% of the data. After output binarisation, their derived fracture detection performances as well as that of a standard commercially available solution were compared on the remaining X-rays (20%) using mainly accuracy and area under the receiver operating characteristic (AUROC).ResultsA total of 2856 examinations with 712 (24.9%) fractures were included in the analysis. Accuracies reached up to 0.97 for the classification model, 0.94 for the segmentation model and 0.95 for BoneView. Cohen’s kappa was at least 0.80 in pairwise comparisons, while Fleiss’ kappa was 0.83 for all models. Fracture predictions were visualised with all three methods at different levels of detail, ranking from downsampled image region for classification over bounding box for detection to single pixel-level delineation for segmentation.ConclusionsAll three investigated approaches reached high performances for detection of distal radius fractures with simple preprocessing and postprocessing protocols on the custom-trained models. Despite their underlying structural differences, selection of one’s fracture analysis AI tool in the frame of this study reduces to the desired flavour of automation: automated classification, AI-assisted manual fracture reading or minimised false negatives.

show abstract

Section: Methodsmentioning

confidence: 99%