Purpose: To develop and validate a deep learning (DL) algorithm that predicts referable glaucomatous optic neuropathy (GON) and optic nerve head (ONH) features from color fundus images, to determine the relative importance of these features in referral decisions by glaucoma specialists (GSs) and the algorithm, and to compare the performance of the algorithm with eye care providers.Design: Development and validation of an algorithm.Participants: Fundus images from screening programs, studies, and a glaucoma clinic. Methods: A DL algorithm was trained using a retrospective dataset of 86 618 images, assessed for glaucomatous ONH features and referable GON (defined as ONH appearance worrisome enough to justify referral for comprehensive examination) by 43 graders. The algorithm was validated using 3 datasets: dataset A (1205 images, 1 image/patient; 18.1% referable), images adjudicated by panels of GSs; dataset B (9642 images, 1 image/ patient; 9.2% referable), images from a diabetic teleretinal screening program; and dataset C (346 images, 1 image/patient; 81.7% referable), images from a glaucoma clinic.Main Outcome Measures: The algorithm was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, and specificity for referable GON and glaucomatous ONH features.Results: The algorithm's AUC for referable GON was 0.945 (95% confidence interval [CI], 0.929e0.960) in dataset A, 0.855 (95% CI, 0.841e0.870) in dataset B, and 0.881 (95% CI, 0.838e0.918) in dataset C. Algorithm AUCs ranged between 0.661 and 0.973 for glaucomatous ONH features. The algorithm showed significantly higher sensitivity than 7 of 10 graders not involved in determining the reference standard, including 2 of 3 GSs, and showed higher specificity than 3 graders (including 1 GS), while remaining comparable to others. For both GSs and the algorithm, the most crucial features related to referable GON were: presence of vertical cup-to-disc ratio of 0.7 or more, neuroretinal rim notching, retinal nerve fiber layer defect, and bared circumlinear vessels.Conclusions: A DL algorithm trained on fundus images alone can detect referable GON with higher sensitivity than and comparable specificity to eye care providers. The algorithm maintained good performance on an independent dataset with diagnoses based on a full glaucoma workup.
Children with epilepsy in low-income countries often go undiagnosed and untreated. We examine a portable, low-cost smartphone-based EEG technology in a heterogeneous pediatric epilepsy cohort in the West African Republic of Guinea. Methods: : Children with epilepsy were recruited at the Ignace Deen Hospital in Conakry, 2017. Participants underwent sequential EEG recordings with an app-based EEG, the Smartphone Brain Scanner-2 (SBS2) and a standard Xltek EEG. Raw EEG data were transmitted via Bluetooth ™ connection to an Android ™ tablet and uploaded for remote EEG specialist review and reporting via a new, secure web-based reading platform, crowdEEG. The results were compared to same-visit Xltek 10-20 EEG recordings for identification of epileptiform and nonepileptiform abnormalities. Results: : 97 children meeting the International League Against Epilepsy's definition of epilepsy (49 male; mean age 10.3 years, 29 untreated with an antiepileptic drug; 0 with a prior EEG) were enrolled. Epileptiform discharges were detected on 21 (25.3%) SBS2 and 31 (37.3%) standard EEG recordings. The SBS2 had a sensitivity of 51.6% (95%CI 32.4%, 70.8%) and a specificity of 90.4% (95%CI 81.4%, 94.4%) for all types of epileptiform discharges, with positive and negative predictive values of 76.2% and 75.8% respectively. For generalized discharges, the SBS2 had a sensitivity of 43.5% with a specificity of 96.2%. Conclusions: : The SBS2 has a moderate sensitivity and high specificity for the detection of epileptiform abnormalities in children with epilepsy in this low-income setting. Use of the SBS2+crowdEEG platform permits specialist input for patients with previously poor access to clinical neurophysiology expertise.
Expert disagreement is pervasive in clinical decision making and collective adjudication is a useful approach for resolving divergent assessments. Prior work shows that expert disagreement can arise due to diverse factors including expert background, the quality and presentation of data, and guideline clarity. In this work, we study how these factors predict initial discrepancies in the context of medical time series analysis, examining why certain disagreements persist after adjudication, and how adjudication impacts clinical decisions. Results from a case study with 36 experts and 4,543 adjudicated cases in a sleep stage classification task show that these factors contribute to both initial disagreement and resolvability, each in their own unique way. We provide evidence suggesting that structured adjudication can lead to significant revisions in treatment-relevant clinical parameters. Our work demonstrates how structured adjudication can support consensus and facilitate a deep understanding of expert disagreement in medical data analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.