ObjectiveCohort selection is challenging for large-scale electronic health record (EHR) analyses, as International Classification of Diseases 9th edition (ICD-9) diagnostic codes are notoriously unreliable disease predictors. Our objective was to develop, evaluate, and validate an automated algorithm for determining an Autism Spectrum Disorder (ASD) patient cohort from EHR. We demonstrate its utility via the largest investigation to date of the co-occurrence patterns of medical comorbidities in ASD.MethodsWe extracted ICD-9 codes and concepts derived from the clinical notes. A gold standard patient set was labeled by clinicians at Boston Children’s Hospital (BCH) (N = 150) and Cincinnati Children’s Hospital and Medical Center (CCHMC) (N = 152). Two algorithms were created: (1) rule-based implementing the ASD criteria from Diagnostic and Statistical Manual of Mental Diseases 4th edition, (2) predictive classifier. The positive predictive values (PPV) achieved by these algorithms were compared to an ICD-9 code baseline. We clustered the patients based on grouped ICD-9 codes and evaluated subgroups.ResultsThe rule-based algorithm produced the best PPV: (a) BCH: 0.885 vs. 0.273 (baseline); (b) CCHMC: 0.840 vs. 0.645 (baseline); (c) combined: 0.864 vs. 0.460 (baseline). A validation at Children’s Hospital of Philadelphia yielded 0.848 (PPV). Clustering analyses of comorbidities on the three-site large cohort (N = 20,658 ASD patients) identified psychiatric, developmental, and seizure disorder clusters.ConclusionsIn a large cross-institutional cohort, co-occurrence patterns of comorbidities in ASDs provide further hypothetical evidence for distinct courses in ASD. The proposed automated algorithms for cohort selection open avenues for other large-scale EHR studies and individualized treatment of ASD.
Hereditary hemochromatosis (HH) is a common autosomal-recessive disorder associated with pathogenic HFE variants, most commonly those resulting in p.Cys282Tyr and p.His63Asp. Recommendations on returning incidental findings of HFE variants in individuals undergoing genome-scale sequencing should be informed by penetrance estimates of HH in unselected samples. We used the eMERGE Network, a multicenter cohort with genotype data linked to electronic medical records, to estimate the diagnostic rate and clinical penetrance of HH in 98 individuals homozygous for the variant coding for HFE p.Cys282Tyr and 397 compound heterozygotes with variants resulting in p.[His63Asp];[Cys282Tyr]. The diagnostic rate of HH in males was 24.4% for p.Cys282Tyr homozygotes and 3.5% for compound heterozygotes (p < 0.001); in females, it was 14.0% for p.Cys282Tyr homozygotes and 2.3% for compound heterozygotes (p < 0.001). Only males showed differences across genotypes in transferrin saturation levels (100% of homozygotes versus 37.5% of compound heterozygotes with transferrin saturation > 50%; p = 0.003), serum ferritin levels (77.8% versus 33.3% with serum ferritin > 300 ng/ml; p = 0.006), and diabetes (44.7% versus 28.0%; p = 0.03). No differences were found in the prevalence of heart disease, arthritis, or liver disease, except for the rate of liver biopsy (10.9% versus 1.8% [p = 0.013] in males; 9.1% versus 2% [p = 0.035] in females). Given the higher rate of HH diagnosis than in prior studies, the high penetrance of iron overload, and the frequency of at-risk genotypes, in addition to other suggested actionable adult-onset genetic conditions, opportunistic screening should be considered for p.[Cys282Tyr];[Cys282Tyr] individuals with existing genomic data.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.