Background: The identification of geographic variation in incidence can be an important step in the delineation of disease risk factors, but has mostly been undertaken in upper-income countries. Here, we use Electronic Health Records (EHR) from a middle-income country, Colombia, to characterize geographic variation in major mental disorders. Method: We leveraged geolocated EHRs of 16,295 patients at a psychiatric hospital serving the entire state of Caldas, all of whom received a primary diagnosis of bipolar disorder, schizophrenia, or major depressive disorder at their first visit. To identify the relationship between travel time and incidence of mental illness we used a zero-inflated negative binomial regression model. We used spatial scan statistics to identify clusters of patients, stratified by diagnosis and severity: mild (outpatients) or severe (inpatients). Results: We observed a significant association between incidence and travel time for outpatients (N = 11,077, relative risk (RR) = 0.80, 95% confidence interval (0.71, 0.89)), but not inpatients (N = 5,218). We found seven clusters of severe mental illness: the cluster with the most extreme overrepresentation of bipolar disorder (RR = 5.83, p < 0.001) has an average annual incidence of 8.7 inpatients per 10,000 residents, among the highest frequencies worldwide. Conclusions: The hospital database reflects the geographic distribution of severe, but not mild, mental illness within Caldas. Each hotspot is a candidate location for further research to identify genetic or environmental risk factors for severe mental illness. Our analyses highlight how existing infrastructure from middle-income countries can be extraordinary resources for population studies.
Investigations using Electronic Health Records (EHR) databases could enable accurate delineation of psychiatric disease trajectories at an unprecedented scale. Using EHR from a single institution (Clinica San Juan de Dios in Manizales, Colombia), we characterize diagnostic trajectories of >22,000 (ages 4-90, 60% female) individuals treated for severe mental illness (SMI), including schizophrenia (SCZ), bipolar disorder (BD), and severe or recurrent major depressive disorder (MDD). We extracted diagnostic codes, clinical notes, and healthcare use data collected since 2005. Using a subsample of 105 SMI patients, we assessed diagnostic reliability, comparing EHR to clinical chart review. EHR diagnostic codes showed very good agreement with chart review diagnoses (Cohens kappa 0.78). Using 3,600 annotated sentences from 2,788 patients, we developed a pipeline for extracting clinical features from the electronic text, which showed high agreement with gold-standard annotations (average F1 0.88). Factors associated with diagnostic instability, defined as changes in diagnosis between successive visits, were identified using mixed-effect logistic regression models. Of SMI patients with >3 visits (n=12,962), 64% had multiple EHR diagnoses; diagnostic switches (19%), comorbidities (30%), and both (15%). While some diagnostic switches are common, such as the switch from MDD to BD (observed in 22% of BD patients), trajectories are highly heterogeneous, with rare trajectories (occurring in <1% of patients) making up the majority (58% of all patients). Predictors of diagnostic instability include time since initial visit (OR 0.56 by visit number, p-value 2e-66), previous diagnostic change (OR= 4.02, p-value 3e-250) and NLP-derived descriptions of delusions (OR 1.50, p-values 2e-18). Our results underline the importance of considering longitudinal rather than cross-sectional diagnoses in psychiatric research and show how high-quality EHR data can contribute to global efforts to understand disease trajectories.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.