Purpose The South London and Maudsley National Health Service (NHS) Foundation Trust Biomedical Research Centre (SLaM BRC) Case Register and its Clinical Record Interactive Search (CRIS) application were developed in 2008, generating a research repository of real-time, anonymised, structured and open-text data derived from the electronic health record system used by SLaM, a large mental healthcare provider in southeast London. In this paper, we update this register's descriptive data, and describe the substantial expansion and extension of the data resource since its original development. Participants Descriptive data were generated from the SLaM BRC Case Register on 31 December 2014. Currently, there are over 250 000 patient records accessed through CRIS. Findings to date Since 2008, the most significant developments in the SLaM BRC Case Register have been the introduction of natural language processing to extract structured data from open-text fields, linkages to external sources of data, and the addition of a parallel relational database (Structured Query Language) output. Natural language processing applications to date have brought in new and hitherto inaccessible data on cognitive function, education, social care receipt, smoking, diagnostic statements and pharmacotherapy. In addition, through external data linkages, large volumes of supplementary information have been accessed on mortality, hospital attendances and cancer registrations. Future plans Coupled with robust data security and governance structures, electronic health records provide potentially transformative information on mental disorders and outcomes in routine clinical care. The SLaM BRC Case Register continues to grow as a database, with approximately 20 000 new cases added each year, in addition to extension of follow-up for existing cases. Data linkages and natural language processing present important opportunities to enhance this type of research resource further, achieving both volume and depth of data. However, research projects still need to be carefully tailored, so that they take into account the nature and quality of the source information.
ObjectiveDespite improving healthcare, the gap in mortality between people with serious mental illness (SMI) and general population persists, especially for younger age groups. The electronic database from a large and comprehensive secondary mental healthcare provider in London was utilized to assess the impact of SMI diagnoses on life expectancy at birth.MethodPeople who were diagnosed with SMI (schizophrenia, schizoaffective disorder, bipolar disorder), substance use disorder, and depressive episode/disorder before the end of 2009 and under active review by the South London and Maudsley NHS Foundation Trust (SLAM) in southeast London during 2007–09 comprised the sample, retrieved by the SLAM Case Register Interactive Search (CRIS) system. We estimated life expectancy at birth for people with SMI and each diagnosis, from national mortality returns between 2007–09, using a life table method.ResultsA total of 31,719 eligible people, aged 15 years or older, with SMI were analyzed. Among them, 1,370 died during 2007–09. Compared to national figures, all disorders were associated with substantially lower life expectancy: 8.0 to 14.6 life years lost for men and 9.8 to 17.5 life years lost for women. Highest reductions were found for men with schizophrenia (14.6 years lost) and women with schizoaffective disorders (17.5 years lost).ConclusionThe impact of serious mental illness on life expectancy is marked and generally higher than similarly calculated impacts of well-recognised adverse exposures such as smoking, diabetes and obesity. Strategies to identify and prevent causes of premature death are urgently required.
BackgroundElectronic health records (EHRs) provide enormous potential for health research but also present data governance challenges. Ensuring de-identification is a pre-requisite for use of EHR data without prior consent. The South London and Maudsley NHS Trust (SLaM), one of the largest secondary mental healthcare providers in Europe, has developed, from its EHRs, a de-identified psychiatric case register, the Clinical Record Interactive Search (CRIS), for secondary research.MethodsWe describe development, implementation and evaluation of a bespoke de-identification algorithm used to create the register. It is designed to create dictionaries using patient identifiers (PIs) entered into dedicated source fields and then identify, match and mask them (with ZZZZZ) when they appear in medical texts. We deemed this approach would be effective, given high coverage of PI in the dedicated fields and the effectiveness of the masking combined with elements of a security model. We conducted two separate performance tests i) to test performance of the algorithm in masking individual true PIs entered in dedicated fields and then found in text (using 500 patient notes) and ii) to compare the performance of the CRIS pattern matching algorithm with a machine learning algorithm, called the MITRE Identification Scrubber Toolkit – MIST (using 70 patient notes – 50 notes to train, 20 notes to test on). We also report any incidences of potential breaches, defined by occurrences of 3 or more true or apparent PIs in the same patient’s notes (and in an additional set of longitudinal notes for 50 patients); and we consider the possibility of inferring information despite de-identification.ResultsTrue PIs were masked with 98.8% precision and 97.6% recall. As anticipated, potential PIs did appear, owing to misspellings entered within the EHRs. We found one potential breach. In a separate performance test, with a different set of notes, CRIS yielded 100% precision and 88.5% recall, while MIST yielded a 95.1% and 78.1%, respectively. We discuss how we overcome the realistic possibility – albeit of low probability – of potential breaches through implementation of the security model.ConclusionCRIS is a de-identified psychiatric database sourced from EHRs, which protects patient anonymity and maximises data available for research. CRIS demonstrates the advantage of combining an effective de-identification algorithm with a carefully designed security model. The paper advances much needed discussion of EHR de-identification – particularly in relation to criteria to assess de-identification, and considering the contexts of de-identified research databases when assessing the risk of breaches of confidential patient information.
BackgroundHigher mortality has been found for people with serious mental illness (SMI, including schizophrenia, schizoaffective disorders, and bipolar affective disorder) at all age groups. Our aim was to characterize vulnerable groups for excess mortality among people with SMI, substance use disorders, depressive episode, and recurrent depressive disorder.MethodsA case register was developed at the South London and Maudsley National Health Services Foundation Trust (NHS SLAM), accessing full electronic clinical records on over 150,000 mental health service users as a well-defined cohort since 2006. The Case Register Interactive Search (CRIS) system enabled searching and retrieval of anonymised information since 2008. Deaths were identified by regular national tracing returns after 2006. Standardized mortality ratios (SMRs) were calculated for the period 2007 to 2009 using SLAM records for this period and the expected number of deaths from age-specific mortality statistics for the England and Wales population in 2008. Data were stratified by gender, ethnicity, and specific mental disorders.ResultsA total of 31,719 cases, aged 15 years old or more, active between 2007-2009 and with mental disorders of interest prior to 2009 were detected in the SLAM case register. SMRs were 2.15 (95% CI: 1.95-2.36) for all SMI with genders combined, 1.89 (1.64-2.17) for women and 2.47 (2.17-2.80) for men. In addition, highest mortality risk was found for substance use disorders (SMR = 4.17; 95% CI: 3.75-4.64). Age- and gender-standardised mortality ratios by ethnic group revealed huge fluctuations, and SMRs for all disorders diminished in strength with age. The main limitation was the setting of secondary mental health care provider in SLAM.ConclusionsSubstantially higher mortality persists in people with serious mental illness, substance use disorders and depressive disorders. Furthermore, mortality risk differs substantially with age, diagnosis, gender and ethnicity. Further research into specific risk groups is required.
Research into suicide prevention has been hampered by methodological limitations such as low sample size and recall bias. Recently, Natural Language Processing (NLP) strategies have been used with Electronic Health Records to increase information extraction from free text notes as well as structured fields concerning suicidality and this allows access to much larger cohorts than previously possible. This paper presents two novel NLP approaches – a rule-based approach to classify the presence of suicide ideation and a hybrid machine learning and rule-based approach to identify suicide attempts in a psychiatric clinical database. Good performance of the two classifiers in the evaluation study suggest they can be used to accurately detect mentions of suicide ideation and attempt within free-text documents in this psychiatric database. The novelty of the two approaches lies in the malleability of each classifier if a need to refine performance, or meet alternate classification requirements arises. The algorithms can also be adapted to fit infrastructures of other clinical datasets given sufficient clinical recording practice knowledge, without dependency on medical codes or additional data extraction of known risk factors to predict suicidal behaviour.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.