Automatic de-identification of electronic medical records using token-level and character-level conditional random fields

Liu, Zengjian; Chen, Yangxin; Tang, Buzhou; Wang, Xiaolong; Chen, Qingcai; Li, Haodi; Wang, Jingfeng; Deng, Qi; Zhu, Suisong

doi:10.1016/j.jbi.2015.06.009

Cited by 68 publications

(70 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The representative works are three natural language processing (NLP) challenges, two organized by the Center of Informatics for Integrating Biology and Bedside (i2b2) in 2006 [2] and 2014 [3, 4, 5], and one organized by the Centers of Excellence in Genomic Science (CEGS) Neuropsychiatric Genome-scale and RDOC Individualized Domains (N-GRID) in 2016 [6]. The organizers of the three challenges provide manually annotated corpora for participants to develop various kinds of systems for de-identification [7, 8, 9, 10, 11, 12, 13, 14, 15]. …”

Section: Introductionmentioning

confidence: 99%

“…In our system, an ensemble classifier is deployed to combine the outputs of three individual machine learning-based subsystems, and a rule-based subsystem is used to identify some formulaic PHI instances. The three machine learning-based subsystems are a CRF-based system with a large number of hand-crafted features [12], a bidirectional LSTM-based system without any hand-crafted features [16, 17], and a variant of bidirectional LSTM-based system with a small quantity of hand-crafted features [18, 19]. Moreover, we also evaluate our system on the 2014 i2b2 challenge corpus and compare it with other state-of-the-art systems.…”

Section: Introductionmentioning

confidence: 99%

“…Lots of teams from all around the world participated in this three challenges. In the two i2b2 NLP challenges, the proposed de-identification systems may fall in three categories: rule-based [26], machine learning-based [10, 14, 27], and hybrid [11, 12, 13, 15]. The rule-based systems can exactly recognize formulaic PHI instances (i.e., phone numbers, emails, licenses, etc.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

De-identification of clinical notes via recurrent neural network and conditional random field

Liu

Tang

Wang

et al. 2017

Journal of Biomedical Informatics

Self Cite

124

114

View full text Add to dashboard Cite

De-identification, identifying information from data, such as protected health information (PHI) present in clinical data, is a critical step to enable data to be shared or published. The 2016 Centers of Excellence in Genomic Science (CEGS) Neuropsychiatric Genome-scale and RDOC Individualized Domains (N-GRID) clinical natural language processing (NLP) challenge contains a de-identification track in de-identifying electronic medical records (EMRs) (i.e., track 1). The challenge organizers provide 1000 annotated mental health records for this track, 600 out of which are used as a training set and 400 as a test set. We develop a hybrid system for the de-identification task on the training set. Firstly, four individual subsystems, that is, a subsystem based on bidirectional LSTM (long-short term memory, a variant of recurrent neural network), a subsystem-based on bidirectional LSTM with features, a subsystem based on conditional random field (CRF) and a rule-based subsystem, are used to identify PHI instances. Then, an ensemble learning-based classifiers is deployed to combine all PHI instances predicted by above three machine learning-based subsystems. Finally, the results of the ensemble learning-based classifier and the rule-based subsystem are merged together. Experiments conducted on the official test set show that our system achieves the highest micro F1-scores of 93.07%, 91.43% and 95.23% under the “token”, “strict” and “binary token” criteria respectively, ranking first in the 2016 CEGS N-GRID NLP challenge. In addition, on the dataset of 2014 i2b2 NLP challenge, our system achieves the highest micro F1-scores of 96.98%, 95.11% and 98.28% under the “token”, “strict” and “binary token” criteria respectively, outperforming other state-of-the-art systems. All these experiments prove the effectiveness of our proposed method.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

De-identification of clinical notes via recurrent neural network and conditional random field

Liu

Tang

Wang

et al. 2017

Journal of Biomedical Informatics

Self Cite

124

114

View full text Add to dashboard Cite

show abstract

“…These rules increase the system coverage by permitting the uses of more relaxed patterns and ambiguous terms. Researchers can combine the pattern and dictionary methods in the machine-learning model by using matching results as features (11, 19). This method is simple and likely optimal within the scope of the challenge.…”

Section: Discussionmentioning

confidence: 99%

“…Successful systems used the machine-learning algorithm Conditional Random Field (CRF) for labeling a sequence of tokens(11, 18, 19). The popular rule-based method was pattern-matching using a formal language such as regular expressions.…”

Section: Introductionmentioning

confidence: 99%

The UAB Informatics Institute and 2016 CEGS N-GRID de-identification shared task challenge

Bui

Wyatt

Cimino

2017

Journal of Biomedical Informatics

View full text Add to dashboard Cite

Clinical narratives (the text notes found in patients’ medical records) are important information sources for secondary use in research. However, in order to protect patient privacy, they must be de-identified prior to use. Manual de-identification is considered to be the gold standard approach but is tedious, expensive, slow, and impractical for use with large-scale clinical data. Automated or semi-automated de-identification using computer algorithms is a potentially promising alternative. The Informatics Institute of the University of Alabama at Birmingham is applying de-identification to clinical data drawn from the UAB hospital’s electronic medical records system before releasing them for research. We participated in a shared task challenge by the Centers of Excellence in Genomic Science (CEGS) Neuropsychiatric Genome-Scale and RDoC Individualized Domains (N-GRID) at the de-identification regular track to gain experience developing our own automatic de-identification tool. We focused on the popular and successful methods from previous challenges: rule-based, dictionary-matching, and machine-learning approaches. We also explored new techniques such as disambiguation rules, term ambiguity measurement, and used multi-pass sieve framework at a micro level. For the challenge’s primary measure (strict entity), our submissions achieved competitive results (f-measures: 87.3%, 87.1%, and 86.7%). For our preferred measure (binary token HIPAA), our submissions achieved superior results (f-measures: 93.7%, 93.6%, and 93%). With those encouraging results, we gain the confidence to improve and use the tool for the real de-identification task at the UAB Informatics Institute.

show abstract

Comparison of Chest Radiograph Captions Based on Natural Language Processing vs Completed by Radiologists

et al. 2023

View full text Add to dashboard Cite

ImportanceArtificial intelligence (AI) can interpret abnormal signs in chest radiography (CXR) and generate captions, but a prospective study is needed to examine its practical value.ObjectiveTo prospectively compare natural language processing (NLP)-generated CXR captions and the diagnostic findings of radiologists.Design, Setting, and ParticipantsA multicenter diagnostic study was conducted. The training data set included CXR images and reports retrospectively collected from February 1, 2014, to February 28, 2018. The retrospective test data set included consecutive images and reports from April 1 to July 31, 2019. The prospective test data set included consecutive images and reports from May 1 to September 30, 2021.ExposuresA bidirectional encoder representation from a transformers model was used to extract language entities and relationships from unstructured CXR reports to establish 23 labels of abnormal signs to train convolutional neural networks. The participants in the prospective test group were randomly assigned to 1 of 3 different caption generation models: a normal template, NLP-generated captions, and rule-based captions based on convolutional neural networks. For each case, a resident drafted the report based on the randomly assigned captions and an experienced radiologist finalized the report blinded to the original captions. A total of 21 residents and 19 radiologists were involved.Main Outcomes and MeasuresTime to write reports based on different caption generation models.ResultsThe training data set consisted of 74 082 cases (39 254 [53.0%] women; mean [SD] age, 50.0 [17.1] years). In the retrospective (n = 8126; 4345 [53.5%] women; mean [SD] age, 47.9 [15.9] years) and prospective (n = 5091; 2416 [47.5%] women; mean [SD] age, 45.1 [15.6] years) test data sets, the mean (SD) area under the curve of abnormal signs was 0.87 (0.11) in the retrospective data set and 0.84 (0.09) in the prospective data set. The residents’ mean (SD) reporting time using the NLP-generated model was 283 (37) seconds—significantly shorter than the normal template (347 [58] seconds; P &lt; .001) and the rule-based model (296 [46] seconds; P &lt; .001). The NLP-generated captions showed the highest similarity to the final reports with a mean (SD) bilingual evaluation understudy score of 0.69 (0.24)—significantly higher than the normal template (0.37 [0.09]; P &lt; .001) and the rule-based model (0.57 [0.19]; P &lt; .001).Conclusions and RelevanceIn this diagnostic study of NLP-generated CXR captions, prior information provided by NLP was associated with greater efficiency in the reporting process, while maintaining good consistency with the findings of radiologists.

show abstract

Automatic de-identification of electronic medical records using token-level and character-level conditional random fields

Cited by 68 publications

References 21 publications

De-identification of clinical notes via recurrent neural network and conditional random field

De-identification of clinical notes via recurrent neural network and conditional random field

The UAB Informatics Institute and 2016 CEGS N-GRID de-identification shared task challenge

Comparison of Chest Radiograph Captions Based on Natural Language Processing vs Completed by Radiologists

Contact Info

Product

Resources

About