2020
DOI: 10.1038/s41598-020-75544-1
|View full text |Cite
|
Sign up to set email alerts
|

De-identification of electronic health record using neural network

Abstract: According to a recent study, around 99% of hospitals across the US now use electronic health record systems (EHRs). One of the most common types of EHR is the unstructured textual data, and unlocking hidden details from this data is critical for improving current medical practices and research endeavors. However, these textual data contain sensitive information, which could compromise our privacy. Therefore, medical textual data cannot be released publicly without undergoing any privacy-protective measures. De… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(20 citation statements)
references
References 16 publications
1
16
0
Order By: Relevance
“…However, because of the different evaluation methods, this improvement cannot be taken at face value. Studies used different evaluation methods, from token-based binary [1,6,7,8] classification to entity classification based on HIPAA PHI [5,9].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…However, because of the different evaluation methods, this improvement cannot be taken at face value. Studies used different evaluation methods, from token-based binary [1,6,7,8] classification to entity classification based on HIPAA PHI [5,9].…”
Section: Resultsmentioning
confidence: 99%
“…In terms of methods, all the studies were deep learning-based, and invariably used bidirectional long-short term memory (Bi-LSTM), Conditional Random Fields (CRF) and Gated recurrent units (GRU). With these base methods, studies developed multiple innovative ensemble methods through voting mechanisms [3], stacking [3,7] and novel attention mechanisms based on transformer models [7,8], and use of rule-based methods and dictionaries [1,2,9].…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Unstructured electronic health records (EHRs) such as discharge summaries, encounter notes, pathology reports and radiology reports are valuable sources of information for undertaking basic, clinical and translational research 1 3 . Often, researchers are required to share or access EHRs in a de-identified state to protect the privacy of patients.…”
Section: Introductionmentioning
confidence: 99%
“…Over the years, several studies have been conducted on de-identification of EHRs using manual, rule based and machine learning methods. Automated de-identification methods can be employed to replace manual processes 1 , 7 – 11 . These methods, especially deep learning-based methods, often require a large corpus for accurate de-identification.…”
Section: Introductionmentioning
confidence: 99%