2020
DOI: 10.2196/17622
|View full text |Cite
|
Sign up to set email alerts
|

Re-examination of Rule-Based Methods in Deidentification of Electronic Health Records: Algorithm Development and Validation

Abstract: Background Deidentification of clinical records is a critical step before their publication. This is usually treated as a type of sequence labeling task, and ensemble learning is one of the best performing solutions. Under the framework of multi-learner ensemble, the significance of a candidate rule-based learner remains an open issue. Objective The aim of this study is to investigate whether a rule-based learner is useful in a hybrid deidentification s… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…Deidentification of clinical records, as an application, is a critical step in the use of electronic health records for academic research. Zhao et al [ 16 ] investigated the usefulness of rule-based learners in a hybrid deidentification system. A data-driven rule learner named transformation-based error-driven learning was integrated into a hybrid system.…”
Section: Resultsmentioning
confidence: 99%
“…Deidentification of clinical records, as an application, is a critical step in the use of electronic health records for academic research. Zhao et al [ 16 ] investigated the usefulness of rule-based learners in a hybrid deidentification system. A data-driven rule learner named transformation-based error-driven learning was integrated into a hybrid system.…”
Section: Resultsmentioning
confidence: 99%
“…Sweeney [ 10 ] proposed another rule-based approach for identifying and replacing SHI.MIT-De-id (PhysioNet) software, which is also a rule-based deidentification system developed using nursing progress reports [ 11 ]. Zhao et al [ 12 ] developed a rule-based model and integrated it into an ensemble framework and achieved the best performance of the model when compared with a non–rule-based model. Similarly, Dehghan et al [ 13 ] developed cDeID software to deidentify 7 HIPAA categories.…”
Section: Introductionmentioning
confidence: 99%
“…In the past few decades, rule-based [ 6 , 7 ] and machine learning–based [ 3 , 8 , 9 ] approaches have been the mainstream approaches to identifying entities in sentences or documents. Rule-based methods utilize special semantic dictionaries to establish a set of regular expressions [ 4 , 5 ] to extract PHI from EHRs.…”
Section: Introductionmentioning
confidence: 99%