2022
DOI: 10.1145/3490234
|View full text |Cite
|
Sign up to set email alerts
|

The Secondary Use of Electronic Health Records for Data Mining: Data Characteristics and Challenges

Abstract: The primary objective of implementing Electronic Health Records (EHRs) is to improve the management of patients’ health-related information. However, these records have also been extensively used for the secondary purpose of clinical research and to improve healthcare practice. EHRs provide a rich set of information that includes demographics, medical history, medications, laboratory test results, and diagnosis. Data mining and analytics techniques have extensively exploited EHR information to study patient co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
27
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 52 publications
(27 citation statements)
references
References 224 publications
0
27
0
Order By: Relevance
“…The model can be also regarded as a data augmentation tool when little information is available, for example, for rare diseases. In addition to this, since healthcare datasets are frequently incomplete and the removal of missing values may result in a dataset that is too small or induce statistical bias [1], the model has the ability to impute such missing values in the trajectories of patients or reconstruct incomplete sequences of actions. In terms of interpretability, our model provides easier comprehension and explanation for end-users than other approaches developed in the healthcare setting [22,23,24,25,26,27].…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…The model can be also regarded as a data augmentation tool when little information is available, for example, for rare diseases. In addition to this, since healthcare datasets are frequently incomplete and the removal of missing values may result in a dataset that is too small or induce statistical bias [1], the model has the ability to impute such missing values in the trajectories of patients or reconstruct incomplete sequences of actions. In terms of interpretability, our model provides easier comprehension and explanation for end-users than other approaches developed in the healthcare setting [22,23,24,25,26,27].…”
Section: Discussionmentioning
confidence: 99%
“…These repositories contain systematized collections of patient data, including demographics, procedures, diagnosis, medications, costs, medical service providers and so on. The order of occurrence of these medical events in EHRs provides valuable information on the treatment trajectory of a patient which could improve the understanding of the disease [1]. Therefore, we represent each patient using chronologically ordered sequences of medical actions.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…With the generation of huge volume of medical and health data, it has significantly stimulated the rapid development and application of data mining techniques in the field of healthcare [4]. Studies on data-driven diagnostic or prognostic models for diseases are on the rise, evolving from classical statistical methods to machine learning models [5].…”
Section: Introductionmentioning
confidence: 99%
“…Suicide and homelessness share many risk factors, but few have studied the temporal relationship between homelessness and SI/SB, assessed patterns of suicide‐related service utilization around the onset of homelessness, or distinguished between risk factors of SI versus SB. Data from health information exchanges (HIEs) are a potentially valuable, complementary resource for filling these knowledge gaps (Sarkar, 2022; Sarwar et al, 2022).…”
Section: Introductionmentioning
confidence: 99%