2019
DOI: 10.2196/preprints.17376
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Implementation of a Cohort Retrieval System for Clinical Data Repositories Using the Observational Medical Outcomes Partnership Common Data Model: Proof-of-Concept System Validation (Preprint)

Abstract: 150-250 words)Background: Widespread adoption of electronic health records (EHRs) has enabled secondary use of EHR data for clinical research and healthcare delivery. Natural language processing (NLP) techniques have shown promise in their capability to extract the embedded information in unstructured clinical data, and information retrieval (IR) techniques provide flexible and scalable solutions that can augment the NLP systems for retrieving and ranking relevant records. Methods:In this paper, we present the… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
4
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
1
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 33 publications
(37 reference statements)
1
4
0
Order By: Relevance
“…This not only would give us a more realistic picture of the performance of these topics, but also identify additional patients for relevance judgment for the word-based queries. After the additional judgments, we found that the structured queries had much higher recall than the word-based queries ( Figure 5) as well as much higher precision ( Figure 6), which was also been found in comparable experiments from Mayo Clinic [39]. As precision is sometimes conceptualized as "number needed to read" [40], the higher precision for the structured queries means fewer patients would need to be assessed to identify candidates for clinical studies.…”
Section: Discussionsupporting
confidence: 67%
See 1 more Smart Citation
“…This not only would give us a more realistic picture of the performance of these topics, but also identify additional patients for relevance judgment for the word-based queries. After the additional judgments, we found that the structured queries had much higher recall than the word-based queries ( Figure 5) as well as much higher precision ( Figure 6), which was also been found in comparable experiments from Mayo Clinic [39]. As precision is sometimes conceptualized as "number needed to read" [40], the higher precision for the structured queries means fewer patients would need to be assessed to identify candidates for clinical studies.…”
Section: Discussionsupporting
confidence: 67%
“…was not certified by peer review) (which The copyright holder for this preprint this version posted November 12, 2019. . https://doi.org/10.1101/19005280 doi: medRxiv preprint topics that had been selected randomly for this previous research [39], while the second five topics were selected for diversity in all five of our sources for topic definitions (OHSU, Mayo, PheKP, REP, NQF). The second five were also selected based on a higher likelihood to be seen in clinical practice (based on clinician judgement), as compared to other topics in the list of 56.…”
Section: Additional Relevance Assessment For Ten Selected Topicsmentioning
confidence: 99%
“…After the additional judgments, we found that the structured queries had much higher recall than the word-based queries ( Figure 5 ) as well as much higher precision ( Figure 6 ), which was also been found in comparable experiments from Mayo Clinic. 39 As precision is sometimes conceptualized as “number needed to read,” 40 the higher precision for the structured queries means fewer patients would need to be assessed to identify candidates for clinical studies.…”
Section: Discussionmentioning
confidence: 99%
“…These included topics 2, 7, 9, 17, 32, 33, 42, 44, 48, and 52. To build on previous work done in our group, we used 5 topics that had been selected randomly for this previous research, 39 while the second 5 topics were selected for diversity in all 5 of our sources for topic definitions (OHSU, Mayo, PheKB, REP, and NQF). The second 5 were also selected based on a higher likelihood to be seen in clinical practice (based on clinician judgment), as compared to other topics in the list of 56.…”
Section: Methodsmentioning
confidence: 99%
“…In recent years, Clinical Data Warehouses (CDW) have been developed in hospitals. Various tools relying on CDW have been created in order to mine information from structured and unstructured data such as EMERSE [1], CREATE [2] or Dr. Warehouse [3].…”
Section: Introductionmentioning
confidence: 99%