2022
DOI: 10.1055/a-1862-0421
|View full text |Cite
|
Sign up to set email alerts
|

A Systematic Approach to Configuring MetaMap for Optimal Performance

Abstract: Background: MetaMap is a valuable tool for processing biomedical texts to identify concepts. Although MetaMap is highly configurative, configuration decisions are not straightforward. Objective: To develop a systematic, data-driven methodology for configuring MetaMap for optimal performance. Methods: MetaMap, the word2vec model, and the phrase model were used to build a pipeline. For unsupervised training, the phrase and word2vec models used abstracts related to clinical decision support as input. During test… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 22 publications
0
1
0
Order By: Relevance
“…Now, only 3148 unlabeled articles remained, and we created synthetic KP and marked the labels to create a synthetic labeled dataset for the CDSS domain with a 1:2 train-validation split. Cohen’s kappa rates for the first 42 (GS42) abstracts were 0.93 (between annotators 1 and 2) and 0.73 (between annotators 1 and 3) [37]. For the second set of abstracts (GS91), Cohen’s kappa rates were 0.87 (between annotators 1 and 2) and 0.97 (between annotators 1 and 3).…”
Section: Experiments and Resultsmentioning
confidence: 99%
“…Now, only 3148 unlabeled articles remained, and we created synthetic KP and marked the labels to create a synthetic labeled dataset for the CDSS domain with a 1:2 train-validation split. Cohen’s kappa rates for the first 42 (GS42) abstracts were 0.93 (between annotators 1 and 2) and 0.73 (between annotators 1 and 3) [37]. For the second set of abstracts (GS91), Cohen’s kappa rates were 0.87 (between annotators 1 and 2) and 0.97 (between annotators 1 and 3).…”
Section: Experiments and Resultsmentioning
confidence: 99%