2017
DOI: 10.1093/jamia/ocw177
|View full text |Cite
|
Sign up to set email alerts
|

MetaMap Lite: an evaluation of a new Java implementation of MetaMap

Abstract: MetaMap is a widely used named entity recognition tool that identifies concepts from the Unified Medical Language System Metathesaurus in text. This study presents MetaMap Lite, an implementation of some of the basic MetaMap functions in Java. On several collections of biomedical literature and clinical text, MetaMap Lite demonstrated real-time speed and precision, recall, and F1 scores comparable to or exceeding those of MetaMap and other popular biomedical text processing tools, clinical Text Analysis and Kn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
92
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 140 publications
(97 citation statements)
references
References 15 publications
0
92
0
Order By: Relevance
“…This process resulted in a weighted-average precision of 0.62, recall of 0.82, and F 1 score of 0.71 for the CSU data, as compared to a previously reported weighted-average precision of 0.67, recall of 0.53, and F 1 score of 0.58 for human clinical narratives [54]. The results of this evaluation can be seen in Table 3.…”
Section: Evaluation Of Metamap On Veterinary Recordsmentioning
confidence: 79%
See 1 more Smart Citation
“…This process resulted in a weighted-average precision of 0.62, recall of 0.82, and F 1 score of 0.71 for the CSU data, as compared to a previously reported weighted-average precision of 0.67, recall of 0.53, and F 1 score of 0.58 for human clinical narratives [54]. The results of this evaluation can be seen in Table 3.…”
Section: Evaluation Of Metamap On Veterinary Recordsmentioning
confidence: 79%
“…Evaluation metric . For all models we trained (LSTM, DT, and RF), we used the same evaluation metrics previously reported for MetaMap Lite [54]: a) precision, defined as the proportion of documents which were assigned the correct category; b) recall, defined as the proportion of documents from a given category that were correctly identified; and c) F 1 score, defined as the harmonic average of precision and recall. Formulas for these metrics are provided below: recision P =…”
Section: Discussionmentioning
confidence: 99%
“…To provide additional information about the questions that could be used for diverse IR and NLP tasks, we automatically annotated the questions with the focus, its UMLS Concept Unique Identifier (CUI) and Semantic Type. We combined two methods to recognize named entities from the titles of the crawled articles and their associated UMLS CUIs: (i) exact string matching to the UMLS Metathesaurus 8 , and (ii) MetaMap Lite 9 [16]. We then used the UMLS Semantic Network to retrieve the associated semantic types and groups.…”
Section: Methodsmentioning
confidence: 99%
“…[12] A lightweight Java implementation (Metamap Lite) was used in our pipeline due to processing speed and ease of use. In a recent study, MetaMap Lite demonstrated real-time speed and extraction performance comparable to or exceeding those of MetaMap and other popular biomedical text processing tools, [13] clinical Text Analysis and Knowledge Extraction System (cTAKES), [14] and DNorm. [15] Metamap-Lite extracted medical problems, tests, and treatments from 2010 i2b2 concepts dataset with precision 47.0, recall 31.9, and F1 38.0.…”
Section: Umls-driven Natural Language Processing (Nlp) Derived Influementioning
confidence: 99%
“…[15] Metamap-Lite extracted medical problems, tests, and treatments from 2010 i2b2 concepts dataset with precision 47.0, recall 31.9, and F1 38.0. [13] After identifying the UMLS concepts, the NLP pipeline assigned each extracted UMLS Metathesaurus concept an assertion value (present, absent, conditional, hypothetical, possible, not-patient) with an in-house statistical assertion classifier. While building the in-house assertion classifer, the Stanford NLP library [16] was used for tokenization, POS tagging, and dependency parsing to capture a wide range of syntactic and semantic features presented in clinical text.…”
Section: Umls-driven Natural Language Processing (Nlp) Derived Influementioning
confidence: 99%