2019
DOI: 10.1093/jamiaopen/ooz007
|View full text |Cite
|
Sign up to set email alerts
|

Using word embeddings to expand terminology of dietary supplements on clinical notes

Abstract: Objective: The objective of this study is to demonstrate the feasibility of applying word embeddings to expand the terminology of dietary supplements (DS) using over 26 million clinical notes. Methods: Word embedding models (ie, word2vec and GloVe) trained on clinical notes were used to predefine a list of top 40 semantically related terms for each of 14 commonly used DS. Each list was further evaluated by experts to generate semantically similar terms. We investigated the effect of corpus size and other set… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
7
2
1

Relationship

3
7

Authors

Journals

citations
Cited by 48 publications
(15 citation statements)
references
References 21 publications
0
15
0
Order By: Relevance
“…This was achieved by using a predefined clinical lexicon for query expansion using a word2vec model trained by Pakhomov et al (2016). Similarly, Fan et al (2019) trained a word2vec model on clinical notes to expand dietary supplement vocabulary by finding corresponding misspellings and brand names, which helped retrieve more relevant documents.…”
Section: Word Embeddings and Query Expansionmentioning
confidence: 99%
“…This was achieved by using a predefined clinical lexicon for query expansion using a word2vec model trained by Pakhomov et al (2016). Similarly, Fan et al (2019) trained a word2vec model on clinical notes to expand dietary supplement vocabulary by finding corresponding misspellings and brand names, which helped retrieve more relevant documents.…”
Section: Word Embeddings and Query Expansionmentioning
confidence: 99%
“…These will often outperform general-domain embeddings on clinical text mining tasks (51). Specialized clinical text embeddings have been used to improve clinical named entity recognition (75), resolve abbreviations in clinical text (76), expand a structured lexicon of radiology terms (77) and build a lexicon of dietary supplements (78). Second, an embedding can incorporate structured information beyond what is found in the text (79), and embeddings have been created to represent CUIs (80), documents (81,82), or entire patient records (83).…”
Section: Word Phrase and Character Embeddingsmentioning
confidence: 99%
“…Word embeddings are good at enriching the terminology of existing concepts. They are used in [24] to extend the terminology of dietary supplements. The experimental results prove that the expanded terms are more relevant as search keywords in clinical notes than in external knowledge sources.…”
Section: Related Workmentioning
confidence: 99%