2019
DOI: 10.1038/s41597-019-0055-0
|View full text |Cite
|
Sign up to set email alerts
|

BioWordVec, improving biomedical word embeddings with subword information and MeSH

Abstract: Distributed word representations have become an essential foundation for biomedical natural language processing (BioNLP), text mining and information retrieval. Word embeddings are traditionally computed at the word level from a large corpus of unlabeled text, ignoring the information present in the internal structure of words or any information available in domain specific structured resources such as ontologies. However, such information holds potentials for greatly improving the quality of the word represen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
242
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
4
3
2

Relationship

2
7

Authors

Journals

citations
Cited by 369 publications
(263 citation statements)
references
References 29 publications
2
242
0
Order By: Relevance
“…One established trend is a form of word embeddings that represent the semantic, using high dimensional vectors (Chiu et al, 2016;Wang et al, 2018c;Zhang et al, 2019). Similar methods also have been derived to improve embeddings of word sequences by introducing sentence embeddings .…”
Section: Related Workmentioning
confidence: 99%
“…One established trend is a form of word embeddings that represent the semantic, using high dimensional vectors (Chiu et al, 2016;Wang et al, 2018c;Zhang et al, 2019). Similar methods also have been derived to improve embeddings of word sequences by introducing sentence embeddings .…”
Section: Related Workmentioning
confidence: 99%
“…For this setting, we found approximately 1,000 unique publications, screened them for relevance, and, finally, included roughly 100 into this survey. [16] BioWordVec [17] BioSentVec [18] Flair('pubmed-X')…”
Section: Design and Goals Of This Surveymentioning
confidence: 99%
“…In the results, we compare two variants of DeepRank: BioDeepRank refers to the model with the modified aggregation network and weighting mechanism, and using word embeddings for the biomedical domain [15]; Attn-BioDeepRank refers to the final model that additionally replaces the recurrent layer by a self-attention layer. 2 Neural Ranking Models.…”
Section: Methodsmentioning
confidence: 99%