2022
DOI: 10.3389/frai.2022.970517
|View full text |Cite
|
Sign up to set email alerts
|

Training and intrinsic evaluation of lightweight word embeddings for the clinical domain in Spanish

Abstract: Resources for Natural Language Processing (NLP) are less numerous for languages different from English. In the clinical domain, where these resources are vital for obtaining new knowledge about human health and diseases, creating new resources for the Spanish language is imperative. One of the most common approaches in NLP is word embeddings, which are dense vector representations of a word, considering the word's context. This vector representation is usually the first step in various NLP tasks, such as text … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 35 publications
0
1
0
Order By: Relevance
“…• Malayalam: generation of synoptic clinical reports [39]; • Polish: prediction of cardiovascular diseases in electronic health records [40]; • (Brazilian) Portuguese: description of an annotated clinical corpus [41], ICD-10 coding [42]; • Serbian: sentiment analysis in COVID-19 tweets [43]; • Spanish: ICD-coding [10, 44], negation and uncertainty detection in clinical narratives [45], training and evaluation of word embeddings for the clinical domain [46]; • Swedish: ICD-10 coding [44];…”
Section: Languages Addressedmentioning
confidence: 99%
“…• Malayalam: generation of synoptic clinical reports [39]; • Polish: prediction of cardiovascular diseases in electronic health records [40]; • (Brazilian) Portuguese: description of an annotated clinical corpus [41], ICD-10 coding [42]; • Serbian: sentiment analysis in COVID-19 tweets [43]; • Spanish: ICD-coding [10, 44], negation and uncertainty detection in clinical narratives [45], training and evaluation of word embeddings for the clinical domain [46]; • Swedish: ICD-10 coding [44];…”
Section: Languages Addressedmentioning
confidence: 99%