Proceedings of the Workshop Computational Semantics Beyond Events and Roles 2017
DOI: 10.18653/v1/w17-1807
|View full text |Cite
|
Sign up to set email alerts
|

Annotation of negation in the IULA Spanish Clinical Record Corpus

Abstract: This paper presents the IULA Spanish Clinical Record Corpus, a corpus of 3,194 sentences extracted from anonymized clinical records and manually annotated with negation markers and their scope. The corpus was conceived as a resource to support clinical text-mining systems, but it is also a useful resource for other Natural Language Processing systems handling clinical texts: automatic encoding of clinical records, diagnosis support, term extraction, among others, as well as for the study of clinical texts. The… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
18
0
1

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 19 publications
(19 citation statements)
references
References 14 publications
0
18
0
1
Order By: Relevance
“…The IULA Spanish Clinical Record (Marimon et al 2017) corpus contains 300 anonymized clinical records from several services of one of the main hospitals in Barcelona (Spain) that was annotated with negation markers and their scopes. It contains 3,194 sentences, out of which 1,093 (34.22%) were annotated with negation cues.…”
Section: Iula Spanish Clinical Recordmentioning
confidence: 99%
“…The IULA Spanish Clinical Record (Marimon et al 2017) corpus contains 300 anonymized clinical records from several services of one of the main hospitals in Barcelona (Spain) that was annotated with negation markers and their scopes. It contains 3,194 sentences, out of which 1,093 (34.22%) were annotated with negation cues.…”
Section: Iula Spanish Clinical Recordmentioning
confidence: 99%
“…PharmaCoNER: We used SPACCC POS-TAGGER (Soares and gonzalez agirre, 2019) for sentence splitting, word tokenization and POS tagging. We trained FastText embeddings on the following corpora: IBECS (Rodríguez, 2002), IULA-Spanish-English-Corpus (Marimon et al, 2017), MedlinePlus (Miller et al, 2000), PubMed (Lu, 2011), ScIELO (Goldenberg et al, 2007) and PharmaCoNer . We trained embeddings on two variants of corpora: (1) Include train and development set of PharmaCoNER (2) Include complete dataset of PharmaCoNER.…”
Section: Dataset and Experimental Setupmentioning
confidence: 99%
“…Oronoz et al (2015) presented an annotated dataset in Spanish for adverse drug reactions analysis. Although the dataset is in Cruz et al (2017) and Marimon et al (2017) annotated negations in Spanish clinical reports.…”
Section: Previous Workmentioning
confidence: 99%