Anais Do Simpósio Brasileiro De Computação Aplicada À Saúde (SBCAS 2019) 2019
DOI: 10.5753/sbcas.2019.6269
|View full text |Cite
|
Sign up to set email alerts
|

Named Entity Recognition for Clinical Portuguese Corpus with Conditional Random Fields and Semantic Groups

Abstract: Considering the difficulties of extracting entities from Electronic Health Records (EHR) texts in Portuguese, we explore the Conditional Random Fields (CRF) algorithm to build a Named Entity Recognition (NER) system based on a corpus of clinical Portuguese data annotated by experts. We acquaint the challenges and methods to classify Abbreviations, Disorders, Procedures and Chemicals within the texts. By selecting a meaningful set of features, and parameters with the best performance the results demonstrate tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
6
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 7 publications
0
6
0
1
Order By: Relevance
“…By providing a contextualized word representation and taking advantage of the transformer architecture, BERT-based language models have become a new paradigm for NLP tasks . The use of BERT-base models in our work had a positive impact on the results when compared to previous works with traditional machine learning algorithms and word embeddings for NER in Portuguese clinical text (de Souza et al, 2019;Lopes et al, 2019). For examples, de Souza et al (2019) evaluated three groups of entities from the Sem-ClinBr corpus using CRF, without any word embedding.…”
Section: Effect Of the Contextualized Language Modelmentioning
confidence: 91%
See 2 more Smart Citations
“…By providing a contextualized word representation and taking advantage of the transformer architecture, BERT-based language models have become a new paradigm for NLP tasks . The use of BERT-base models in our work had a positive impact on the results when compared to previous works with traditional machine learning algorithms and word embeddings for NER in Portuguese clinical text (de Souza et al, 2019;Lopes et al, 2019). For examples, de Souza et al (2019) evaluated three groups of entities from the Sem-ClinBr corpus using CRF, without any word embedding.…”
Section: Effect Of the Contextualized Language Modelmentioning
confidence: 91%
“…The use of BERT-base models in our work had a positive impact on the results when compared to previous works with traditional machine learning algorithms and word embeddings for NER in Portuguese clinical text (de Souza et al, 2019;Lopes et al, 2019). For examples, de Souza et al (2019) evaluated three groups of entities from the Sem-ClinBr corpus using CRF, without any word embedding. AS shown in Table 3, they obtained for Disorder 0.65 of F1-score, compared to our 0.79; for Procedure, they achieved 0.60 compared to our 0.70 and for Drug, they achieved 0.42 compared to our 0.91.…”
Section: Effect Of the Contextualized Language Modelmentioning
confidence: 91%
See 1 more Smart Citation
“…This type of algorithm (i.e., NER) can support many methods, such as medical concept extraction, biomedical summarization algorithms, and clinical decision support systems. Souza et al [ 57 ] described their preliminary work with promising results on exploring conditional random field (CRF) algorithms to perform NER in clinical pt-br texts. They used different fragments of our corpus and different annotation granularities (STYs and SGRs) to train and evaluate their model.…”
Section: Resultsmentioning
confidence: 99%
“…In fact, few studies are focusing on clinical text processing in general, e.g. (Oliveira et al, 2019;de Souza et al, 2019).…”
Section: Introductionmentioning
confidence: 99%