2020
DOI: 10.1007/978-3-030-61377-8_36
|View full text |Cite
|
Sign up to set email alerts
|

Impact of Text Specificity and Size on Word Embeddings Performance: An Empirical Evaluation in Brazilian Legal Domain

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
2
0
1

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 19 publications
0
2
0
1
Order By: Relevance
“…For example, "In these processes, the consumer claims for compensation for material or moral damages against an airline company due to failures in its services." [Dal Pont et al 2020]. There is an occurrence of the search string (moral).…”
Section: Methodology and Research Methodsmentioning
confidence: 99%
“…For example, "In these processes, the consumer claims for compensation for material or moral damages against an airline company due to failures in its services." [Dal Pont et al 2020]. There is an occurrence of the search string (moral).…”
Section: Methodology and Research Methodsmentioning
confidence: 99%
“…For the text representation, we use Bag of Words using term frequency (TF) values. We also tested word embeddings trained with legal documents written in Portuguese ( Dal Pont et al, 2020 ), MultiLingual Bidirectional Encoder Representations from Transformer (BERT) ( Devlin et al, 2018 ), and TF combined with Inverse Document Frequency (IDF), although TF achieved the best results for the experiments presented in this work. As an example, Multilingual BERT achieved a MAE of 6,192, a RMSE of 7,091, and a of .…”
Section: Proposed Pipeline and Experimentsmentioning
confidence: 99%
“…Também observando vetores embedings resultantes, [Dal Pont et al 2020] avaliaram o impacto da especificidade e do tamanho do corpus de texto utilizado no treinamento dos vetores. Aplicados a dados jurídicos brasileiros em vários níveis de segmentac ¸ão, os resultados mostraram que corpus menores capturam melhor as especificidades textos.…”
Section: Embeddings Orientado Ao Segmento Jurídicounclassified