2021
DOI: 10.1016/j.compind.2020.103347
|View full text |Cite
|
Sign up to set email alerts
|

Portuguese word embeddings for the oil and gas industry: Development and evaluation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0
3

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

2
3

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 39 publications
0
8
0
3
Order By: Relevance
“…Table 5. 16 shows that while multimodality did not manage to improve the best F-score achieved by the best Text-Only model, Standardized NILCFT300, it did raise another model, Standardized NILCW2V100, to tie with this score. Otherwise, whatever increases in F-score as a result of the multimodal fusion that can be observed are minimal at best for this task.…”
Section: Selective Trackmentioning
confidence: 96%
See 1 more Smart Citation
“…Table 5. 16 shows that while multimodality did not manage to improve the best F-score achieved by the best Text-Only model, Standardized NILCFT300, it did raise another model, Standardized NILCW2V100, to tie with this score. Otherwise, whatever increases in F-score as a result of the multimodal fusion that can be observed are minimal at best for this task.…”
Section: Selective Trackmentioning
confidence: 96%
“…BERTimbau [44], a Portuguese language BERT model, was recently developed and added to the Hugging Face 4 library. These models, and others, have been used to advance the state-of-the-art in several Portuguese language NLP tasks [40,26,16].…”
Section: Introductionmentioning
confidence: 99%
“…Two new geosciences domain embeddings were developed during the course of this study as part of a collaboration with experts from Petrobras' CENPES research nucleus through the Geologia Digital project: PetroVec and PetroVec-Hybrid 3 . These models were thoroughly tested using both intrinsic and extrinsic tasks, and the results were compiled into an article published in the Computers in Industry journal 4 [16]. These are the current state-of-the-art models for the Portuguese language in the Geosciences domain.…”
Section: Textual Embeddingsmentioning
confidence: 99%
“…The test corpus for the geosciences domain, henceforth called GeoSim, was developed as part of the Geologia Digital project, and was used to test the PetroVec word embeddings [16]. It was developed in collaboration with several industry experts, Geology students and a PhD in Geology.…”
Section: Geosciences Domainmentioning
confidence: 99%
See 1 more Smart Citation