2023
DOI: 10.1016/j.joi.2023.101453
|View full text |Cite
|
Sign up to set email alerts
|

A Zipf's law-based text generation approach for addressing imbalance in entity extraction

Zhenhua Wang,
Ming Ren,
Dong Gao
et al.
Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(1 citation statement)
references
References 34 publications
0
1
0
Order By: Relevance
“…For the convenience of training, we process the train set to separate each content and its theme using "SEP," terminated by "EOS." We leverage the Bert Tokenizer to generate token embeddings from the processed train set (Wang et al, 2023), simultaneously forming position embeddings based on each token's location. Moreover, to distinguish between the Content and Theme, SEP is embedded, serving as segment embeddings.…”
Section: Mobile Data Economizing Expression Generatormentioning
confidence: 99%
“…For the convenience of training, we process the train set to separate each content and its theme using "SEP," terminated by "EOS." We leverage the Bert Tokenizer to generate token embeddings from the processed train set (Wang et al, 2023), simultaneously forming position embeddings based on each token's location. Moreover, to distinguish between the Content and Theme, SEP is embedded, serving as segment embeddings.…”
Section: Mobile Data Economizing Expression Generatormentioning
confidence: 99%