2021
DOI: 10.1155/2021/9975078
|View full text |Cite
|
Sign up to set email alerts
|

Improving Loanword Identification in Low‐Resource Language with Data Augmentation and Multiple Feature Fusion

Abstract: Loanword identification is studied in recent years to alleviate data sparseness in several natural language processing (NLP) tasks, such as machine translation, cross-lingual information retrieval, and so on. However, recent studies on this topic usually put efforts on high-resource languages (such as Chinese, English, and Russian); for low-resource languages, such as Uyghur and Mongolian, due to the limitation of resources and lack of annotated data, loanword identification on these languages tends to have lo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…that synthesizes new data from existing data. When training samples are sparse and labeling costs are high, it seeks to enhance the number of data points, reduce overfitting, and improve the model's overall generalization ability [18,19].…”
Section: Data Enhancement Data Augmentation Is a Techniquementioning
confidence: 99%
See 1 more Smart Citation
“…that synthesizes new data from existing data. When training samples are sparse and labeling costs are high, it seeks to enhance the number of data points, reduce overfitting, and improve the model's overall generalization ability [18,19].…”
Section: Data Enhancement Data Augmentation Is a Techniquementioning
confidence: 99%
“…In December 2019, a novel coronavirus (COVID- 19) was transmitted between species and spread fast worldwide in a short period. The fast spread of the disease and severe economic and social devastation are far beyond people's expectations.…”
Section: Introductionmentioning
confidence: 99%