2023
DOI: 10.48550/arxiv.2302.13007
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

AugGPT: Leveraging ChatGPT for Text Data Augmentation

Abstract: Text data augmentation is an effective strategy for overcoming the challenge of limited sample sizes in many natural language processing (NLP) tasks. This challenge is especially prominent in the few-shot learning scenario, where the data in the target domain is generally much scarcer and of lowered quality. A natural and widely-used strategy to mitigate such challenges is to perform data augmentation on the training data to better capture the data invariance and increase the sample size. However, current text… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
47
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(48 citation statements)
references
References 78 publications
1
47
0
Order By: Relevance
“…• CBERT [15]: First, we utilize BERT's segment embeddings to condition the BERT model on the class labels during finetuning. 2 We then finetuned the model with the masked language model (MLM) objective which randomly masks some words in the sequences and aims to predict the original word using the context. Finally, we use the resulting model to predict and replace masked words in the training set.…”
Section: Baseline Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…• CBERT [15]: First, we utilize BERT's segment embeddings to condition the BERT model on the class labels during finetuning. 2 We then finetuned the model with the masked language model (MLM) objective which randomly masks some words in the sequences and aims to predict the original word using the context. Finally, we use the resulting model to predict and replace masked words in the training set.…”
Section: Baseline Methodsmentioning
confidence: 99%
“…• ChatGPTfew-shot [2]: We used few-shot prompting of ChatGPT for data augmentation to produce several paraphrases of each sentence in the training set.…”
Section: Baseline Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…ChatGPT can assist organizations in automating their customer care assistance procedures, lowering the demand for human agents and enhancing response times. ChatGPT can aid users with personal assistance chores like making appointments or looking for information online [35]. Finally, ChatGPT can be used to create content, such as text for social media postings or marketing initiatives.…”
Section: F Wide Range Of Applicationsmentioning
confidence: 99%
“…Recently, the emergence of ChatGPT has significantly advanced NLP tasks by enhancing the capabilities of conversational models, making it a valuable tool for businesses and organizations. Chataug et al [12] leverages ChatGPT to rephrase sentences for text data augmentation. Jiao et al [23] finds the translation ability of Chat-GPT performs competitively with commercial translation products on high-resource and low-resource languages.…”
Section: Introductionmentioning
confidence: 99%