2019
DOI: 10.1109/access.2019.2960263
|View full text |Cite
|
Sign up to set email alerts
|

Hierarchical Data Augmentation and the Application in Text Classification

Abstract: The applications of data augmentation in natural language processing have been limited. In this paper, we propose a novel method named Hierarchical Data Augmentation (HDA) which applied for text classification. Firstly, inspired by the hierarchical structure of texts, as words form a sentence and sentences form a document, HDA implements a hierarchical data augmentation strategy by augmenting texts at word-level and sentence level respectively. Secondly, inspired by the cropping, a popular method of data augme… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(16 citation statements)
references
References 15 publications
0
14
0
Order By: Relevance
“…Other studies have used data noising as smoothing [12] and predictive language models for SR [13]. Hierarchical Data Augmentation (HDA) augments data by extracting the most relevant parts from sentences and connecting them in order using a hierarchical attention network model [14]. Most sentence data augmentation research focuses on one language.…”
Section: Related Workmentioning
confidence: 99%
“…Other studies have used data noising as smoothing [12] and predictive language models for SR [13]. Hierarchical Data Augmentation (HDA) augments data by extracting the most relevant parts from sentences and connecting them in order using a hierarchical attention network model [14]. Most sentence data augmentation research focuses on one language.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, a new proposed method in [21] showed the DA technique-enabled construction of an accurate model with small whole-body CT images. In domains other than images datasets, DA has also been proven effective in, e.g., text classification [22] [23] [24], speech recognition [25] [26], and EEG classification [27]. In this study, we propose a novel method for DA that can be applied to various forms of data for many applications including image and text classification.…”
Section: Related Workmentioning
confidence: 99%
“…The first scenario is the original data without augmentation. The second is partially augmented data without balance via adding more samples only for small categories (22,37). The third scenario is augmented and balanced data.…”
Section: Experimental Settingsmentioning
confidence: 99%
See 1 more Smart Citation
“…Data augmentation can create several new feature spaces and increase the amount of training data without additional ground truth labels, which has been widely used to improve the performance and generalizability of downstream predictive models. Many works have proposed data augmentation technologies on different types of features, such as images [7,28,15], texts [12,29], vectorized features [18,6], etc. However, how to effectively augment graph data remain a challenging problem, as graph data is more complex and has non-Euclidean structures.…”
Section: Introductionmentioning
confidence: 99%