Proceedings of the 2019 Conference of the North 2019
DOI: 10.18653/v1/n19-1325
|View full text |Cite
|
Sign up to set email alerts
|

Exploiting Noisy Data in Distant Supervision Relation Classification

Abstract: Distant supervision has obtained great progress on relation classification task. However, it still suffers from noisy labeling problem. Different from previous works that underutilize noisy data which inherently characterize the property of classification, in this paper, we propose RCEND, a novel framework to enhance Relation Classification by Exploiting Noisy Data. First, an instance discriminator with reinforcement learning is designed to split the noisy data into correctly labeled data and incorrectly label… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
10
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 20 publications
0
10
0
Order By: Relevance
“…However, such instance bags may not exist in a general supervised setting. Reinforcement learning (Qin et al, 2018;Yang et al, 2019;Wang et al, 2020) and curricular learning (Jiang et al, 2018;Huang and Du, 2019) methods use a clean validation set to obtain an auxiliary model for noise filtering, while constructing a perfectly labeled validation set is expensive.…”
Section: Related Workmentioning
confidence: 99%
“…However, such instance bags may not exist in a general supervised setting. Reinforcement learning (Qin et al, 2018;Yang et al, 2019;Wang et al, 2020) and curricular learning (Jiang et al, 2018;Huang and Du, 2019) methods use a clean validation set to obtain an auxiliary model for noise filtering, while constructing a perfectly labeled validation set is expensive.…”
Section: Related Workmentioning
confidence: 99%
“…e main disadvantage of these datasets is their noisy labels. ere are many approaches proposed to deal with the problem of noisy labels, such as multiple-instance learning [16][17][18], reinforcement learning [19][20][21], the use of knowledge base side information [22,23], and attention mechanism [24,25]. In the Persian language, FarsBase [5] especially uses a distantly supervised method to extract triples for the knowledge base.…”
Section: 1mentioning
confidence: 99%
“…The main disadvantage of these datasets is their noisy labels. The are many approaches proposed to deal with the problem of noisy labels such as multiple-instance learning [15,16,17], reinforcement learning [18,19,20], the use of knowledge base side information [21,22], and attention mechanism [23,24]. In the Persian language, FarsBase [5] especially uses a distant-supervised method to extract triples for the knowledge base.…”
Section: Datasetsmentioning
confidence: 99%