Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019
DOI: 10.18653/v1/p19-1061
|View full text |Cite
|
Sign up to set email alerts
|

Data Programming for Learning Discourse Structure

Abstract: This paper investigates the advantages and limits of data programming for the task of learning discourse structure. The data programming paradigm implemented in the Snorkel framework allows a user to label training data using expert-composed heuristics, which are then transformed via the "generative step" into probability distributions of the class labels given the training candidates. These results are later generalized using a discriminative model. Snorkel's attractive promise to create a large amount of ann… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 20 publications
0
9
0
Order By: Relevance
“…GWASkb with thousands of genotype-phenotype associations was created by using Snorkel in (Kuleshov et al, 2019). Snorkel was also used for chemical reaction relationship extraction (Mallory et al, 2020), discourse structure learning (Badene et al, 2019) and medical entity classification (Fries et al, 2020).…”
Section: Weakly Supervised Learningmentioning
confidence: 99%
“…GWASkb with thousands of genotype-phenotype associations was created by using Snorkel in (Kuleshov et al, 2019). Snorkel was also used for chemical reaction relationship extraction (Mallory et al, 2020), discourse structure learning (Badene et al, 2019) and medical entity classification (Fries et al, 2020).…”
Section: Weakly Supervised Learningmentioning
confidence: 99%
“…Such work includes examples from the medical domain (Callahan et al, 2019;Dutta and Saha, 2019;Dutta et al, 2020;Saab et al, 2019Saab et al, , 2020, multi-task learning (Ratner et al, 2018(Ratner et al, , 2019a, information extraction (Ehrenberg et al, 2016), and learning discourse structure (Badene et al, 2019). Like our work, such work often adjusts the Snorkel framework (Ratner et al, 2017) for the task at hand.…”
Section: Related Workmentioning
confidence: 99%
“…Weakly-supervised text classification (WTC) aims to use various weakly supervised signals to do text classification. Weak supervision signals used by existing methods includes external knowledge base (Gabrilovich et al, 2007;Chang et al, 2008;Song and Roth, 2014;Yin et al, 2019), keywords (Agichtein and Gravano, 2000;Riloff et al, 2003;Kuipers et al, 2006;Tao et al, 2015;Meng et al, 2018Meng et al, , 2019Meng et al, , 2020Mekala and Shang, 2020;Wang et al, 2021;Shen et al, 2021) and heuristics rules (Ratner et al, 2016(Ratner et al, , 2017Badene et al, 2019;Shu et al, 2020). In this paper, we focus on keyword-driven methods.…”
Section: Weakly-supervised Text Classificationmentioning
confidence: 99%