Proceedings of the 2021 International Conference on Management of Data 2021
DOI: 10.1145/3448016.3457334
|View full text |Cite
|
Sign up to set email alerts
|

Adaptive Rule Discovery for Labeling Text Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(11 citation statements)
references
References 11 publications
0
11
0
Order By: Relevance
“…• Uncertainty Sampling (US) [15]: US is a classic and competitive method within active learning paradigm [29]. • IWS-LSE [6]: IWS-LSE is a representative method under the interactive weak supervision paradigm considered in [6,12], where the user is iteratively queried to provide feedback on whether a suggested labeling heuristic is useful or not.…”
Section: Nemo End-to-end System Performancementioning
confidence: 99%
See 1 more Smart Citation
“…• Uncertainty Sampling (US) [15]: US is a classic and competitive method within active learning paradigm [29]. • IWS-LSE [6]: IWS-LSE is a representative method under the interactive weak supervision paradigm considered in [6,12], where the user is iteratively queried to provide feedback on whether a suggested labeling heuristic is useful or not.…”
Section: Nemo End-to-end System Performancementioning
confidence: 99%
“…• Under-formalized LF Development Workflow: The lack of formalism on the LF development process has obscured systematic study to optimize the workflow, making it less organized and more challenging for practitioners to design LFs for DP applications [6,12,33,35]. • Inefficient Development Data Selection: Current LF development workflow selects development data with the most straightforward approach, uniform random sampling, which unfortunately can be time-consuming as it oftentimes requires users to inspect a considerable amount of data samples to create an informative set of LFs.…”
Section: Introductionmentioning
confidence: 99%
“…Snuba [91] generates heuristics based on a small set of labeled datasets. [7] and [25] interactively generate labeling functions based on user feedback. TALLOR [46] and GLaRA [106] automatically augment an initial set of labeling functions with new ones.…”
Section: Related Workmentioning
confidence: 99%
“…(2) Active generation and repurposing of supervision sources. To further reduce human annotation efforts, very recently, researchers turn to active generation [91,46,106,7,25] and repurposing [27] of supervision sources. In the future, we plan to incorporate these new tasks and methods into WRENCH to extend its scope.…”
Section: A3 Hosting and Maintenance Planmentioning
confidence: 99%
“…Anchor learning [27] can also suggest new selfsupervision for human review. Darwin [42] incorporates active learning for verifying proposed rules, but it doesn't conduct structure learning, and like Snuba and other data programming methods, it only models individual instances.…”
Section: Task-specific Self-supervisionmentioning
confidence: 99%