2020
DOI: 10.24251/hicss.2020.028
|View full text |Cite
|
Sign up to set email alerts
|

WeSAL: Applying Active Supervision to Find High-quality Labels at Industrial Scale

Abstract: Obtaining hand-labeled training data is one of the most tedious and expensive parts of the machine learning pipeline. Previous approaches, such as active learning aim at optimizing user engagement to acquire accurate labels. Other methods utilize weak supervision to generate low-quality labels at scale. In this paper, we propose a new hybrid method named WeSAL that incorporates Weak Supervision sources with Active Learning to keep humans in the loop. The method aims to generate large-scale training labels whil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 18 publications
0
3
0
Order By: Relevance
“…Therefore, motivated by the shortcomings of these approaches, in this article, we present ArtAI an improved prediction scheme to predict the long-time popularity of news articles without the need for ground-truth observations. The scheme extends our previous work [11], which is a labeling framework that combines Weak Supervision with Active Learning to create large-scale, high-quality training data. However, ArtAI applies a novel selection policy to engage the end-users in the process.…”
Section: Introductionmentioning
confidence: 78%
“…Therefore, motivated by the shortcomings of these approaches, in this article, we present ArtAI an improved prediction scheme to predict the long-time popularity of news articles without the need for ground-truth observations. The scheme extends our previous work [11], which is a labeling framework that combines Weak Supervision with Active Learning to create large-scale, high-quality training data. However, ArtAI applies a novel selection policy to engage the end-users in the process.…”
Section: Introductionmentioning
confidence: 78%
“…Previous work has proposed a variety of methods for giving users (who are in our case the product moderators) control over classifiers by making use of a pipeline that allows them to provide feedback about training data labels and classification results. In WeSAL (Nashaat et al, 2018(Nashaat et al, , 2020 user feedback improves the labels that sets of rules assign to data points. In contrast, our focus is on feedback that allows moderators to improve the rules directly.…”
Section: Related Workmentioning
confidence: 99%
“…In the investigations presented by Biegel et al (2021) and Nashaat et al (2020), DP is integrated with AL, resulting in enhanced performance across various domains, including visual relationship analysis, spam detection, renewal sales prediction, bank operations, credit card fraud detection, occupancy sensing, and MNIST classification [30,31]. However, the application of AL in combination with DP for TS data has yet to be explored in the literature.…”
Section: Introductionmentioning
confidence: 99%