Robust Data Programming with Precision-guided Labeling Functions
2020 · DOI: 10.1609/aaai.v34i04.5742

Abstract: Scarcity of labeled data is a bottleneck for supervised learning models. A paradigm that has evolved to deal with this problem is data programming. An existing data programming paradigm allows human supervision to be provided as a set of discrete labeling functions (LFs) that output possibly noisy labels for input instances, together with a generative model for consolidating the weak labels. We enhance and generalize this paradigm by supporting functions that output a continuous score (instead of a hard label) that noi…

Cited by 12 publications (19 citation statements)
References 9 publications
“…Data Programming and Unsupervised Learning: Snorkel (Ratner et al, 2016) has been proposed as a generative model to determine correct label probability using consensus on the noisy and conflicting labels assigned by the discrete LFs. Chatterjee et al (2020) proposed a graphical model, CAGE, that uses continuous-valued LFs with scores obtained using soft match techniques such as cosine similarity of word vectors, TF-IDF score, distance among entity pairs, etc. Owing to its generative model, Snorkel is highly sensitive to initialisation and hyper-parameters.…”
Section: Related Work
confidence: 99%
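The excerpt above describes continuous-valued LFs whose scores come from soft-match techniques such as cosine similarity. A minimal sketch of that idea, using bag-of-words cosine similarity in place of word vectors (the `make_continuous_lf` helper and the `lf_sports` example are hypothetical, not from CAGE itself):

```python
import math
from collections import Counter

def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words count dicts."""
    common = set(a) & set(b)
    dot = sum(a[t] * b[t] for t in common)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def make_continuous_lf(seed_phrase, label):
    """Build a continuous LF that returns (label, score), where score is
    the cosine similarity of the input text to a seed phrase.
    Hypothetical illustration of a soft-match LF, not the CAGE API."""
    seed = Counter(seed_phrase.lower().split())
    def lf(text):
        return label, cosine_similarity(seed, Counter(text.lower().split()))
    return lf

# A toy LF that softly votes for class 1 ("sports") via lexical overlap.
lf_sports = make_continuous_lf("match team score win", label=1)
label, score = lf_sports("the team managed to win the final match")
```

A discrete LF would threshold such a score into a hard vote; the continuous variant instead passes the score itself to the generative model, preserving how confident the match was.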
“…may also produce conflicting labels. In the past, generative models such as Snorkel (Ratner et al, 2016) and CAGE (Chatterjee et al, 2020) have been proposed for consensus on the noisy and conflicting labels assigned by the discrete LFs to determine the probability of the correct labels. Labels thus obtained could be used for training any supervised model/classifier and evaluated on a test set.…”
Section: Motivating Example
confidence: 99%
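The excerpt above describes reaching consensus on noisy, conflicting LF votes. As a much simpler stand-in for the generative models it names (Snorkel, CAGE), a majority-vote baseline sketches the consolidation step; the `majority_vote` function and the vote matrix below are illustrative assumptions, not either system's actual model:

```python
from collections import Counter

ABSTAIN = None  # an LF may abstain rather than vote

def majority_vote(vote_rows):
    """Consolidate noisy, possibly conflicting LF votes by simple majority.

    A simplistic stand-in for generative consensus models; ties resolve
    to the first-seen label, and all-abstain rows stay unlabeled.
    """
    consolidated = []
    for votes in vote_rows:
        counts = Counter(v for v in votes if v is not ABSTAIN)
        consolidated.append(counts.most_common(1)[0][0] if counts else ABSTAIN)
    return consolidated

# Rows are instances; columns are three hypothetical discrete LFs.
votes = [
    [1, 1, 0],         # two LFs say 1, one says 0 -> 1
    [0, ABSTAIN, 0],   # abstains are ignored      -> 0
    [ABSTAIN] * 3,     # no LF fires               -> unlabeled
    [1, 1, ABSTAIN],   # agreement                 -> 1
]
consolidated = majority_vote(votes)  # [1, 0, None, 1]
```

The generative models improve on this baseline by learning per-LF accuracies, so a reliable LF can outvote several unreliable ones; the consolidated (probabilistic) labels then train any downstream classifier, as the excerpt notes.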
“…Firstly, avoiding over-dependence on such hand-crafted features, since the above approaches limit the scope for in-the-wild HOI detections. Such over-dependence has been averted in both textual [2] and image [18] domains and we take inspiration from such works. More often than not, the 3D poses or 3D centroids of objects (used as features) are either not available or are too erroneously estimated to be simply plugged into a model trained on hand-crafted features.…”
Section: Related Work
confidence: 99%