A Deep Relevance Model for Zero-Shot Document Filtering

Li, Chenliang; Zhou, Wei; Ji, Feng; Duan, Yu; Chen, Haiqing

doi:10.18653/v1/p18-1214

Cited by 15 publications

(9 citation statements)

References 31 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As to future work, we plan to transfer our model to other tasks such as reading comprehension, filtering, and summarization [22,33,35]. Also, we would like to apply reinforcement learning to improve the performance of dialogue generation.…”

Section: Discussionmentioning

confidence: 99%

Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation

Jin

Lei

Zhang

et al. 2018

Proceedings of the 27th ACM International Conference on Information and Knowledge Management

View full text Add to dashboard Cite

The task of dialogue generation aims to automatically provide responses given previous utterances. Tracking dialogue states is an important ingredient in dialogue generation for estimating users' intention. However, the expensive nature of state labeling and the weak interpretability make the dialogue state tracking a challenging problem for both task-oriented and non-task-oriented dialogue generation: For generating responses in task-oriented dialogues, state tracking is usually learned from manually annotated corpora, where the human annotation is expensive for training; for generating responses in non-task-oriented dialogues, most of existing work neglects the explicit state tracking due to the unlimited number of dialogue states.In this paper, we propose the semi-supervised explicit dialogue state tracker (SEDST) for neural dialogue generation. To this end, our approach has two core ingredients: CopyFlowNet and posterior regularization. Specifically, we propose an encoder-decoder architecture, named CopyFlowNet, to represent an explicit dialogue state with a probabilistic distribution over the vocabulary space. To optimize the training procedure, we apply a posterior regularization strategy to integrate indirect supervision. Extensive experiments conducted on both task-oriented and non-task-oriented dialogue corpora demonstrate the effectiveness of our proposed model. Moreover, we find that our proposed semi-supervised dialogue state tracker achieves a comparable performance as state-of-the-art supervised learning baselines in state tracking procedure.

show abstract

Section: Discussionmentioning

confidence: 99%

Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation

Jin

Lei

Zhang

et al. 2018

Proceedings of the 27th ACM International Conference on Information and Knowledge Management

View full text Add to dashboard Cite

show abstract

“…To reduce this gap, Sachan et al (2018) proposed two methods, namely keyword anonymisation and adaptive word dropout to regularise the model and make it rely less on the keywords. Similarly, Li et al (2018b) performed adversarial training with Gradient Reversal Layer (Ganin et al 2016) to remove category-specific information and to make the model generalise to unseen categories. Mudinas et al (2018) proposed a novel unsupervised method to bootstrap domain-specific sentiment classifiers.…”

Section: Domain Adaptationmentioning

confidence: 99%

“…While keyword (or keyphrase) extraction from text has been extensively studied, how the selection of keywords impacts dataless classification was rarely if ever discussed. Previous work used either hand-picked keywords (Druck et al 2008;Settles 2011;Meng et al 2018) or relied on only the category name or category description (Chang et al 2008;Li et al 2016;Li et al 2018b). The problem of extracting keywords from a (noisily) labelled corpus is defined formally as follows.…”

Section: Mining Abstract From (Noisy) Labelled Corpusmentioning

confidence: 99%

Learning from noisy out-of-domain corpus using dataless classification

Jin

Wanvarie

Le³

2020

Nat. Lang. Eng.

View full text Add to dashboard Cite

In real-world applications, text classification models often suffer from a lack of accurately labelled documents. The available labelled documents may also be out of domain, making the trained model not able to perform well in the target domain. In this work, we mitigate the data problem of text classification using a two-stage approach. First, we mine representative keywords from a noisy out-of-domain data set using statistical methods. We then apply a dataless classification method to learn from the automatically selected keywords and unlabelled in-domain data. The proposed approach outperformed various supervised learning and dataless classification baselines by a large margin. We evaluated different keyword selection methods intrinsically and extrinsically by measuring their impact on the dataless classification accuracy. Last but not least, we conducted an in-depth analysis of the behaviour of the classifier and explained why the proposed dataless classification method outperformed supervised learning counterparts.

show abstract

“…To the best of our knowledge, only one deep learning-based approach was proposed to address a problem similar to dataless text classification [18]. In this recent work, Li et al devised a deep relevance model for zero-shot document filtering -which consists at test time in predicting the relevance of documents with respect to a category unseen in the training set, where each category is characterized by a set of seed words.…”

Section: Related Workmentioning

confidence: 99%

Seed-Guided Deep Document Clustering

Fard¹,

Thonet²,

Gaussier³

2020

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Different users may be interested in different clustering views underlying a given collection (e.g., topic and writing style in documents). Enabling them to provide constraints reflecting their needs can then help obtain tailored clustering results. For document clustering, constraints can be provided in the form of seed words, each cluster being characterized by a small set of words. This seed-guided constrained document clustering problem was recently addressed through topic modeling approaches. In this paper, we jointly learn deep representations and bias the clustering results through the seed words, leading to a Seed-guided Deep Document Clustering approach. Its effectiveness is demonstrated on five public datasets.

show abstract

A Deep Relevance Model for Zero-Shot Document Filtering

Cited by 15 publications

References 31 publications

Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation

Explicit State Tracking with Semi-Supervisionfor Neural Dialogue Generation

Learning from noisy out-of-domain corpus using dataless classification

Seed-Guided Deep Document Clustering

Contact Info

Product

Resources

About