A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

Rutherford, Attapol T.; Demberg, Vera; Xue, Nianwen

doi:10.18653/v1/e17-1027

Cited by 42 publications

(37 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The PDTB is annotated with a hierarchy of relations, with 5 classes at level 1 (including the EntRel relation), and 16 at level 2 (with one relation absent from the test). It is interesting to see that this form of simple semi-supervised learning for implicit relation prediction performs quite well, especially for fine-grained relations, as the best model slightly beats the best current dedicated model, listed at 40.9% in Rutherford et al (2017).…”

Section: Resultsmentioning

confidence: 99%

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Sileo¹,

Cruys²,

Pradel³

et al. 2019

Proceedings of the 2019 Conference of the North

View full text Add to dashboard Cite

Current state of the art systems in NLP heavily rely on manually annotated datasets, which are expensive to construct. Very little work adequately exploits unannotated data -such as discourse markers between sentences -mainly because of data sparseness and ineffective extraction methods. In the present work, we propose a method to automatically discover sentence pairs with relevant discourse markers, and apply it to massive amounts of data. Our resulting dataset contains 174 discourse markers with at least 10K examples each, even for rare markers such as coincidentally or amazingly. We use the resulting data as supervision for learning transferable sentence embeddings. In addition, we show that even though sentence representation learning through prediction of discourse markers yields state of the art results across different transfer tasks, it is not clear that our models made use of the semantic relation between sentences, thus leaving room for further improvements. Our datasets are publicly available 1

show abstract

Section: Resultsmentioning

confidence: 99%

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Sileo¹,

Cruys²,

Pradel³

et al. 2019

Proceedings of the 2019 Conference of the North

View full text Add to dashboard Cite

show abstract

“…The PDTB framework allows annotations to be labelled with more than one label. In such cases we only keep the first label, in line with previous studies (among others Ji and Eisenstein, 2015;Rutherford et al, 2017).…”

Section: Methodsmentioning

confidence: 99%

“…The main purpose of this study is to assess the performance of transfer learning on the implicit discourse relation classification task. To this end, we use a simple feedforward network fed with multilingual sentence embeddings following the finding of (Rutherford et al, 2017) which shows that simple discourse models with feedforward layers perform on par or better than those of with surface features or recurrent and convolutional architectures. We follow the model of due to its simplicity and robust nature even in the multilingual setting with different argument and discourse relation representations.…”

Section: Modelmentioning

confidence: 99%

Zero-shot transfer for implicit discourse relation classification

Kurfalı

Östling

2019

Proceedings of the 20th Annual SIGdial Meeting on Discourse and Dialogue

View full text Add to dashboard Cite

Automatically classifying the relation between sentences in a discourse is a challenging task, in particular when there is no overt expression of the relation. It becomes even more challenging by the fact that annotated training data exists only for a small number of languages, such as English and Chinese. We present a new system using zero-shot transfer learning for implicit discourse relation classification, where the only resource used for the target language is unannotated parallel text. This system is evaluated on the discourse-annotated TED-MDB parallel corpus, where it obtains good results for all seven languages using only English training data.

show abstract

“…The architecture of the model we use is illustrated in Figure 1. Regarding the initialization, regularization and learning algorithm, we follow all the settings in (Rutherford et al, 2017). We adopt cross-entropy as our cost function, adagrad as the optimization algorithm, initialized all the weights in the model with uniform random and set dropout layers after the embedding and output layer with a drop rate of 0.2 and 0.5 respectively.…”

Section: Modelmentioning

confidence: 99%

Do We Need Cross Validation for Discourse Relation Classification?

Wei¹,

Demberg

2017

Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 2

Self Cite

View full text Add to dashboard Cite

The task of implicit discourse relation classification has received increased attention in recent years, including two CoNNL shared tasks on the topic. Existing machine learning models for the task train on sections 2-21 of the PDTB and test on section 23, which includes a total of 761 implicit discourse relations. In this paper, we'd like to make a methodological point, arguing that the standard test set is too small to draw conclusions about whether the inclusion of certain features constitute a genuine improvement, or whether one got lucky with some properties of the test set, and argue for the adoption of cross validation for the discourse relation classification task by the community.

show abstract

A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

Cited by 42 publications

References 34 publications

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Mining Discourse Markers for Unsupervised Sentence Representation Learning

Zero-shot transfer for implicit discourse relation classification

Do We Need Cross Validation for Discourse Relation Classification?

Contact Info

Product

Resources

About