The Chinese Discourse TreeBank: a Chinese corpus annotated with discourse relations

Zhou, Yiming; Xue, Nianwen

doi:10.1007/s10579-014-9290-3

Cited by 74 publications

(52 citation statements)

References 15 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…We evaluate our model on the Chinese Discourse Treebank (CDTB) because its annotation is the most comparable to the PDTB (Zhou and Xue, 2015). The sense set consists of 10 different senses, which are not organized in a hierarchy, unlike the PDTB.…”

Section: Chinese Discourse Relationsmentioning

confidence: 99%

A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

Rutherford

Demberg

Xue

2017

Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 1

Self Cite

View full text Add to dashboard Cite

Inferring implicit discourse relations in natural language text is the most difficult subtask in discourse parsing. Many neural network models have been proposed to tackle this problem. However, the comparison for this task is not unified, so we could hardly draw clear conclusions about the effectiveness of various architectures. Here, we propose neural network models that are based on feedforward and long-short term memory architecture and systematically study the effects of varying structures. To our surprise, the best-configured feedforward architecture outperforms LSTM-based model in most cases despite thorough tuning. Further, we compare our best feedforward system with competitive convolutional and recurrent networks and find that feedforward can actually be more effective. For the first time for this task, we compile and publish outputs from previous neural and nonneural systems to establish the standard for further comparison.

show abstract

Section: Chinese Discourse Relationsmentioning

confidence: 99%

A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

Rutherford

Demberg

Xue

2017

Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 1

Self Cite

View full text Add to dashboard Cite

show abstract

“…We enumerate several characteristics in (Zhou and Xue, 2015) and phenomena from the training data set.…”

Section: Corpus and Resourcesmentioning

confidence: 99%

“…3 System Architecture Zhou and Xue (2015) pointed out that discourse connectives and punctuation marks in Chinese can serve as anchors, which are clues of discourse relations. This opinion encourages us to treat explicit and non-explicit relations similarly.…”

Section: Corpus and Resourcesmentioning

confidence: 99%

An End-to-End Chinese Discourse Parser with Adaptation to Explicit and Non-explicit Relation Recognition

Kang

Zhang

et al. 2016

Proceedings of the CoNLL-16 Shared Task

View full text Add to dashboard Cite

This paper describes our end-to-end discourse parser in the CoNLL-2016 Shared Task on Chinese Shallow Discourse Parsing. To adapt to the characteristics of Chinese, we implement a uniform framework for both explicit and non-explicit relation parsing. In this framework, we are the first to utilize a seed-expansion approach for the argument extraction subtask. In the official evaluation, our system achieves an F1 score of 26.90% in overall performance on the blind test set.

show abstract

“…PDTB's annotation scheme is adapted by the recently released Chinese Discourse Treebank (CDTB) (Zhou and Xue, 2015). Other efforts to exploit Chinese discourse relations include crosslingual annotation projection based on machine translation or word-aligned parallel corpus (Zhou et al, 2012;.…”

Section: Related Workmentioning

confidence: 99%

Sequential Annotation and Chunking of Chinese Discourse Structure

Yung¹,

Duh²,

Matsumoto

2015

Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing

View full text Add to dashboard Cite

We propose a linguistically driven approach to represent discourse relations in Chinese text as sequences. We observe that certain surface characteristics of Chinese texts, such as the order of clauses, are overt markers of discourse structures, yet existing annotation proposals adapted from formalism constructed for English do not fully incorporate these characteristics. We present an annotated resource consisting of 325 articles in the Chinese Treebank. In addition, using this annotation, we introduce a discourse chunker based on a cascade of classifiers and report 70% top-level discourse sense accuracy.

show abstract

The Chinese Discourse TreeBank: a Chinese corpus annotated with discourse relations

Cited by 74 publications

References 15 publications

A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

A Systematic Study of Neural Discourse Models for Implicit Discourse Relation

An End-to-End Chinese Discourse Parser with Adaptation to Explicit and Non-explicit Relation Recognition

Sequential Annotation and Chunking of Chinese Discourse Structure

Contact Info

Product

Resources

About