Same as existing efforts (Yang et al., 2015, 2019), we use precision (P), recall (R), and F-score (F) as metrics when evaluating the performance of dropped pronoun recovery models.

Baselines. We compare DiscProReco against existing baselines, including: (1) MEPR (Yang et al., 2015), which leverages a maximum entropy classifier to predict the type of dropped pronoun before each token; (2) NRM, which employs two MLPs to predict the position and the type of a dropped pronoun separately; (3) Bi-GRU, which utilizes a bidirectional GRU to encode each token in a pro-drop sentence and then makes predictions; (4) NDPR (Yang et al., 2019), which models the referents of dropped pronouns over a large context with a structured attention mechanism; (5) Transformer-GCRF (Yang et al., 2020), which jointly recovers the dropped pronouns in a conversational snippet with general conditional random fields; (6) XLM-RoBERTa-NDPR, which utilizes the pre-trained multilingual masked language model (Conneau et al., 2020) to encode the pro-drop utterance and its context, and then employs the attention mechanism of NDPR to model the referent semantics.
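For concreteness, the metrics follow their standard definitions; here we sketch them under the usual assumption in this line of work that a recovered pronoun counts as correct only when both its position and its type match the gold annotation:

```latex
\begin{align}
P &= \frac{\#\,\text{correctly recovered pronouns}}{\#\,\text{pronouns recovered by the model}},\\
R &= \frac{\#\,\text{correctly recovered pronouns}}{\#\,\text{gold dropped pronouns}},\\
F &= \frac{2PR}{P + R}.
\end{align}
```

F is thus the harmonic mean of precision and recall, penalizing models that trade one heavily for the other.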