Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017
DOI: 10.18653/v1/k17-3007

Adversarial Training for Cross-Domain Universal Dependency Parsing

Abstract: We describe our submission to the CoNLL 2017 shared task, which exploits the shared common knowledge of a language across different domains via a domain adaptation technique. Our approach is an extension to the recently proposed adversarial training technique for domain adaptation, which we apply on top of a graph-based neural dependency parsing model on bidirectional LSTMs. In our experiments, we find our baseline graph-based parser already outperforms the official baseline model (UDPipe) by a large margin. Fu…
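The core ingredient of the adversarial training the abstract refers to is a domain discriminator attached to the shared encoder through a gradient-reversal layer. The following is a minimal PyTorch sketch of that general setup only; all module names, sizes, and the pooling choice are assumptions, not the submission's actual architecture.

```python
# Minimal sketch of adversarial domain adaptation over a BiLSTM encoder.
# Illustrative only: names, dimensions, and pooling are assumptions.
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; scales the gradient by -lam on the backward pass."""
    @staticmethod
    def forward(ctx, x, lam):
        ctx.lam = lam
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lam * grad_output, None

class AdversarialEncoder(nn.Module):
    def __init__(self, vocab_size, n_domains, emb_dim=100, hidden=200):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True, bidirectional=True)
        self.domain_clf = nn.Linear(2 * hidden, n_domains)  # domain discriminator

    def forward(self, word_ids, lam=1.0):
        states, _ = self.bilstm(self.embed(word_ids))   # (batch, seq, 2*hidden)
        pooled = states.mean(dim=1)                      # sentence summary for the discriminator
        domain_logits = self.domain_clf(GradReverse.apply(pooled, lam))
        # states feed the graph-based parser; domain_logits feed the domain loss
        return states, domain_logits
```

Training minimizes the parser loss plus the domain-classification loss; because the gradient is reversed before it reaches the encoder, the shared BiLSTM is pushed toward domain-invariant representations.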

Cited by 38 publications (48 citation statements)
References 7 publications
“…When the data is very skewed, as for Russian, the effect of adding a small treebank to a large one is minor, as expected. While our results are not directly comparable to the adversarial learning in Sato et al (2017), who used a different parser and test set, the improvements of C+FT and TB-EMB are typically at least on par with and often larger than their improvements. …”
Section: Results (contrasting)
confidence: 73%
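For context on the two baselines named in this statement, a treebank embedding (TB-EMB) marks each token with the identity of its source treebank so that a single parser can be trained on the concatenated data. A possible minimal sketch (hypothetical names and sizes, not the cited paper's code) is:

```python
# Hypothetical sketch of treebank embeddings (TB-EMB): each word vector is
# concatenated with an embedding of the treebank the sentence came from.
import torch
import torch.nn as nn

class TreebankEmbeddingInput(nn.Module):
    def __init__(self, vocab_size, n_treebanks, word_dim=100, tb_dim=12):
        super().__init__()
        self.word_embed = nn.Embedding(vocab_size, word_dim)
        self.tb_embed = nn.Embedding(n_treebanks, tb_dim)

    def forward(self, word_ids, treebank_id):
        # word_ids: (batch, seq); treebank_id: (batch,) tensor of treebank indices
        w = self.word_embed(word_ids)                 # (batch, seq, word_dim)
        t = self.tb_embed(treebank_id)                # (batch, tb_dim)
        t = t.unsqueeze(1).expand(-1, w.size(1), -1)  # broadcast over the sequence
        return torch.cat([w, t], dim=-1)              # input to the shared encoder
```

C+FT, by contrast, simply trains on the concatenation and then fine-tunes on the target treebank; it needs no architectural change.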
“…DANNs have been applied in many NLP tasks in the last few years, mainly to sentiment classification (e.g., Ganin et al (2016), Li et al (2018a), Shen et al (2018), Rocha and Lopes Cardoso (2019), Ghoshal et al (2020), to name a few), but recently to many other tasks as well: language identification (Li et al, 2018a), natural language inference (Rocha and Lopes Cardoso, 2019), POS tagging (Yasunaga et al, 2018), parsing (Sato et al, 2017), trigger identification (Naik and Rose, 2020), relation extraction (Fu et al, 2017; Rios et al, 2018), and other (binary) text classification tasks like relevancy identification (Alam et al, 2018a), machine reading comprehension, stance detection (Xu et al, 2019), and duplicate question detection (Shah et al, 2018). This makes DANNs the most widely used UDA approach in NLP, as illustrated in Table 1.…”
Section: Domain Adversaries (mentioning)
confidence: 99%
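All of the DANN applications listed above share the same joint objective: minimize the task loss while confusing a domain classifier, usually via a gradient-reversal layer as in the encoder sketch after the abstract. A hedged single-step illustration (hypothetical names; `task_head.loss` is an assumed helper, and the encoder is the sketch above) is:

```python
# Illustrative DANN training step; not any specific paper's training code.
import torch.nn.functional as F

def dann_step(encoder, task_head, optimizer, word_ids, task_labels, domain_labels, lam=0.1):
    states, domain_logits = encoder(word_ids, lam=lam)
    task_loss = task_head.loss(states, task_labels)          # e.g. arc/label loss for parsing
    domain_loss = F.cross_entropy(domain_logits, domain_labels)
    loss = task_loss + domain_loss  # the reversal layer already negates the encoder's domain gradient
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return task_loss.item(), domain_loss.item()
```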
“…where w_l indicates the lexical feature, w_d indicates the delexicalized feature, and ⊙ is element-wise multiplication. The difference between Sato et al (2017) and ours is that we remove the adversarial training loss, because we have already used the universal information in the shared network.…”
Section: Joint Training (mentioning)
confidence: 99%
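To make the quoted fusion concrete, one possible reading (names and dimensions are assumptions, not the cited system) of combining a lexical embedding w_l with a delexicalized, universal-feature embedding w_d by element-wise multiplication is:

```python
# Hypothetical sketch of fusing lexical and delexicalized features
# by element-wise multiplication, as in the quoted description.
import torch.nn as nn

class LexDelexFusion(nn.Module):
    def __init__(self, vocab_size, n_universal_tags, dim=100):
        super().__init__()
        self.lex = nn.Embedding(vocab_size, dim)          # w_l: word-specific
        self.delex = nn.Embedding(n_universal_tags, dim)  # w_d: universal (delexicalized) tags

    def forward(self, word_ids, tag_ids):
        return self.lex(word_ids) * self.delex(tag_ids)   # element-wise product
```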
“…Beyond embedding-based methods, a natural question is whether we can use a simple way to utilize the universal information. Some previous research either regarded the universal information as extra training signals (e.g., delexicalized embedding (Dehouck and Denis, 2017)), or implicitly trained a network with all features (e.g., adversarial training for parsing in Sato et al (2017)). In our system, we manually and explicitly share the universal annotations via a shared LSTM component.…”
Section: Introduction (mentioning)
confidence: 99%
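As a rough illustration of "explicitly sharing the universal annotations via a shared LSTM component" (the cited architecture is not reproduced here; names and sizes are assumptions), one shared BiLSTM can encode the universal annotations for every treebank while private components handle treebank-specific words:

```python
# Hypothetical sketch: a shared BiLSTM over universal annotations plus
# per-treebank private word encoders; outputs are concatenated for the parser.
import torch
import torch.nn as nn

class SharedUniversalEncoder(nn.Module):
    def __init__(self, n_universal_tags, word_vocab_sizes, dim=100, hidden=200):
        super().__init__()
        self.upos_embed = nn.Embedding(n_universal_tags, dim)  # shared across treebanks
        self.shared_lstm = nn.LSTM(dim, hidden, batch_first=True, bidirectional=True)
        self.word_embeds = nn.ModuleList(
            [nn.Embedding(v, dim) for v in word_vocab_sizes])
        self.private_lstms = nn.ModuleList(
            [nn.LSTM(dim, hidden, batch_first=True, bidirectional=True)
             for _ in word_vocab_sizes])

    def forward(self, treebank_id, word_ids, upos_ids):
        # treebank_id: Python int selecting this batch's private components
        shared, _ = self.shared_lstm(self.upos_embed(upos_ids))
        private, _ = self.private_lstms[treebank_id](self.word_embeds[treebank_id](word_ids))
        return torch.cat([shared, private], dim=-1)  # fed to the parser's scorer
```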