2016
DOI: 10.1162/tacl_a_00109
Many Languages, One Parser

Abstract: We train one multilingual model for dependency parsing and use it to parse sentences in several languages. The parsing model uses (i) multilingual word clusters and embeddings; (ii) token-level language information; and (iii) language-specific features (fine-grained POS tags). This input representation enables the parser not only to parse effectively in multiple languages, but also to generalize across languages based on linguistic universals and typological similarities, making it more effective to learn from …
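
The input representation described in the abstract can be illustrated with a short sketch. The PyTorch code below is a minimal, hypothetical rendering of points (i)–(iii): a shared multilingual word embedding, a token-level language embedding, and a fine-grained POS embedding, concatenated per token. Class names, dimensions, and the framework choice are assumptions for illustration, not the authors' implementation.

```python
import torch
import torch.nn as nn

class MultilingualTokenEncoder(nn.Module):
    """Hypothetical sketch of the abstract's input representation:
    (i) multilingual word embeddings, (ii) token-level language
    information, (iii) fine-grained POS embeddings, concatenated
    into one vector per token. All dimensions are illustrative."""

    def __init__(self, vocab_size, n_languages, n_pos_tags,
                 word_dim=100, lang_dim=8, pos_dim=25):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, word_dim)   # shared across languages
        self.lang_emb = nn.Embedding(n_languages, lang_dim)  # token-level language ID
        self.pos_emb = nn.Embedding(n_pos_tags, pos_dim)     # language-specific POS tags

    def forward(self, word_ids, lang_ids, pos_ids):
        # Each input: LongTensor of shape (batch, seq_len).
        # Output: (batch, seq_len, word_dim + lang_dim + pos_dim),
        # which a parser would consume in place of word embeddings alone.
        return torch.cat([self.word_emb(word_ids),
                          self.lang_emb(lang_ids),
                          self.pos_emb(pos_ids)], dim=-1)
```

Because the language embedding is just another input feature, typologically similar languages can end up with similar representations, which is one way the cross-lingual generalization described in the abstract could arise.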

Citation Types: 5 supporting, 251 mentioning, 1 contrasting

Years cited: 2017–2024

Cited by 210 publications (257 citation statements)
References 24 publications
“…Tsvetkov et al (2016b) used typological information in the target language as additional input to their model for phonetic representation learning. Ammar et al (2016) and … Although not for cross-lingual transfer, there has been prior work on data selection for training models. Tsvetkov et al (2016a) and Ruder and Plank (2017) use Bayesian optimization for data selection.…”
Section: Related Work
Citation type: mentioning (confidence: 99%)
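
The Bayesian-optimization data selection mentioned in this snippet (Tsvetkov et al., 2016a; Ruder and Plank, 2017) can be sketched generically. The example below uses scikit-optimize's gp_minimize over per-corpus mixing weights; dev_error is a synthetic stand-in for "train on the weighted mixture and report dev-set error", and all names and settings are illustrative rather than taken from the cited papers.

```python
from skopt import gp_minimize
from skopt.space import Real

def dev_error(weights):
    # Stand-in for: sample training data according to `weights`,
    # train a model, and return dev-set error (lower is better).
    # A synthetic quadratic pretends [0.7, 0.2, 0.1] is optimal.
    target = [0.7, 0.2, 0.1]
    return sum((w - t) ** 2 for w, t in zip(weights, target))

result = gp_minimize(
    dev_error,
    dimensions=[Real(0.0, 1.0) for _ in range(3)],  # one weight per source corpus
    n_calls=25,       # budget of (simulated) training runs
    random_state=0,   # reproducibility
)
print("selected mixing weights:", result.x)
```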
“…A common challenge in applying natural language processing (NLP) techniques to low-resource languages is the lack of training data in the languages in question. It has been demonstrated that through cross-lingual transfer, it is possible to leverage one or more similar high-resource languages to improve the performance on the low-resource languages in several NLP tasks, including machine translation (Zoph et al., 2016; Johnson et al., 2017; Nguyen and Chiang, 2017; Neubig and Hu, 2018), parsing (Täckström et al., 2012; Ammar et al., 2016; Ahmad et al., 2019; …), part-of-speech or morphological tagging (Täckström et al., 2013; Cotterell and Heigold, 2017; Malaviya et al., 2018; Plank and Agić, 2018), named entity recognition (Zhang et al., 2016; Mayhew et al., 2017; Xie et al., 2018), and entity linking (Tsai and Roth, 2016; Rijhwani et al., 2019). There are many methods for performing this transfer, including joint training (Ammar et al., 2016; Tsai and Roth, 2016; Cotterell and Heigold, 2017; Johnson et al., 2017; Malaviya et al., 2018), annotation projection (Täckström et al., 2012; Täckström et al., 2013; Zhang et al., 2016; Plank and Agić, 2018), fine-tuning (Zoph et al., 2016; Neubig and Hu, 2018), data augmentation (Mayhew et al., 2017), or zero-shot transfer (Ahmad et al., 2019; Xie et al., 2018; Neubig and Hu, 2018; Rijhwani et al., 2019).…”
Section: Introduction
Citation type: mentioning (confidence: 99%)
“…Typology as Quantization: Adding simple, discrete language identifiers to the input has been shown to be useful in multi-task multi-lingual settings (Ammar et al., 2016; Johnson et al., 2017).…”
Section: Discussion
Citation type: mentioning (confidence: 99%)
“…Finally, prior studies have noticed that word order information is significant for parsing and use it as features (Ammar et al., 2016; Naseem et al., 2012; Rasooli and Collins, 2017; Zhang and Barzilay, 2015; Dryer, 2007). … further propose to decompose these features from models for adapting to target languages.…”
Section: Related Work
Citation type: mentioning (confidence: 99%)
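
The word-order features referenced in this last snippet (cf. Dryer, 2007) are often drawn from typological databases such as WALS. Below is a hypothetical sketch of how coarse word-order typology could be encoded as a small binary vector and appended to a parser's input; the feature inventory and the tiny language table are illustrative only, not the cited papers' feature sets.

```python
# Illustrative word-order typology for a few languages:
# (basic clause order, adposition type).
WORD_ORDER = {
    "en": ("SVO", "preposition"),
    "ja": ("SOV", "postposition"),
    "hi": ("SOV", "postposition"),
}

def typology_features(lang: str) -> list[int]:
    """Encode coarse word-order typology as binary features that a
    parser could concatenate to its token or sentence representation."""
    order, adpos = WORD_ORDER[lang]
    return [
        1 if order == "SOV" else 0,
        1 if order == "SVO" else 0,
        1 if adpos == "postposition" else 0,
    ]

print(typology_features("ja"))  # -> [1, 0, 1]
```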