Weakly Supervised Cross-Lingual Named Entity Recognition via Effective Annotation and Representation Projection

Ni, Jian; Dinu, Georgiana; Florian, Radu

doi:10.18653/v1/p17-1135

Cited by 123 publications

(131 citation statements)

References 21 publications

Supporting

Mentioning

130

Contrasting

“…Regardless, the BERT zero-resource performance far exceeds the results published in previous work. Mayhew et al (2017) and Ni et al (2017) do use some cross-lingual resources (like bilingual dictionaries) in their experiments, but it appears that BERT with multilingual pretraining performs better, even though it does not have access to crosslingual information. Table 3: Median cosine similarity between the mean-pooled BERT embeddings of MLDoc English documents and their translations, with and without language-adversarial training.…”

Section: CoNLL NER Results (mentioning)

confidence: 97%

Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER

Keung, Lu, Bhardwaj

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

Contextual word embeddings (e.g. GPT, BERT, ELMo, etc.) have demonstrated state-of-the-art performance on various NLP tasks. Recent work with the multilingual version of BERT has shown that the model performs very well in cross-lingual settings, even when only labeled English data is used to finetune the model. We improve upon multilingual BERT's zero-resource cross-lingual performance via adversarial learning. We report the magnitude of the improvement on the multilingual MLDoc text classification and CoNLL 2002/2003 named entity recognition tasks. Furthermore, we show that language-adversarial training encourages BERT to align the embeddings of English documents and their translations, which may be the cause of the observed performance gains.

“…This requires first projecting annotations from the source data to the (unlabeled) target data. Many approaches in this category rely upon parallel corpora (Yarowsky et al, 2001; Zeman and Resnik, 2008; Ehrmann et al, 2011; Fu et al, 2011; Ni et al, 2017), first annotating the source data using a trained model and then projecting the annotations. Only a few works explore the use of MT to first translate a gold annotated corpus to obtain a synthetic parallel corpus and then project annotations (Tiedemann et al, 2014).…”

Section: Annotation Projection (mentioning)

confidence: 99%

“…When projecting annotations, one encounters the problem of word alignment. Most of the existing works (Yarowsky et al, 2001; Shah et al, 2010; Ni et al, 2017) rely upon unsupervised alignment models from statistical MT literature, such as IBM Models 1-6 (Brown et al, 1993; Och and Ney, 2003). Other works focus on low-resource settings (Mayhew et al, 2017; Xie et al, 2018) perform translation word-by-word or phrase-by-phrase, and thus do not need to perform word alignment.…”

Section: Annotation Projection (mentioning)

confidence: 99%

“…Since this algorithm can produce multiple matches for a given source entity, we postprocess the alignments produced by this algorithm and select the longest match and then project tags in the same way as our method. Our third baseline is Ni et al (2017) (Co-decoding), who use a co-decoding scheme on two different NER models. We also compare our method with Polyglot-NER (Al-Rfou et al, 2015) who use Wikipedia links to project entities.…”

Section: Experimental Evaluation (mentioning)

confidence: 99%

Entity Projection via Machine Translation for Cross-Lingual NER

Jain, Paranjape

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

Although over 100 languages are supported by strong off-the-shelf machine translation systems, only a subset of them possess large annotated corpora for named entity recognition. Motivated by this fact, we leverage machine translation to improve annotation-projection approaches to cross-lingual named entity recognition. We propose a system that improves over prior entity-projection methods by: (a) leveraging machine translation systems twice: first for translating sentences and subsequently for translating entities; (b) matching entities based on orthographic and phonetic similarity; and (c) identifying matches based on distributional statistics derived from the dataset. Our approach improves upon current state-of-the-art methods for cross-lingual named entity recognition on 5 diverse languages by an average of 4.1 points. Further, our method achieves state-of-the-art F1 scores for Armenian, outperforming even a monolingual model trained on Armenian source data.

“…In annotation projection approaches, parallel or comparable corpora are commonly used (Yarowsky et al, 2001; Ehrmann et al, 2011; Das and Petrov, 2011; Li et al, 2012; Täckström et al, 2013; Wang and Manning, 2014; Ni et al, 2017). The source language sentences of parallel corpora are first annotated either manually or by a pretrained tagger.…”

Section: Annotation Projection (mentioning)

confidence: 99%

Low-Resource Sequence Labeling via Unsupervised Multilingual Contextualized Representations

Bao, Huang, et al.

2019

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen

Previous work on cross-lingual sequence labeling tasks either requires parallel data or bridges the two languages through word-by-word matching. Such requirements and assumptions are infeasible for most languages, especially for languages with large linguistic distances, e.g., English and Chinese. In this work, we propose a Multilingual Language Model with deep semantic Alignment (MLMA) to generate language-independent representations for cross-lingual sequence labeling. Our methods require only monolingual corpora with no bilingual resources at all and take advantage of deep contextualized representations. Experimental results show that our approach achieves new state-of-the-art NER and POS performance across European languages, and is also effective on distant language pairs such as English and Chinese.
