Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study

Kulshreshtha, Saurabh; Redondo-García, José Luis; Chang, Ching-Yun

doi:10.48550/arxiv.2009.14304

Cited by 3 publications

(4 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Contextual representations can be obtained through multilingual pre-training, which encodes whole sentence and outputs contextual representation for each word (Devlin et al, 2019;Lample and Conneau, 2019). Due to the rich context information contained in the contextual representations, there are endeavors to align them in different languages (Schuster et al, 2019;Aldarmaki and Diab, 2019;Wang et al, 2020;Kulshreshtha et al, 2020;Cao et al, 2020).…”

Section: • Contextual Representation Based Methodsmentioning

confidence: 99%

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

Zhang

Xiao

et al. 2021

Preprint

View full text Add to dashboard Cite

Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in another, and is typically through learning linear projections to align monolingual word representation spaces. Two classes of word representations have been explored for BLI: static word embeddings and contextual representations, but there is no studies to combine both. In this paper, we propose a simple yet effective mechanism to combine the static word embeddings and the contextual representations to utilize the advantages of both paradigms. We test the combination mechanism on various language pairs under the supervised and unsupervised BLI benchmark settings. Experiments show that our mechanism consistently improves performances over robust BLI baselines on all language pairs by averagely improving 3.2 points in the supervised setting, and 3.1 points in the unsupervised setting 1 .

show abstract

Section: • Contextual Representation Based Methodsmentioning

confidence: 99%

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

Zhang

Xiao

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…Cross-lingual Transfer: In an endeavor to enhance the cross-lingual transfer abilities of Multilingual BERT (mBERT), Kulshreshtha et al (2020) managed to improve its performance. The researchers achieved this by aligning mBERT with cross-lingual signals, employing parallel corpora supervision, and fine-tuning the alignment.…”

Section: Evolution Of Language Modeling For Armenian Languagementioning

confidence: 99%

“…In a related effort, Ter-Hovhannisyan and Avetisyan (2022) utilized the transformer-based XLM-RoBERTa model for cross-lingual sentence alignment, highlighting its effectiveness in multilingual contexts. Additionally, Kulshreshtha et al (2020) augmented the cross-lingual transfer abilities of Multilingual BERT (mBERT) to achieve superior performance in language transfer tasks.…”

Section: The Utilization Of Llms In Armenian Nlp Tasksmentioning

confidence: 99%

Large Language Models and Low-Resource Languages: An Examination of Armenian NLP

Avetisyan,

Broneske

2023

Findings of the Association for Computational Linguistics: IJCNLP-AACL 2023 (Findings)

View full text Add to dashboard Cite

This paper presents a comprehensive review of Natural Language Processing (NLP) research on Armenian, a language that, despite its rich history and unique linguistic characteristics, is currently low-resource in the field of NLP. We critically synthesize and evaluate various studies in Armenian NLP, highlighting key advancements, challenges, and areas for improvement. A notable aspect of our work is the underlined lack of application of Large Language Models (LLMs) in Armenian NLP, signifying an area of potential exploration and development. Identifying and discussing these challenges and opportunities lays the groundwork for future research directions in Armenian NLP. The emphasis on Armenian also advocates for increased attention to low-resource languages in NLP research, stressing the importance of linguistic diversity and equity. To the best of our knowledge, this is the first paper providing such an extensive review of Armenian NLP, marking a significant contribution to the field.

show abstract

“…In addition, BERT has been incorporated into NMT models as a pre-training mechanism, leading to improved translation quality in various settings [24]. Moreover, several BERT-based models, such as mBERT (multilingual BERT) and XLM-R (Cross-lingual Language Model-RoBERTa), have been developed to handle multilingual and cross-lingual tasks [25,26].…”

Section: Machine Translation Approaches and Evolutionmentioning

confidence: 99%

Translation Performance from the User’s Perspective of Large Language Models and Neural Machine Translation Systems

Son,

Kim

2023

Information

View full text Add to dashboard Cite

The rapid global expansion of ChatGPT, which plays a crucial role in interactive knowledge sharing and translation, underscores the importance of comparative performance assessments in artificial intelligence (AI) technology. This study concentrated on this crucial issue by exploring and contrasting the translation performances of large language models (LLMs) and neural machine translation (NMT) systems. For this aim, the APIs of Google Translate, Microsoft Translator, and OpenAI’s ChatGPT were utilized, leveraging parallel corpora from the Workshop on Machine Translation (WMT) 2018 and 2020 benchmarks. By applying recognized evaluation metrics such as BLEU, chrF, and TER, a comprehensive performance analysis across a variety of language pairs, translation directions, and reference token sizes was conducted. The findings reveal that while Google Translate and Microsoft Translator generally surpass ChatGPT in terms of their BLEU, chrF, and TER scores, ChatGPT exhibits superior performance in specific language pairs. Translations from non-English to English consistently yielded better results across all three systems compared with translations from English to non-English. Significantly, an improvement in translation system performance was observed as the token size increased, hinting at the potential benefits of training models on larger token sizes.

show abstract

Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study

Cited by 3 publications

References 18 publications

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

Large Language Models and Low-Resource Languages: An Examination of Armenian NLP

Translation Performance from the User’s Perspective of Large Language Models and Neural Machine Translation Systems

Contact Info

Product

Resources

About