Improving Statistical Machine Translation for a Resource-Poor Language Using Related Resource-Rich Languages

Nakov, Preslav; Ng, Hwee Tou

doi:10.1613/jair.3540

Cited by 34 publications

(26 citation statements)

References 48 publications


“…Non-parallel co-learning approaches can help when learning representations, allow for better semantic concept understanding and even perform unseen object recognition. [148] Transfer learning is also possible on non-parallel data and allows to learn better representations through transferring information from a representation built using a data rich or clean modality to a data scarce or noisy modality. This type of transfer learning is often achieved by using coordinated multimodal representations (see Section 3.2).…”

Section: Non-parallel Data (mentioning)

confidence: 99%

Multimodal Machine Learning: A Survey and Taxonomy

Baltrušaitis

Ahuja

Morency

2019

IEEE Trans. Pattern Anal. Mach. Intell.


Our experience of the world is multimodal - we see objects, hear sounds, feel texture, smell odors, and taste flavors. Modality refers to the way in which something happens or is experienced and a research problem is characterized as multimodal when it includes multiple such modalities. In order for Artificial Intelligence to make progress in understanding the world around us, it needs to be able to interpret such multimodal signals together. Multimodal machine learning aims to build models that can process and relate information from multiple modalities. It is a vibrant multi-disciplinary field of increasing importance and with extraordinary potential. Instead of focusing on specific multimodal applications, this paper surveys the recent advances in multimodal machine learning itself and presents them in a common taxonomy. We go beyond the typical early and late fusion categorization and identify broader challenges that are faced by multimodal machine learning, namely: representation, translation, alignment, fusion, and co-learning. This new taxonomy will enable researchers to better understand the state of the field and identify directions for future research.


“…This can be implemented by making the machine learn from various iterations of combining and adjusting the scores accordingly. (Nakov and Ng, 2012) have indeed shown that results show significant deviations associated with different weights assigned to the tables.…”

Section: Future Work (mentioning)

confidence: 89%

Exploring System Combination approaches for Indo-Aryan MT Systems

Singla

Singh

Shastri

et al. 2014

Proceedings of the EMNLP'2014 Workshop on Language Technology for Closely Related Languages and Language Variants


Statistical Machine Translation (SMT) systems are heavily dependent on the quality of parallel corpora used to train translation models. Translation quality between certain Indian languages is often poor due to the lack of training data of good quality. We used triangulation as a technique to improve the quality of translations in cases where the direct translation model did not perform satisfactorily. Triangulation uses a third language as a pivot between the source and target languages to achieve an improved and more efficient translation model in most cases. We also combined multi-pivot models using linear mixture and obtained significant improvement in BLEU scores compared to the direct source-target models.


“…It should be noted that this issue does not arise only in the case of Arabic dialects; it concerns also several other under-resourced languages and many research activities focus on machine translation in the context of under-resourced or non-resourced languages. The main idea of these contributions is exploiting the proximity between an under-resourced language and the closest related resourced language (Cantonese⇒Mandarin (Zhang, 1998), Czech⇒Slovak (Hajič et al, 2000), Turkish⇒Crimean Tatar (Altintas and Cicekli, 2002), Irish⇒Scottish Gaelic (Scannell, 2006), Indonesian⇒English using Malay (Nakov and Ng, 2012) and Standard Austrian German⇒Viennese dialect (Haddow et al, 2013)).…”

Section: NLP Challenges for Arabic Dialects (mentioning)

confidence: 99%

Machine translation for Arabic dialects (survey)

Harrat

Meftouh

Smaïli

2019

Information Processing & Management


Arabic dialects, also called colloquial Arabic or vernaculars, are spoken varieties of Standard Arabic. These dialects have mixed forms with many variations due to the influence of ancient local tongues and other languages such as European ones. Many of these dialects are mutually incomprehensible. Arabic dialects were not written until recently and were used only in spoken form. Nowadays, with the advent of the internet and mobile telephony technologies, these dialects are increasingly used in written form. Indeed, this kind of communication brought everyday conversations to a written format. This allows Arab people to use their dialects, which are their actual native languages, for expressing their opinions on social media, for chatting, texting, etc. This growing use opens a new research direction for Arabic natural language processing (NLP). We focus, in this paper, on machine translation in the context of Arabic dialects. We provide a survey of recent research in this area. We report for each study a detailed description of the adopted approach and we give its most relevant contribution.
