Uncertainty aware neural network from similarity and sensitivity

Kabir, H.M. Dipu; Mondal, Subrota Kumar; Khanam, Sadia; Khosravi, Abbas; Rahman, Shafin; Qazani, Mohammad Reza Chalak; Alizadehsani, Roohallah; Asadi, Houshyar; Mohamed, Shady; Nahavandi, Saeid; Acharya, U. Rajendra

doi:10.1016/j.asoc.2023.111027

Applied Soft Computing

2023

DOI: 10.1016/j.asoc.2023.111027

|View full text |Cite

Uncertainty aware neural network from similarity and sensitivity

H.M. Dipu Kabir,

Subrota Kumar Mondal,

Sadia Khanam

et al.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Article1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Enhancement of English-Bengali Machine Translation Leveraging Back-Translation

Mondal,

Wang,

Chen

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

An English-Bengali machine translation (MT) application can convert an English text into a corresponding Bengali translation. To build a better model for this task, we can optimize English-Bengali MT. MT for languages with rich resources, like English-German, started decades ago. However, MT for languages lacking many parallel corpora remains challenging. In our study, we employed back-translation to improve the translation accuracy. With back-translation, we can have a pseudo-parallel corpus, and the generated (pseudo) corpus can be added to the original dataset to obtain an augmented dataset. However, the new data can be regarded as noisy data because they are generated by models that may not be trained very well or not evaluated well, like human translators. Since the original output of a translation model is a probability distribution of candidate words, to make the model more robust, different decoding methods are used, such as beam search, top-k random sampling and random sampling with temperature T, and others. Notably, top-k random sampling and random sampling with temperature T are more commonly used and more optimal decoding methods than the beam search. To this end, our study compares LSTM (Long-Short Term Memory, as a baseline) and Transformer. Our results show that Transformer (BLEU: 27.80 in validation, 1.33 in test) outperforms LSTM (3.62 in validation, 0.00 in test) by a large margin in the English-Bengali translation task. (Evaluating LSTM and Transformer without any augmented data is our baseline study.) We also incorporate two decoding methods, top-k random sampling and random sampling with temperature T, for back-translation that help improve the translation accuracy of the model. The results show that data generated by back-translation without top-k or temperature sampling (“no strategy”) help improve the accuracy (BLEU 38.22, +10.42 on validation, 2.07, +0.74 on test). Specifically, back-translation with top-k sampling is less effective (k=10, BLEU 29.43, +1.83 on validation, 1.36, +0.03 on test), while sampling with a proper value of T, T=0.5 makes the model achieve a higher score (T=0.5, BLEU 35.02, +7.22 on validation, 2.35, +1.02 on test). This implies that in English-Bengali MT, we can augment the training set through back-translation using random sampling with a proper temperature T.

show abstract

Enhancement of English-Bengali Machine Translation Leveraging Back-Translation

Mondal,

Wang,

Chen

et al. 2024

Applied Sciences

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Uncertainty aware neural network from similarity and sensitivity

Cited by 1 publication

References 25 publications

Enhancement of English-Bengali Machine Translation Leveraging Back-Translation

Enhancement of English-Bengali Machine Translation Leveraging Back-Translation

Contact Info

Product

Resources

About