Deniz Yüret scite author profile

The encoder-decoder framework for neural machine translation (NMT) has been shown effective in large data scenarios, but is much less effective for low-resource languages. We present a transfer learning method that significantly improves BLEU scores across a range of low-resource languages. Our key idea is to first train a high-resource language pair (the parent model), then transfer some of the learned parameters to the low-resource pair (the child model) to initialize and constrain training. Using our transfer learning method we improve baseline NMT models by an average of 5.6 BLEU on four low-resource language pairs. Ensembling and unknown word replacement add another 2 BLEU which brings the NMT performance on low-resource machine translation close to a strong syntax based machine translation (SBMT) system, exceeding its performance on one language pair. Additionally, using the transfer learning model for re-scoring, we can improve the SBMT system by an average of 1.3 BLEU, improving the state-of-the-art on low-resource machine translation.

show abstract

Natural Language Communication with Robots

Bisk

Yüret

Marcu

2016

142

View full text Add to dashboard Cite

We propose a framework for devising empirically testable algorithms for bridging the communication gap between humans and robots. We instantiate our framework in the context of a problem setting in which humans give instructions to robots using unrestricted natural language commands, with instruction sequences being subservient to building complex goal configurations in a blocks world. We show how one can collect meaningful training data and we propose three neural architectures for interpreting contextually grounded natural language commands. The proposed architectures allow us to correctly understand/ground the blocks that the robot should move when instructed by a human who uses unrestricted language. The architectures have more difficulty in correctly understanding/grounding the spatial relations required to place blocks correctly, especially when the blocks are not easily identifiable.

show abstract

Classification of semantic relations between nominals

Gîrju

Nakov

Năstase

et al. 2009

Lang Resources & Evaluation

109

112

View full text Add to dashboard Cite

The NLP community has shown a renewed interest in deeper semantic analyses, among them automatic recognition of semantic relations in text. We present the development and evaluation of a semantic analysis task: automatic recognition of relations between pairs of nominals in a sentence. The task was part of SemEval-2007, the fourth edition of the semantic evaluation event previously known as SensEval. Apart from the observations we have made, the long-lasting effect of this task may be a framework for comparing approaches to the task. We introduce the problem of recognizing relations between nominals, and in particular the process of drafting and refining the definitions of the semantic relations. We show how we created the training and test data, list and briefly describe the 15 participating systems, discuss the results, and conclude with the lessons learned in the course of this exercise.

show abstract

KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media

2020

View full text Add to dashboard Cite

In this paper, we describe our approach to utilize pre-trained BERT models with Convolutional Neural Networks for sub-task A of the Multilingual Offensive Language Identification shared task (OffensEval 2020), which is a part of the SemEval 2020. We show that combining CNN with BERT is better than using BERT on its own, and we emphasize the importance of utilizing pre-trained language models for downstream tasks. Our system, ranked 4 th with macro averaged F1-Score of 0.897 in Arabic, 4 th with score of 0.843 in Greek, and 3 rd with score of 0.814 in Turkish. Additionally, we present ArabicBERT, a set of pre-trained transformer language models for Arabic that we share with the community.

show abstract

Relationships between amino acid sequence and backbone torsion angle preferences

et al. 2004

View full text Add to dashboard Cite

Statistical averages and correlations for backbone torsion angles of chymotrypsin inhibitor 2 are calculated by using the Rotational Isomeric States model of chain statistics. Statistical weights of torsional states of phipsi pairs, needed for the statistics of the full chain, are obtained in two different ways: 1) by using knowledge-based pairwise dependent phipsi energy maps from Protein Data Bank (PDB) and 2) by collecting torsion angle data from a large number of random coil configurations of an all-atom protein model with volume exclusion. Results obtained by using PDB data show strong correlations between adjacent torsion angle pairs belonging to both the same and different residues. These correlations favor the choice of the native-state torsion angles, and they are strongly context dependent, determined by the specific amino acid sequence of the protein. Excluded volume or steric clashes, only, do not introduce context-dependent phipsi correlations into the chain that would affect the choice of native-state torsional angles.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Deniz Yüret

Transfer Learning for Low-Resource Neural Machine Translation

Natural Language Communication with Robots

Classification of semantic relations between nominals

KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media

Relationships between amino acid sequence and backbone torsion angle preferences

Contact Info

Product

Resources

About