This paper studies the informativeness of linguistic properties, such as part-of-speech and named entities, encoded in word representations. First, we find directions that correspond to these properties using the method of Elazar et al. (2020). We then compare these directions with the principal components obtained by applying PCA to the word embeddings. We find that part-of-speech information is more important for word embeddings than the named-entity property.
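As a rough illustration of the kind of comparison described above (not the authors' code): one way to relate a learned property direction to the PCA components is to measure how much of that direction lies in the span of the top principal components. The embeddings, the property directions, and the subspace_energy helper below are all placeholders assumed for this sketch.

```python
# Hypothetical sketch: compare linguistic-property directions with PCA components.
# The embeddings and "property directions" here are random placeholders; in the
# paper the directions would come from the method of Elazar et al. (2020).
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
embeddings = rng.normal(size=(10_000, 300))   # placeholder word embeddings
pos_direction = rng.normal(size=300)          # placeholder part-of-speech direction
ner_direction = rng.normal(size=300)          # placeholder named-entity direction

# Top-k principal components of the embedding space.
k = 50
pca = PCA(n_components=k).fit(embeddings)
components = pca.components_                  # shape (k, 300), orthonormal rows

def subspace_energy(direction: np.ndarray, basis: np.ndarray) -> float:
    """Fraction of the direction's (unit) norm captured by the span of the basis rows."""
    d = direction / np.linalg.norm(direction)
    projection = basis.T @ (basis @ d)        # project onto the PCA subspace
    return float(np.linalg.norm(projection) ** 2)

print("POS direction energy in top PCs:", subspace_energy(pos_direction, components))
print("NER direction energy in top PCs:", subspace_energy(ner_direction, components))
```

A direction whose energy concentrates in the leading components is "more important" to the embedding geometry in the sense used by the abstract.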
There is an ongoing debate in the NLP community about whether modern language models contain linguistic knowledge, recovered through so-called probes. In this paper, we study whether linguistic knowledge is a necessary condition for the good performance of modern language models, which we call the rediscovery hypothesis. First, we show that language models that are significantly compressed but perform well on their pretraining objectives retain good scores when probed for linguistic structures. This result supports the rediscovery hypothesis and leads to the second contribution of our paper: an information-theoretic framework that relates language-modeling objectives with linguistic information. This framework also provides a metric to measure the impact of linguistic information on the word prediction task. We reinforce our analytical results with various experiments on both synthetic and real NLP tasks in English.
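For concreteness, a minimal probing setup of the kind the abstract alludes to might look like the sketch below, assuming frozen token representations and gold part-of-speech labels are available; the data, dimensions, and classifier choice are placeholders, not the paper's experimental protocol.

```python
# Minimal probing sketch (assumed setup, not the paper's exact protocol):
# fit a linear classifier on frozen representations to predict a linguistic label,
# and use its accuracy as the probing score for the (possibly compressed) model.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
# Placeholder data: in practice these would be token representations from a
# pretrained (or compressed) language model and their gold POS tags.
features = rng.normal(size=(5_000, 768))
labels = rng.integers(0, 17, size=5_000)      # e.g. 17 universal POS tags

X_train, X_test, y_train, y_test = train_test_split(
    features, labels, test_size=0.2, random_state=0
)
probe = LogisticRegression(max_iter=1_000).fit(X_train, y_train)
print("probing accuracy:", probe.score(X_test, y_test))
```

Comparing such probing scores before and after compression is one way to check whether a model that still performs well on its pretraining objective also retains the probed linguistic information.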
Softmax is the de facto standard for normalizing logits in modern neural networks for language processing. However, because it produces a dense probability distribution, each token in the vocabulary has a nonzero chance of being selected at each generation step, leading to a variety of reported problems in text generation. The α-entmax of Peters et al. (2019) solves this problem, but is unfortunately slower than softmax. In this paper, we propose an alternative to α-entmax, which keeps its virtuous characteristics but is as fast as optimized softmax and achieves on-par or better performance on the machine translation task.
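To make the dense-versus-sparse contrast concrete, the sketch below compares softmax with sparsemax, a well-known sparse normalizer used here purely for illustration; it is neither the α-entmax of Peters et al. (2019) nor the alternative proposed in this abstract.

```python
import numpy as np

def softmax(z: np.ndarray) -> np.ndarray:
    """Dense: every logit gets a strictly positive probability."""
    e = np.exp(z - z.max())
    return e / e.sum()

def sparsemax(z: np.ndarray) -> np.ndarray:
    """Sparse alternative: Euclidean projection of the logits onto the simplex,
    so low-scoring tokens receive exactly zero probability."""
    z_sorted = np.sort(z)[::-1]
    cumsum = np.cumsum(z_sorted)
    k = np.arange(1, len(z) + 1)
    support = 1 + k * z_sorted > cumsum       # which sorted entries stay in the support
    k_max = k[support][-1]
    tau = (cumsum[support][-1] - 1) / k_max   # threshold subtracted from the logits
    return np.maximum(z - tau, 0.0)

logits = np.array([2.0, 1.5, 0.2, -1.0, -2.5])
print("softmax:  ", softmax(logits))          # all entries > 0
print("sparsemax:", sparsemax(logits))        # tail entries exactly 0
```

Under softmax every token keeps a strictly positive probability, whereas the sparse map assigns exact zeros to the tail; that is the property the abstract's alternative aims to retain while matching the speed of optimized softmax.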
We show that the skip-gram embedding of any word can be decomposed into two subvectors that roughly correspond to the semantic and syntactic roles of the word.