The Rediscovery Hypothesis: Language Models Need to Meet Linguistics
Vassilina Nikoulina,
Maxat Tezekbayev,
Nuradil Kozhakhmet
et al.
Abstract:There is an ongoing debate in the NLP community whether modern language models contain linguistic knowledge, recovered through so-called probes. In this paper we study whether linguistic knowledge is a necessary condition for good performance of modern language models, which we call the rediscovery hypothesis.In the first place we show that language models that are significantly compressed but perform well on their pretraining objectives retain good scores when probed for linguistic structures. This result sup… Show more
Set email alert for when this publication receives citations?
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.