Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.47
Language Models as an Alternative Evaluator of Word Order Hypotheses: A Case Study in Japanese

Abstract: We examine a methodology using neural language models (LMs) for analyzing the word order of language. This LM-based method has the potential to overcome the difficulties existing methods face, such as the propagation of preprocessor errors in count-based methods. In this study, we explore whether the LM-based method is valid for analyzing the word order. As a case study, this study focuses on Japanese due to its complex and flexible word order. To validate the LM-based method, we test (i) parallels between LMs …
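The core of the LM-based method can be illustrated with a short sketch: instead of counting constructions in a parsed corpus, one compares the probabilities a trained LM assigns to alternative orderings of the same arguments. The snippet below is a minimal illustration, not the paper's implementation; it assumes the Hugging Face transformers library and uses the public rinna/japanese-gpt2-medium checkpoint purely as a stand-in for the LMs trained in the study, scoring a dative-accusative ordering of a Japanese sentence against its scrambled counterpart.

```python
# A minimal sketch of the LM-based idea, NOT the paper's implementation.
# Assumptions (not from the paper): the Hugging Face "transformers" library and
# the public checkpoint "rinna/japanese-gpt2-medium" as a stand-in Japanese LM.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "rinna/japanese-gpt2-medium"  # illustrative checkpoint only
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME)
model.eval()

def sentence_log_likelihood(sentence: str) -> float:
    """Total log-likelihood the LM assigns to a sentence (higher = preferred)."""
    ids = tokenizer(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        # `loss` is the mean negative log-likelihood per predicted token,
        # so multiply by the number of predicted tokens to get the total.
        loss = model(ids, labels=ids).loss
    return -loss.item() * (ids.size(1) - 1)

# Two orderings of the same arguments: dative-accusative vs. accusative-dative.
orders = {
    "ni-o order": "太郎が花子に本を渡した。",  # "Taro handed Hanako a book."
    "o-ni order": "太郎が本を花子に渡した。",
}
for label, sent in orders.items():
    print(f"{label}: log p = {sentence_log_likelihood(sent):.2f}")
```

In the paper's setting, such LM preferences are then compared against word-order tendencies reported in earlier count-based Japanese linguistics work; the sketch above only shows the scoring step.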

Cited by 2 publications (1 citation statement). References 24 publications (47 reference statements).
“…The importance of word order in LMs has been a topic of debate, with various works claiming that downstream performance is not affected by scrambled inputs (Malkin et al., 2021; Sinha et al., 2021), although it has been shown that LMs are able to retain a notion of word order through their positional embeddings (Abdou et al., 2022). It has been argued that LMs acquire an abstract notion of word order that goes beyond mere n-gram co-occurrence statistics (Futrell and Levy, 2019; Kuribayashi et al., 2020; Merrill et al., 2024), a claim that we in this paper assess for large-scale LMs in the context of adjective order. Finally, numerous works have investigated the trade-off between memorization and generalization in LMs: it has been shown that larger LMs are able to memorize entire passages from the training data (Biderman et al., 2023a; Lesci et al., 2024; Prashanth et al., 2024), but generalization patterns for grammatical phenomena have also been shown to follow human-like generalization (Dankers et al., 2021; Hupkes et al., 2023; Alhama et al., 2023).…”
Section: Word Order in Language Models