Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models

Michaelov, James; Arnett, Catherine; Chang, Tyler; Bergen, Ben

doi:10.18653/v1/2023.emnlp-main.227

Cited by 2 publications

References 72 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Active Use of Latent Constituency Representation in both Humans and Large Language Models

Ding,

Liu,

Xiang

2024

Preprint

View full text Add to dashboard Cite

Understanding how sentences are internally represented in the human brain, as well as in large language models (LLMs) such as ChatGPT, is a major challenge for cognitive science. Classic linguistic theories propose that the brain represents a sentence by parsing it into hierarchically organized constituents. In contrast, LLMs do not explicitly parse linguistic constituents and their latent representations remains poorly explained. Here, we demonstrate that humans and LLMs construct similar latent representations of hierarchical linguistic constituents by analyzing their behaviors during a novel one-shot learning task, in which they infer which words should be deleted from a sentence. Both humans and LLMs tend to delete a constituent, instead of a nonconstituent word string. In contrast, a naive sequence processing model that has access to word properties and ordinal positions does not show this property. Based on the word deletion behaviors, we can reconstruct the latent constituency tree representation of a sentence for both humans and LLMs. These results demonstrate that a latent tree-structured constituency representation can emerge in both the human brain and LLMs.

show abstract

Active Use of Latent Constituency Representation in both Humans and Large Language Models

Ding,

Liu,

Xiang

2024

Preprint

View full text Add to dashboard Cite

show abstract

MacBehaviour: An R package for behavioural experimentation on large language models

Duan,

Li,

Cai

2024

Behav Res

View full text Add to dashboard Cite

The study of large language models (LLMs) and LLM-powered chatbots has gained significant attention in recent years, with researchers treating LLMs as participants in psychological experiments. To facilitate this research, we developed an R package called “MacBehaviour “ (https://github.com/xufengduan/MacBehaviour), which interacts with over 100 LLMs, including OpenAI's GPT family, the Claude family, Gemini, Llama family, and other open-weight models. The package streamlines the processes of LLM behavioural experimentation by providing a comprehensive set of functions for experiment design, stimuli presentation, model behaviour manipulation, and logging responses and token probabilities. With a few lines of code, researchers can seamlessly set up and conduct psychological experiments, making LLM behaviour studies highly accessible. To validate the utility and effectiveness of “MacBehaviour,“ we conducted three experiments on GPT-3.5 Turbo, Llama-2-7b-chat-hf, and Vicuna-1.5-13b, replicating the sound-gender association in LLMs. The results consistently demonstrated that these LLMs exhibit human-like tendencies to infer gender from novel personal names based on their phonology, as previously shown by Cai et al. (2024). In conclusion, “MacBehaviour” is a user-friendly R package that simplifies and standardises the experimental process for machine behaviour studies, offering a valuable tool for researchers in this field.

show abstract

Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models

Cited by 2 publications

References 72 publications

Active Use of Latent Constituency Representation in both Humans and Large Language Models

Active Use of Latent Constituency Representation in both Humans and Large Language Models

MacBehaviour: An R package for behavioural experimentation on large language models

Contact Info

Product

Resources

About