2022
DOI: 10.1073/pnas.2106028118
|View full text |Cite
|
Sign up to set email alerts
|

Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents

Abstract: How do societies learn and maintain social norms? Here we use multiagent reinforcement learning to investigate the learning dynamics of enforcement and compliance behaviors. Artificial agents populate a foraging environment and need to learn to avoid a poisonous berry. Agents learn to avoid eating poisonous berries better when doing so is taboo, meaning the behavior is punished by other agents. The taboo helps overcome a credit assignment problem in discovering delayed health effects. Critically, introducing a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
22
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
5
1
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 34 publications
(23 citation statements)
references
References 44 publications
1
22
0
Order By: Relevance
“…3B, this concept can be easily captured by an agent since we provide the frequency of each solution as a feature. We speculate that passing through this sub-optimal strategy facilitates faster learning by providing additional options to copy and helping the agent learn the notion of copying much faster, similar to a recently reported phenomenon in artificial agents for social learning [35]. This is further supported by a delay in the learning process when the frequency feature is not provided (see Supplementary Information for the result where no solution frequency is given).…”
Section: Results a Default Environmentsupporting
confidence: 73%
See 2 more Smart Citations
“…3B, this concept can be easily captured by an agent since we provide the frequency of each solution as a feature. We speculate that passing through this sub-optimal strategy facilitates faster learning by providing additional options to copy and helping the agent learn the notion of copying much faster, similar to a recently reported phenomenon in artificial agents for social learning [35]. This is further supported by a delay in the learning process when the frequency feature is not provided (see Supplementary Information for the result where no solution frequency is given).…”
Section: Results a Default Environmentsupporting
confidence: 73%
“…Different from previous studies [32,34,35], the payoff of our work is given by a fixed landscape, not from a game between agents with a payoff matrix. Our work suggests that social learning can emerge even when explicit payoff interaction between agents is not present, which resonates with the importance of vicarious reinforcement [43] in social learning theory.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Although learning conformity is consistently associated with the self-awareness of university students, school policy pressure, social pressure, self-development, and academic success, research linking subcontracts of learning conformity and a specific perception of self-efficacy is more limited [22][23][24]. There is no research regarding the relationship between general self-efficacy and learning conformity in the current literature.…”
Section: Self-efficacy Researchmentioning
confidence: 92%
“…'in-the-wild,' particular instances during training episodes can be analysed as quasi-experiments. For example, we can aggregate all instances where the AI system is confronted with a choice between two colours, to analyse whether an AI system has learned to associate different colours with different values (Köster et al, 2022). Such simple in-the-lab or in-the-wild tests typically report a binary pass/fail outcome.…”
Section: The Role Of Analysis In Ai Researchmentioning
confidence: 99%