2022
DOI: 10.1126/science.ade9097
|View full text |Cite
|
Sign up to set email alerts
|

Human-level play in the game ofDiplomacyby combining language models with strategic reasoning

Abstract: Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce Cicero, the first AI agent to achieve human-level performance in Diplomacy , a strategy game involving both cooperation and competition that emphasizes natural language negotiation and tactical coordination between seven players. Cicero integrates a language model with planni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
47
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 102 publications
(47 citation statements)
references
References 61 publications
0
47
0
Order By: Relevance
“…Additionally, while current AI language models do not produce statements reaching beyond the content of their training sets, we should ask ourselves whether this is a fundamental feature of the technology or just a temporary limitation. Recent breakthroughs in the use of AI in complex strategic games have highlighted the surprising ease with which AI can outcompete humans in problems considered intractable with computational approaches. , Similar breakthroughs in the use of AI for scientific progress might come from the combination of (i) precise goals, i.e., a clear definition of what we consider a successful scientific observation, (ii) algorithms capable of efficiently optimizing its output for these goals, and (iii) structured and accessible scientific data. In this line of thought, we can imagine AI systems proposing new experiments and new descriptions of observed phenomena and arranging data in figures to support their conclusions.…”
mentioning
confidence: 99%
“…Additionally, while current AI language models do not produce statements reaching beyond the content of their training sets, we should ask ourselves whether this is a fundamental feature of the technology or just a temporary limitation. Recent breakthroughs in the use of AI in complex strategic games have highlighted the surprising ease with which AI can outcompete humans in problems considered intractable with computational approaches. , Similar breakthroughs in the use of AI for scientific progress might come from the combination of (i) precise goals, i.e., a clear definition of what we consider a successful scientific observation, (ii) algorithms capable of efficiently optimizing its output for these goals, and (iii) structured and accessible scientific data. In this line of thought, we can imagine AI systems proposing new experiments and new descriptions of observed phenomena and arranging data in figures to support their conclusions.…”
mentioning
confidence: 99%
“…To win the wargame Diplomacy, players need to negotiate, form alliances, and become skilled at deception to win control of the game's economic and military resources. AI researchers have trained Meta's AI agent Cicero, an expert manipulator, to do the same [69]. It would cooperate with a human player, then change its plan and backstab them.…”
Section: Objectives Cannot Select Against All Deceptionmentioning
confidence: 99%
“…e.g. RLHF [36,55,[178][179][180][181][182]182], RLHP [183], RLAIF [36], RL search algorithms MCTS [174,[184][185][186] or DiL-piKL [187] or PPO with human feedback in Diplomacy [187], WebGpt [56], InstructGPT [178], chatGPT, GopherCite [183].…”
Section: Strengthsmentioning
confidence: 99%