2023
DOI: 10.1038/s42256-022-00583-4
|View full text |Cite
|
Sign up to set email alerts
|

Phy-Q as a measure for physical reasoning intelligence

Abstract: Humans are well versed in reasoning about the behaviours of physical objects and choosing actions accordingly to accomplish tasks, while this remains a major challenge for artificial intelligence. To facilitate research addressing this problem, we propose a new testbed that requires an agent to reason about physical scenarios and take an action appropriately. Inspired by the physical knowledge acquired in infancy and the capabilities required for robots to operate in real-world environments, we identify 15 ess… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 41 publications
0
4
0
Order By: Relevance
“…In recent years, researchers have developed various environments as benchmarks and testbeds to evaluate the physical reasoning capabilities of AI agents. These environments are mainly based on tasks that involve taking actions in a physical environment (Xue et al 2023;Bakhtin et al 2019), reasoning about images of physical scenarios (Hong et al 2021;Wolf 2020), and reasoning about videos of physical scenarios (Riochet et al 2020;Yi et al 2020). Among them, task generation methods used in action-based environments are related to this work as we also focus on generating tasks that an agent can interact with and take actions to solve them.…”
Section: Physical Reasoning Benchmarks and Testbedsmentioning
confidence: 99%
See 3 more Smart Citations
“…In recent years, researchers have developed various environments as benchmarks and testbeds to evaluate the physical reasoning capabilities of AI agents. These environments are mainly based on tasks that involve taking actions in a physical environment (Xue et al 2023;Bakhtin et al 2019), reasoning about images of physical scenarios (Hong et al 2021;Wolf 2020), and reasoning about videos of physical scenarios (Riochet et al 2020;Yi et al 2020). Among them, task generation methods used in action-based environments are related to this work as we also focus on generating tasks that an agent can interact with and take actions to solve them.…”
Section: Physical Reasoning Benchmarks and Testbedsmentioning
confidence: 99%
“…Among them, task generation methods used in action-based environments are related to this work as we also focus on generating tasks that an agent can interact with and take actions to solve them. Examples of recent action-based environments include Phy-Q (Xue et al 2023), PHYRE (Bakhtin et al 2019), Virtual Tools , OGRE , CausalWorld (Ahmed et al 2021), and RL-Bench (James et al 2020).…”
Section: Physical Reasoning Benchmarks and Testbedsmentioning
confidence: 99%
See 2 more Smart Citations