2021
DOI: 10.3389/fnbot.2021.642780
|View full text |Cite
|
Sign up to set email alerts
|

Active Vision for Robot Manipulators Using the Free Energy Principle

Abstract: Occlusions, restricted field of view and limited resolution all constrain a robot's ability to sense its environment from a single observation. In these cases, the robot first needs to actively query multiple observations and accumulate information before it can complete a task. In this paper, we cast this problem of active vision as active inference, which states that an intelligent agent maintains a generative model of its environment and acts in order to minimize its surprise, or expected free energy accord… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2
1

Relationship

4
4

Authors

Journals

citations
Cited by 22 publications
(20 citation statements)
references
References 42 publications
0
20
0
Order By: Relevance
“…Similar generative models can be used for learning machine vision using pixel observations (Eslami et al, 2018 ; Van de Maele et al, 2021b ). In this case, the system is trained to make inferences about the scene s , given images o t and corresponding absolute viewpoints v t .…”
Section: Methodsmentioning
confidence: 99%
“…Similar generative models can be used for learning machine vision using pixel observations (Eslami et al, 2018 ; Van de Maele et al, 2021b ). In this case, the system is trained to make inferences about the scene s , given images o t and corresponding absolute viewpoints v t .…”
Section: Methodsmentioning
confidence: 99%
“…rotation and translation), moving the camera viewpoint. This setup closely mimicks a robot manipulator with an in-hand camera, but without kinematic constraints [26].…”
Section: Methodsmentioning
confidence: 99%
“…However, recent work also focused on active vision. In [26] a generative model learning representations of a whole 3D scene was used for an active inference agent, whereas in [2] an explicit what and where stream were modeled for classifying MNIST digits.…”
Section: Related Workmentioning
confidence: 99%
“…In the active inference literature, multivariate Gaussian (also known as normal) distributions with a diagonal covariance matrix have been largely adopted since the initial works on VAEs [ 32 , 33 , 68 , 86 ]. Similarly, numerous latent state space models have adopted a Gaussian structure of their latent space [ 48 , 49 , 87 , 88 ], but also more complex mixture models have been proposed [ 50 ].…”
Section: Variational World Modelsmentioning
confidence: 99%