2011
DOI: 10.1016/j.neucom.2011.06.009
|View full text |Cite
|
Sign up to set email alerts
|

The distribution of rewards in sensorimotor maps acquired by cognitive robots through exploration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
1
1
1

Relationship

3
4

Authors

Journals

citations
Cited by 8 publications
(11 citation statements)
references
References 38 publications
0
11
0
Order By: Relevance
“…Mohan, Morasso and Metta [22] built on Toussaint sensorimotor map [18] in developing a mechanism that allows a GNOSYS robot to learn 'when' to optimize 'what constraint' while realising a spatial goal and to be able to push a ball intelligently to the corners of a table while avoiding traps in arbitrary positions on the table. A key contribution of the paper is in the design of a bifurcation parameter, which measures the anticipatory ability of the model.…”
Section: Review Of Related Workmentioning
confidence: 99%
“…Mohan, Morasso and Metta [22] built on Toussaint sensorimotor map [18] in developing a mechanism that allows a GNOSYS robot to learn 'when' to optimize 'what constraint' while realising a spatial goal and to be able to push a ball intelligently to the corners of a table while avoiding traps in arbitrary positions on the table. A key contribution of the paper is in the design of a bifurcation parameter, which measures the anticipatory ability of the model.…”
Section: Review Of Related Workmentioning
confidence: 99%
“…2009;Hesslow 2012). The proposed neural framework further exploits several interesting computational conce pts in literature mainly, (a) the idea of growing neural gas (Fritzke 1995) which is an extension of selforganizing maps (Kohonen et al 2001) for internal representation of the peripersonal space; (b) Ne ural field dynamics (Amari 1977;Erlhagen & Schöner 2002) to organize goal directed reasoning and cooperative behavior in shared spaces; (c) the idea of reward fields (Toussaint 2006) for reasoning about action sequences based on the unfolding spatio-temporal dynamics of the task (d) learning of such behavior-modulating reward fields through experience gained by exploration (Mohan, Morasso, Metta, et al 2011) (e) the idea of Passive Motion Paradigm as an internal representation of the bodi e s of the agents for simulation, reasoning and execution of goal directed actions. The architecture presented here further builds up on (Toussaint 2006;Mohan, Morasso, Metta, et al 2011) by hypothesizing the synergistic interaction between two coupled internal models for reasoning and control, employing the architecture on multiple embodiments to reasoning and goal oriented cooperation in a real world industrial assembly line.…”
Section: A Case For Coupled Memory Representations Of Body and Space mentioning
confidence: 99%
“…The proposed neural framework further exploits several interesting computational conce pts in literature mainly, (a) the idea of growing neural gas (Fritzke 1995) which is an extension of selforganizing maps (Kohonen et al 2001) for internal representation of the peripersonal space; (b) Ne ural field dynamics (Amari 1977;Erlhagen & Schöner 2002) to organize goal directed reasoning and cooperative behavior in shared spaces; (c) the idea of reward fields (Toussaint 2006) for reasoning about action sequences based on the unfolding spatio-temporal dynamics of the task (d) learning of such behavior-modulating reward fields through experience gained by exploration (Mohan, Morasso, Metta, et al 2011) (e) the idea of Passive Motion Paradigm as an internal representation of the bodi e s of the agents for simulation, reasoning and execution of goal directed actions. The architecture presented here further builds up on (Toussaint 2006;Mohan, Morasso, Metta, et al 2011) by hypothesizing the synergistic interaction between two coupled internal models for reasoning and control, employing the architecture on multiple embodiments to reasoning and goal oriented cooperation in a real world industrial assembly line. Figure 12: A-D) Show the typically different possible setups in which objects can be lying in the workspace; A) shows a structured scenario where a fuse box with 3 fuses is near each robot and parallel assembly c a n be ea s i l y realized; B) An unstructured setup where objects are lying in random places and reasoning is essential for successful joint assembly; C) An unstructured and redundant setup where there are more than two fuse boxes to work with; D) A scenario where none of the two robots can reach all the desired objects, so the assembly is not realizable in a direct way without mutual cooperation and spatial reasoning; E) shows the robotic s platform used for experiments with two robots, the camera and the workspace tray.…”
Section: A Case For Coupled Memory Representations Of Body and Space mentioning
confidence: 99%
“…Chimps can easily solve this problem of combinatorial tool use (Maravita and Iriki, 2004) and since the observation of their behavior rules out the possibility of trial-and-error, the most likely interpretation is as follows: (1) the chimp has an abstract concept of a stick-like object which must have similar computational properties to the body schema in order to be integrated with it, at least temporarily in the course of the task (Iriki et al, 1996); (2) the recognition of crucial “affordances,” such as the fact that the food and the long stick are unreachable and the short stick is reachable and long enough to get the long stick, is carried out by means of covert, “imagined” movements. In a previous work (Mohan and Morasso, 2007, 2011d) we have shown how adding a reasoning system on top of the PMP-based real/mental action generation system can enable a cognitive robot (GNOSYS) to autonomously generate goal directed plans in such scenarios (where use of tools is obligatory for achieving the goal). Figure 8 illustrates the sequences of (real and virtual) actions initiated by iCub using different task-specific PMP networks (illustrated in different examples so far) when it exploits a long green stick as a tool to reach (an otherwise unreachable) red cylinder.…”
Section: Oct and Pmp As Computational Theoriesmentioning
confidence: 99%