An Embodied Agent Learning Affordances With Intrinsic Motivations and Solving Extrinsic Tasks With Attention and One-Step Planning

Baldassarre, Gianluca; Lord, W.; Granato, Giovanni; Santucci, Vieri Giuliano

doi:10.3389/fnbot.2019.00045

Cited by 13 publications

(14 citation statements)

References 61 publications

“…The remaining components, computationally simplified, are relevant to support the interaction of the agent with the simulated card environment (see 62 for the simple simulator used to this purpose). The visual sensor is formed by an RGBY matrix that encodes a small portion of the environment, approximately covering one card per time, and represents the input following an attention-focused saccade onto one specific card.…”

Section: Methods (mentioning)

confidence: 99%

A computational model of language functions in flexible goal-directed behaviour

Granato, Borghi, Baldassarre (2020), Sci Rep. Self-citation.

The function of language in high-order goal-directed human cognition is an important topic at the centre of current debates. Experimental evidence shows that inner speech, representing a self-directed form of language, empowers cognitive processes such as working memory, perception, categorization, and executive functions. Here we study the relations between inner speech and processes like feedback processing and cognitive flexibility. To this aim we propose a computational model that controls an artificial agent who uses inner speech to internally manipulate its representations. The agent is able to reproduce human behavioural data collected during the solution of the Wisconsin Card Sorting test, a neuropsychological test measuring cognitive flexibility, both in the basic condition and when a verbal shadowing protocol is used. The components of the model were systematically lesioned to clarify the specific impact of inner speech on the agent’s behaviour. The results indicate that inner speech improves the efficiency of internal representation manipulation. Specifically, it makes the representations linked to specific visual features more disentangled, thus improving the agent’s capacity to engage/disengage attention on stimulus features after positive/negative action outcomes. Overall, the model shows how inner speech could improve goal-directed internal manipulation of representations and enhance behavioural flexibility.

“…The concept of Intrinsic Motivations (IMs) is borrowed from biological [9] and psychological literature [10] describing how novel or unexpected "neutral" stimuli, as well as the perception of control over the environment, can generate learning processes even in the absence of assigned rewards or tasks. In the computational literature, IMs have been implemented in artificial agents to foster their autonomy in gathering knowledge [11], [12], learning repertoire of skills [13], [14], [15], [16], exploiting affordances from the environment [17], [18], [19], selecting their own tasks [20], [21], [22], and even boosting imitation learning techniques [23].…”

Section: Introduction (mentioning)

confidence: 99%

Autonomous Reinforcement Learning of Multiple Interrelated Tasks

Santucci, Baldassarre, Cartoni (2019), 2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob). Self-citation.

Autonomous multiple tasks learning is a fundamental capability to develop versatile artificial agents that can act in complex environments. In real-world scenarios, tasks may be interrelated (or "hierarchical") so that a robot has to first learn to achieve some of them to set the preconditions for learning other ones. Even though different strategies have been used in robotics to tackle the acquisition of interrelated tasks, in particular within the developmental robotics framework, autonomous learning in this kind of scenarios is still an open question. Building on previous research in the framework of intrinsically motivated open-ended learning, in this work we describe how this question can be addressed working on the level of task selection, in particular considering the multiple interrelated tasks scenario as an MDP where the system is trying to maximise its competence over all the tasks.

“…One advantage, employed here, is that the world model can directly select actions to perform; instead, previous models [38,39] need an additional mechanism selecting actions on the basis of the state sequence produced by the world model. A second advantage is that for each environment state the world model can suggest the selection of actions that have a potential relevance in that context, rather than any action (this captures the popular idea of affordance in cognitive science [65,66]). A last advantage could be the easier learning (and understanding) of state-action sequences directed to a goal produced by other agents; indeed, the world model would be neutral with respect to the fact that actions are performed by another part of the brain or by another agent.…”

Section: Discussion of the General Features of the Model (mentioning)

confidence: 99%

A generative spiking neural-network model of goal-directed behaviour and one-step planning

Basanisi, Brovelli, Cartoni et al. (2020), PLoS Comput Biol. Self-citation.

In mammals, goal-directed and planning processes support flexible behaviour used to face new situations that cannot be tackled through more efficient but rigid habitual behaviours. Within the Bayesian modelling approach of brain and behaviour, models have been proposed to perform planning as probabilistic inference but this approach encounters a crucial problem: explaining how such inference might be implemented in brain spiking networks. Recently, the literature has proposed some models that face this problem through recurrent spiking neural networks able to internally simulate state trajectories, the core function at the basis of planning. However, the proposed models have relevant limitations that make them biologically implausible, namely their world model is trained ‘off-line’ before solving the target tasks, and they are trained with supervised learning procedures that are biologically and ecologically not plausible. Here we propose two novel hypotheses on how brain might overcome these problems, and operationalise them in a novel architecture pivoting on a spiking recurrent neural network. The first hypothesis allows the architecture to learn the world model in parallel with its use for planning: to this purpose, a new arbitration mechanism decides when to explore, for learning the world model, or when to exploit it, for planning, based on the entropy of the world model itself. The second hypothesis allows the architecture to use an unsupervised learning process to learn the world model by observing the effects of actions. The architecture is validated by reproducing and accounting for the learning profiles and reaction times of human participants learning to solve a visuomotor learning task that is new for them. Overall, the architecture represents the first instance of a model bridging probabilistic planning and spiking-processes that has a degree of autonomy analogous to the one of real organisms.
