2020
DOI: 10.1007/978-3-030-64313-3_8
|View full text |Cite
|
Sign up to set email alerts
|

How to Reduce Computation Time While Sparing Performance During Robot Navigation? A Neuro-Inspired Architecture for Autonomous Shifting Between Model-Based and Model-Free Learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

2
36
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 10 publications
(38 citation statements)
references
References 15 publications
2
36
0
Order By: Relevance
“…Another important current question is whether uncertainty alone is sufficient to arbitrate between MB and MF systems [49], or whether, when the two systems are equally uncertain, the agent should rely on the least computationally costly one [50]. If we want the agent to be initially agnostic about which system is more costly, and if we even want the agent to be able to potentially arbitrate between N different learning systems with different a priori unknown computational characteristics, then one proposal is simply to measure the average time taken by each system when it has to make decisions [50]. In some of the robotic experiments that we will describe in the next section, we found that this principle works robustly, enables to produce the ideal MB-to-MF sequence, not only during initial learning but also after a task change.…”
Section: Neuroscience Studies Of the Coordination Of Model-based And Model-free Reinforcement Learningmentioning
confidence: 99%
See 4 more Smart Citations
“…Another important current question is whether uncertainty alone is sufficient to arbitrate between MB and MF systems [49], or whether, when the two systems are equally uncertain, the agent should rely on the least computationally costly one [50]. If we want the agent to be initially agnostic about which system is more costly, and if we even want the agent to be able to potentially arbitrate between N different learning systems with different a priori unknown computational characteristics, then one proposal is simply to measure the average time taken by each system when it has to make decisions [50]. In some of the robotic experiments that we will describe in the next section, we found that this principle works robustly, enables to produce the ideal MB-to-MF sequence, not only during initial learning but also after a task change.…”
Section: Neuroscience Studies Of the Coordination Of Model-based And Model-free Reinforcement Learningmentioning
confidence: 99%
“…A more recent series of robotic experiments with the same research goal (i.e., assessing the efficience and robustness of bio-inspired coordination principles of MB and MF learning) has been presented in [61,62,47] and later in [60,50,63]. First, [61,62,47] compared different coordination principles, including methods coming from ensemble learning [51] in several different simulated robotic tasks.…”
Section: Robotic Tests Of Bio-inspired Principles For the Coordination Of Model-based And Model-free Reinforcement Learningmentioning
confidence: 99%
See 3 more Smart Citations