2016
DOI: 10.1007/978-3-319-47605-6_5
|View full text |Cite
|
Sign up to set email alerts
|

To Explore or to Exploit? Learning Humans’ Behaviour to Maximize Interactions with Them

Abstract: Abstract. Assume a robot operating in a public space (e.g., a library, a museum) and serving visitors as a companion, a guide or an information stand. To do that, the robot has to interact with humans, which presumes that it actively searches for humans in order to interact with them. This paper addresses the problem how to plan robot's actions in order to maximize the number of such interactions in the case human behavior is not known in advance. We formulate this problem as the exploration/exploitation probl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
4
1
1

Relationship

5
1

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 23 publications
0
5
0
Order By: Relevance
“…Since there are multiple options how to address the service scheduling problem and verification of each option would take at least 4 weeks, we already used the temporal models learned by the robot in the deployment to create a dynamic simulation of the deployment environment. Using this simulator, we tested over 50 different scheduling strategies, service utility functions and path planning policies and we found out that a more complex utility function in combination with distance-aware path planning can double the number of potential interactions [14]. Thus, for the next deployment of the robot, implemented these improvements and we will compare them to the original scheduling method.…”
Section: Discussionmentioning
confidence: 99%
“…Since there are multiple options how to address the service scheduling problem and verification of each option would take at least 4 weeks, we already used the temporal models learned by the robot in the deployment to create a dynamic simulation of the deployment environment. Using this simulator, we tested over 50 different scheduling strategies, service utility functions and path planning policies and we found out that a more complex utility function in combination with distance-aware path planning can double the number of potential interactions [14]. Thus, for the next deployment of the robot, implemented these improvements and we will compare them to the original scheduling method.…”
Section: Discussionmentioning
confidence: 99%
“…The initial body of work addressing the problem of data collection for MoDs is connected to the work of Krajnik et al (2017). The key publications are Santos et al, 2015Santos et al, , 2017 and Kulich et al (2016). All the listed publications attempt to address the problem of efficient data acquisition while streaking the balance between exploration and exploitation.…”
Section: Task Planningmentioning
confidence: 99%
“…Considering the ability of a robot that uses FreMEn to recognise “when” it observed the most informative situation, the novel information-based Monte-Carlo scheduler for exploration was proposed ( Santos et al, 2016 ). Eventualy, different exploration strategies ( Krajník et al, 2015b ; Santos et al, 2017 ; Molina et al, 2019 ) and exploration–exploitation dilema ( Kulich et al, 2016 ) were studied. In the study by ( Jovan et al, 2016 ), FreMEn was redefined into the Addition Amplitude Model (AAM).…”
Section: Related Workmentioning
confidence: 99%
“…There are a lot of different opinions as to what criterion influences the acceptability of a robot in human society most ( Talebpour et al, 2015 ; Kostavelis et al, 2017 ). Besides, the acceptability need not necessarily be the only point of view that defines the quality of performing the task ( Krajník et al, 2015a ; Fentanes et al, 2015 ; Kulich et al, 2016 ).…”
Section: Generalized Natural Criterionmentioning
confidence: 99%