2018 IEEE International Conference on Robotics and Biomimetics (ROBIO)
DOI: 10.1109/robio.2018.8664803

Hierarchical Reinforcement Learning Framework Towards Multi-Agent Navigation

Abstract: In this paper, we propose a navigation algorithm oriented toward multi-agent environments. The algorithm is expressed as a hierarchical framework that combines a Hidden Markov Model (HMM) with a Deep Reinforcement Learning (DRL) structure. For simplicity, we term our method the Hierarchical Navigation Reinforcement Network (HNRN). In the high-level architecture, we train an HMM to evaluate the agent's perception and obtain a score. According to this score, an adaptive control action is chosen. In the low-level architecture…
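
The abstract describes a two-level loop: a trained HMM scores the agent's current perception, and that score determines which low-level controller acts. Below is a minimal sketch of that dispatch logic, assuming an hmmlearn Gaussian HMM as the scorer and two pre-trained sub-policies; the class name, threshold, and policy names are illustrative assumptions, not the paper's implementation.

```python
# Minimal sketch of an HNRN-style hierarchy as described in the abstract.
# The HMM settings, the score threshold, and the two sub-policies are
# illustrative assumptions; the paper's actual components may differ.
import numpy as np
from hmmlearn.hmm import GaussianHMM  # assumed dependency for the HMM scorer

class HierarchicalNavigator:
    def __init__(self, pursuit_policy, avoidance_policy, risk_threshold=-50.0):
        # High level: HMM over recent perception vectors (e.g., lidar features).
        self.hmm = GaussianHMM(n_components=3, covariance_type="diag", n_iter=50)
        # Low level: two pre-trained DRL controllers to dispatch between.
        self.pursuit_policy = pursuit_policy      # drives toward the goal
        self.avoidance_policy = avoidance_policy  # prioritizes collision avoidance
        self.risk_threshold = risk_threshold      # hypothetical switching score

    def fit_evaluator(self, perception_history):
        # perception_history: (n_steps, n_features) array of past observations.
        self.hmm.fit(perception_history)

    def act(self, recent_window, state):
        # Score how familiar (hence safe) the recent perception window looks.
        score = self.hmm.score(recent_window)  # log-likelihood under the HMM
        if score >= self.risk_threshold:
            return self.pursuit_policy(state)   # low risk: pursue the target
        return self.avoidance_policy(state)     # high risk: avoid collisions
```

Using the HMM log-likelihood as the score means unfamiliar, and therefore risky, perception windows fall below the threshold and route control to the avoidance policy.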

Cited by 29 publications (15 citation statements)
References 21 publications
“…To simplify the decision-making process, raw sensor data can be directly used as input. Ding et al [59] studied the hierarchical RL method using lidar data as input. A high-level evaluation module is responsible for perceiving the overall environmental risks, and a low-level control module is responsible for making action decisions.…”
Section: Development
confidence: 99%
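
The statement above emphasizes raw lidar as the direct input and a high-level module that perceives overall risk. One plausible realization, sketched below, is to min-pool the raw scan into coarse sectors as the shared feature vector and squash the HMM log-likelihood into a bounded risk score; the sector count, clipping range, and logistic constants are assumptions, not values from the cited paper.

```python
# Sketch of "raw sensor data as input": compress a lidar scan into a
# compact feature vector, then map an HMM log-likelihood to a bounded
# risk score. All constants here are illustrative assumptions.
import numpy as np

def lidar_features(scan, n_sectors=36, max_range=10.0):
    """Min-pool a raw range scan (e.g., 360 beams) into coarse sectors."""
    scan = np.clip(np.asarray(scan, dtype=float), 0.0, max_range)
    sectors = np.array_split(scan, n_sectors)
    # Nearest obstacle per sector, normalized to [0, 1].
    return np.array([s.min() for s in sectors]) / max_range

def risk_score(log_likelihood, midpoint=-40.0, scale=10.0):
    """Map an HMM log-likelihood to (0, 1): low likelihood -> high risk."""
    return 1.0 / (1.0 + np.exp((log_likelihood - midpoint) / scale))
```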
“…For example, Long et al [4] used the PPO algorithm to train an obstacle-avoidance policy end-to-end on the Stage simulator. Ding et al [9] trained a joint DDPG and HMM algorithm on the Gazebo simulator [10] to solve multi-vehicle collaborative path planning. Tai et al [11] trained ADDPG on V-REP [12] to solve the Mapless Navigation problem, and Kahn et al [13] trained a model-based RL algorithm in OpenRAVE to solve obstacle avoidance.…”
Section: Related Work
confidence: 99%
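
For reference, the DDPG component attributed to Ding et al [9] follows the standard actor-critic update; a generic single update step is sketched below. The network call signatures, gamma, and the Polyak rate tau are common defaults, not the cited papers' configuration.

```python
# Generic single DDPG update step (not the cited papers' exact code).
# Shapes, gamma, and tau are illustrative defaults.
import torch
import torch.nn.functional as F

def ddpg_update(actor, critic, target_actor, target_critic,
                actor_opt, critic_opt, batch, gamma=0.99, tau=0.005):
    s, a, r, s2, done = batch  # state, action, reward, next state, done (floats)

    # Critic: regress Q(s, a) toward the bootstrapped TD target.
    with torch.no_grad():
        q_next = target_critic(s2, target_actor(s2))
        q_target = r + gamma * (1.0 - done) * q_next
    critic_loss = F.mse_loss(critic(s, a), q_target)
    critic_opt.zero_grad()
    critic_loss.backward()
    critic_opt.step()

    # Actor: ascend the critic's value of the actor's own actions.
    actor_loss = -critic(s, actor(s)).mean()
    actor_opt.zero_grad()
    actor_loss.backward()
    actor_opt.step()

    # Polyak-average the target networks toward the online networks.
    for net, target in ((actor, target_actor), (critic, target_critic)):
        for p, tp in zip(net.parameters(), target.parameters()):
            tp.data.mul_(1.0 - tau).add_(tau * p.data)
```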
“…Both Tai et al (2018) and Jin et al (2019) have shown that systems similar to the ones above can also be used to enable navigation in the presence of human pedestrians using RL and geometric sensor inputs. Ding et al (2018) showed that a similar capability can also be achieved using RL to train a system to choose between target pursuit and collision avoidance by incorporating a Hidden Markov Model (HMM) (Stratonovich 1965) into a hierarchical model. Considering the specific case in which some humans in the scene can be assumed to be companions, Li et al (2018) showed that end-to-end learned approaches could also enable Socially Concomitant Navigation (SCN), i.e., navigation in which the robot not only needs to avoid collisions as in previous work, but also needs to maintain a sense of affinity with respect to the motion of its companion.…”
Section: Local
confidence: 99%