Towards Generalization in Target-Driven Visual Navigation by Using Deep Reinforcement Learning

Devo, Alessandro; Mezzetti, Giacomo; Costante, Gabriele; Fravolini, Mario Luca; Valigi, Paolo

doi:10.1109/tro.2020.2994002

Cited by 77 publications

(38 citation statements)

References 35 publications

(8 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In previous research [45] , the DRL algorithm was found to be generalizable to new scenarios, but at the expense of a decrease in performance and the need to fine-tune the network. To improve the generalization ability of the visual navigation algorithm, Devo et al [52] proposed the importance weighted actor-learner architecture, a new framework comprising object localization and navigation networks. The object localization network takes the target image and current frame as input, and outputs a six-dimensional vector that represents the position of the target in the current frame.…”

Section: Developmentmentioning

confidence: 99%

See 1 more Smart Citation

Deep reinforcement learning based mobile robot navigation: A review

Zhu

Zhang

2021

Tsinghua Sci. Technol.

256

View full text Add to dashboard Cite

Navigation is a fundamental problem of mobile robots, for which Deep Reinforcement Learning (DRL) has received significant attention because of its strong representation and experience learning abilities. There is a growing trend of applying DRL to mobile robot navigation. In this paper, we review DRL methods and DRL-based navigation frameworks. Then we systematically compare and analyze the relationship and differences between four typical application scenarios: local obstacle avoidance, indoor navigation, multi-robot navigation, and social navigation. Next, we describe the development of DRL-based navigation. Last, we discuss the challenges and some possible solutions regarding DRL-based navigation.

show abstract

Section: Developmentmentioning

confidence: 99%

“…In addition to reducing the state input size, Devo et al [52] designed a two-network architecture comprising an object localization network and a navigation network to solve the generalization problem. This architecture reduces the state-space dimension of the navigation network by preprocessing images through the object localization network.…”

Section: Solutionmentioning

confidence: 99%

Deep reinforcement learning based mobile robot navigation: A review

Zhu

Zhang

2021

Tsinghua Sci. Technol.

256

View full text Add to dashboard Cite

show abstract

“…Generalization across environments is discussed in [12]. The authors trained the agent in domain-randomized mazelike environments and experimented with a robot in a small maze.…”

Section: Related Workmentioning

confidence: 99%

“…• The simulator often provides the agent with features that are not available in the real world: the segmentation masks [4], [6], [10], distance to the goal, stopping signal [4]- [7], [11], [12], [14], etc. This information is given either as one of the agent's inputs [4], [10] or in the form of an auxiliary task [6].…”

mentioning

confidence: 99%

See 1 more Smart Citation

Visual Navigation in Real-World Indoor Environments Using End-to-End Deep Reinforcement Learning

Kulhánek

Derner

Babuška

2021

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

Visual navigation is essential for many applications in robotics, from manipulation, through mobile robotics to automated driving. Deep reinforcement learning (DRL) provides an elegant map-free approach integrating image processing, localization, and planning in one module, which can be trained and therefore optimized for a given environment. However, to date, DRL-based visual navigation was validated exclusively in simulation, where the simulator provides information that is not available in the real world, e.g., the robot's position or segmentation masks. This precludes the use of the learned policy on a real robot. Therefore, we present a novel approach that enables a direct deployment of the trained policy on real robots. We have designed a new powerful simulator capable of domain randomization. To facilitate the training, we propose visual auxiliary tasks and a tailored reward scheme. The policy is fine-tuned on images collected from real-world environments. We have evaluated the method on a mobile robot in a real office environment. The training took approximately 30 hours on a single GPU. In 30 navigation experiments, the robot reached a 0.3-meter neighbourhood of the goal in more than 86.7 % of cases. This result makes the proposed method directly applicable to tasks like mobile manipulation.Index Terms-Vision-based navigation, reinforcement learning, deep learning methods. I. INTRODUCTIONV ISION-BASED navigation is essential for a broad range of robotic applications, from industrial and service robotics to automated driving. The wide-spread use of this technique will be further stimulated by the availability of lowcost cameras and high-performance computing hardware.Conventional vision-based navigation methods usually build a map of the environment and then use planning to reach

show abstract