Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

Nguyen, Anh; Nguyen, Ngoc Duy; Tran, Kim Phuc; Tjiputra, Erman; Tran, Quang D.

doi:10.1109/iros45743.2020.9341494

Cited by 36 publications

(23 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recently, automatic emotion recognition has gained a lot of attention in both academia and industry [49]. It enables a wide range of novel applications in different domains, ranging from healthcare [15], surveillance [9] to robotics [42] and human-computer interaction [11].…”

Section: Introductionmentioning

confidence: 99%

Global-local attention for emotion recognition

Nguyen

et al. 2021

Neural Comput & Applic

Self Cite

View full text Add to dashboard Cite

Human emotion recognition is an active research area in artificial intelligence and has made substantial progress over the past few years. Many recent works mainly focus on facial regions to infer human affection, while the surrounding context information is not effectively utilized. In this paper, we proposed a new deep network to effectively recognize human emotions using a novel global-local attention mechanism. Our network is designed to extract features from both facial and context regions independently, then learn them together using the attention module. In this way, both the facial and contextual information is used to infer human emotions, therefore enhancing the discrimination of the classifier. The intensive experiments show that our method surpasses the current state-of-the-art methods on recent emotion datasets by a fair margin. Qualitatively, our global-local attention module can extract more meaningful attention maps than previous methods. The source code and trained model of our network are available at https://github.com/minhnhatvt/glamor-net.

show abstract

Section: Introductionmentioning

confidence: 99%

Global-local attention for emotion recognition

Nguyen

et al. 2021

Neural Comput & Applic

Self Cite

View full text Add to dashboard Cite

show abstract

“…Nguyen et al [75] propose a solution to directly map the multi-modal input sensory data to the output steering commands. A three-branched network architecture is introduced to process and fuse the three sensing modalities employed, namely 2D laser scans, and RGB-D camera data (i.e., colored images and point clouds).…”

Section: End-to-end Approachesmentioning

confidence: 99%

“…In some cases, the manufacturers offer software development kits that even allow real-time environment mapping, with both geometry and visual appearance. This kind of environment sensing has already been employed in several works reported in this survey [34,51,75] and is much less costly than 3D laser scanners. However, depending on the sensing technique adopted, there are some application scenarios in which RGB-D cameras cannot work whereas LIDARs can, such as in the dark.…”

Section: The Breakthrough Of Vision: Paradigms and Sensing Technology Advancementsmentioning

confidence: 99%

“…Only in [51] a particular attention is again given to defining a different kind of features, in order to enhance terrain classification outcomes. Other interesting approaches have been presented in [72,75], where CNNs are used to extract geometric features directly from 3D point clouds, depth maps and elevation models. Newer approaches to meaningful geometric feature extraction could probably be explored, likely by exploiting deep neural networks and trying to infer those non-trivial features, through saliency detection [19] for instance.…”

Section: From Features To Datamentioning

confidence: 99%

“…As already pointed out in [18,19,90], simulation of robot behaviors, sensors and algorithms do play a crucial role in the context of robot learning. Several works among the ones in this survey extensively used simulated data to train their models [49,53,57,75]. In particular, the most beneficial aspect of simulations which has been exploited the most is the possibility to easily label training data.…”

Section: The Importance Of Simulationmentioning

confidence: 99%

See 2 more Smart Citations

Learning-Based Methods of Perception and Navigation for Ground Vehicles in Unstructured Environments: A Review

Guastella

Muscato

2020

Sensors

View full text Add to dashboard Cite

The problem of autonomous navigation of a ground vehicle in unstructured environments is both challenging and crucial for the deployment of this type of vehicle in real-world applications. Several well-established communities in robotics research deal with these scenarios such as search and rescue robotics, planetary exploration, and agricultural robotics. Perception plays a crucial role in this context, since it provides the necessary information to make the vehicle aware of its own status and its surrounding environment. We present a review on the recent contributions in the robotics literature adopting learning-based methods to solve the problem of environment perception and interpretation with the final aim of the autonomous context-aware navigation of ground vehicles in unstructured environments. To the best of our knowledge, this is the first work providing such a review in this context.

show abstract

Autonomous Navigation with Mobile Robots Using Deep Learning and the Robot Operating System

Nguyen

Tran²

2021

Studies in Computational Intelligence

View full text Add to dashboard Cite

Autonomous Navigation in Complex Environments with Deep Multimodal Fusion Network

Cited by 36 publications

References 43 publications

Global-local attention for emotion recognition

Global-local attention for emotion recognition

Learning-Based Methods of Perception and Navigation for Ground Vehicles in Unstructured Environments: A Review

Autonomous Navigation with Mobile Robots Using Deep Learning and the Robot Operating System

Contact Info

Product

Resources

About