2020
DOI: 10.3390/s20247287
|View full text |Cite
|
Sign up to set email alerts
|

Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition

Abstract: This paper presents a novel method for integration of industrially-oriented human-robot speech communication and vision-based object recognition. Such integration is necessary to provide context for task-oriented voice commands. Context-based speech communication is easier, the commands are shorter, hence their recognition rate is higher. In recent years, significant research was devoted to integration of speech and gesture recognition. However, little attention was paid to vision-based identification of objec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2025
2025

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 36 publications
0
4
0
Order By: Relevance
“…Attentional mechanisms ( Hao et al, 2021b ) has aroused the interest of many researchers because it have fewer parameters, faster speed, and better results in important areas such as machine translation ( Hao et al, 2021a ), speech recognition ( Bahdanau et al, 2016 ; Rogowski et al, 2020 ), image recognition ( Wang et al, 2017 ), and gradually started to be applied in sEMG gesture recognition ( Hu J. et al, 2018 ; Jiang et al, 2019c ; Liu et al, 2021 ) Previous CNN gesture recognition models often do not give enough attention to the characteristics of the EMG signal and do not make full use of the temporal information ( Atzori et al, 2014b ; Atzori et al, 2016 ; Geng et al, 2016 ; Ketykó et al, 2019 ). Therefore, this paper introduces a convolutional attention mechanism into the EMG gesture recognition method and designs a one-dimensional convolutional attention module based on time and feature channels to make it more applicable to EMG gesture recognition.…”
Section: Methodsmentioning
confidence: 99%
“…Attentional mechanisms ( Hao et al, 2021b ) has aroused the interest of many researchers because it have fewer parameters, faster speed, and better results in important areas such as machine translation ( Hao et al, 2021a ), speech recognition ( Bahdanau et al, 2016 ; Rogowski et al, 2020 ), image recognition ( Wang et al, 2017 ), and gradually started to be applied in sEMG gesture recognition ( Hu J. et al, 2018 ; Jiang et al, 2019c ; Liu et al, 2021 ) Previous CNN gesture recognition models often do not give enough attention to the characteristics of the EMG signal and do not make full use of the temporal information ( Atzori et al, 2014b ; Atzori et al, 2016 ; Geng et al, 2016 ; Ketykó et al, 2019 ). Therefore, this paper introduces a convolutional attention mechanism into the EMG gesture recognition method and designs a one-dimensional convolutional attention module based on time and feature channels to make it more applicable to EMG gesture recognition.…”
Section: Methodsmentioning
confidence: 99%
“…By adeptly detecting and recognizing objects within the robot's working domain, the potential for collisions and accidents is effectively minimized. A notable study delves into the synthesis of vision-based object recognition and human-robot communication, shedding light on the profound importance of this synergy within industrial contexts (Rogowski et al, 2020).…”
Section: Introductionmentioning
confidence: 99%
“…This way of building maps can construct a more expressive map of the environment to assist the robot to better understand the environment as a whole, but it is not good for the robot to recognize the individuals in the environment. Robots cannot interact with individuals in the environment because they cannot distinguish between different individuals in the environment, which limits the application of intelligent robots to a certain extent [ 30 , 31 , 32 ]. The main contributions of this paper are as follows: Propose a method of target tracking combined with instance segmentation, which can segment the objects in an image and track each object to establish data associations for different frames.…”
Section: Introductionmentioning
confidence: 99%