Real-Time and Embedded Detection of Hand Gestures with an IMU-Based Glove

Mummadi, Chaithanya Kumar; Leo, Frederic Philips Peter; Verma, Keshav Deep; Kasireddy, Shivaji; Scholl, Philipp M.; Kempfle, Jochen; Laerhoven, Kristof Van

doi:10.3390/informatics5020028

Cited by 77 publications

(37 citation statements)

References 25 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Therefore, recently, hand gesture recognition has motivated new technologies in the area of computer vision. Previous studies have been proposed to solve hand gesture recognition tasks such as the glove-based approach [37], device-related methods [38] and the data glove-based method to address the issue of external sensors that enable us to monitor a user's hand motions more frequently [39]. Apart from such methods, nowadays, deep learning-based models are utilized to solve the hand gesture recognition and classification problems more efficiently and accurately [40,41].…”

Section: Related Workmentioning

confidence: 99%

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

2020

View full text Add to dashboard Cite

Video classification is an essential process for analyzing the pervasive semantic information of video content in computer vision. Traditional hand-crafted features are insufficient when classifying complex video information due to the similarity of visual contents with different illumination conditions. Prior studies of video classifications focused on the relationship between the standalone streams themselves. In this paper, by leveraging the effects of deep learning methodologies, we propose a two-stream neural network concept, named state-exchanging long short-term memory (SE-LSTM). With the model of spatial motion state-exchanging, the SE-LSTM can classify dynamic patterns of videos using appearance and motion features. The SE-LSTM extends the general purpose of LSTM by exchanging the information with previous cell states of both appearance and motion stream. We propose a novel two-stream model Dual-CNNSELSTM utilizing the SE-LSTM concept combined with a Convolutional Neural Network, and use various video datasets to validate the proposed architecture. The experimental results demonstrate that the performance of the proposed two-stream Dual-CNNSELSTM architecture significantly outperforms other datasets, achieving accuracies of 81.62%, 79.87%, and 69.86% with hand gestures, fireworks displays, and HMDB51 datasets, respectively. Furthermore, the overall results signify that the proposed model is most suited to static background dynamic patterns classifications.

show abstract

Section: Related Workmentioning

confidence: 99%

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

2020

View full text Add to dashboard Cite

show abstract

“…where, i t , f t , o t , z t represent the input gate, forget gate, output gate, and cell gate respectively. c t and h t are memory and output activation at time t. The Equations (10), (11), (13) and (14) are the formulas for forget, cell, output gates and hidden state.…”

Section: Spatio-temporal Feature Learningmentioning

confidence: 99%

“…Even though most of such glove-based systems focusing on sensors, these external sensors enable to observe the user's hand always. To address this drawback, a glove-based concept which utilizes the data gloves for human-computer interaction has proposed [13]. Besides the study [14] evaluate the performance of a wearable gesture recognition system that captures hand, finger and arm.…”

Section: Introductionmentioning

confidence: 99%

Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model

Hakim

Shih

Arachchi

et al. 2019

Sensors

View full text Add to dashboard Cite

With the recent growth of Smart TV technology, the demand for unique and beneficial applications motivates the study of a unique gesture-based system for a smart TV-like environment. Combining movie recommendation, social media platform, call a friend application, weather updates, chatting app, and tourism platform into a single system regulated by natural-like gesture controller is proposed to allow the ease of use and natural interaction. Gesture recognition problem solving was designed through 24 gestures of 13 static and 11 dynamic gestures that suit to the environment. Dataset of a sequence of RGB and depth images were collected, preprocessed, and trained in the proposed deep learning architecture. Combination of three-dimensional Convolutional Neural Network (3DCNN) followed by Long Short-Term Memory (LSTM) model was used to extract the spatio-temporal features. At the end of the classification, Finite State Machine (FSM) communicates the model to control the class decision results based on application context. The result suggested the combination data of depth and RGB to hold 97.8% of accuracy rate on eight selected gestures, while the FSM has improved the recognition rate from 89% to 91% in a real-time performance.

show abstract

“…For IMU-based approaches, many use glove-mounted sensors. Mummadi et al [12] used an IMU-based glove for realtime sign language recognition. They used various machine learning algorithms, such as Support Vector Machines, Naive Bayes, Multi-Layer Perceptron, and Random Forest, to classify the gestures.…”

Section: Related Workmentioning

confidence: 99%

“…They employed HMMs as the underlying algorithm for gesture recognition. However, these methods either only classified static gestures with the hand and fingers [8], [12], [14] or needed a huge database (about 1000 samples for each kind) [12].…”

Section: Related Workmentioning

confidence: 99%

Mobile manipulator control through gesture recognition using IMUs and Online Lazy Neighborhood Graph search

et al. 2019

View full text Add to dashboard Cite

Gesture-based control potentially eliminates the need for wearisome physical controls and facilitates easy interaction between a human and a robot. At the same time, it is intuitive and enables a natural means of control. In this paper, we present and evaluate a framework for gesture recognition using four wearable Inertial Measurement Units (IMUs) to indirectly control a mobile robot. Six gestures involving different hand and arm motions are defined. A novel algorithm based on an Online Lazy Neighborhood Graph (OLNG) search is used to recognise and classify the gestures online. A software framework is developed to control a robotic platform through integrating our gesture recognition algorithm with a Robot Operating System (ROS), which is in turn used to trigger predefined robot behaviours. Experiments show that the framework is able to correctly detect and classify six different gestures in real time with average success rates of 81.61 % and 81.67 %, while keeping the false-positive rate low by designing and using only 126 training samples.

show abstract

Real-Time and Embedded Detection of Hand Gestures with an IMU-Based Glove

Cited by 77 publications

References 25 publications

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

Modelling a Spatial-Motion Deep Learning Framework to Classify Dynamic Patterns of Videos

Dynamic Hand Gesture Recognition Using 3DCNN and LSTM with FSM Context-Aware Model

Mobile manipulator control through gesture recognition using IMUs and Online Lazy Neighborhood Graph search

Contact Info

Product

Resources

About