Fingertip detection and tracking for recognition of air-writing in videos

Mukherjee, Sohom; Sekh, Arif Ahmed; Dogra, Debi Prosad; Kar, Samarjit; Roy, Partha Pratim

doi:10.1016/j.eswa.2019.06.034

Cited by 80 publications

(54 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…It was found experimentally that the four immediate neighbours contributed to self co-articulations in most cases. For polarity, the gradient of all the gesture coordinates was calculated using (17) and (18), and by (19), gestures with normal strokes were obtained. Then, to check the performance of global measure, using (20), the MER was calculated for 'ED', 'IV e ', and 'Po'.…”

Section: Experiments 1: With a Reference Patternmentioning

confidence: 99%

Segregation of meaningful strokes, a pre‐requisite for self co‐articulation removal in isolated dynamic gestures

Monsley

Yadav

Misra

et al. 2020

IET Image Processing

View full text Add to dashboard Cite

Gesture formation, a pre-processing step, has its importance when variations in patterns, scale, and speed come into play. Self co-articulations are intentional movements performed by an individual to complete a gesture, whose presence in the trajectory alters its original meaning. For recognition, most researchers have directly used the trajectory formed along with these self co-articulated strokes, with a few removing it using visible trait-like velocity. Usage of velocity has shortcomings as gesturing in air differs from gesturing over a solid surface; hence, we propose a gesture formation model, which incorporates global and local measures to remove these self co-articulations. The global measure uses Euclidean distance, instantaneous velocity, and polarity calculated from the complete gesture, while the local measure segments the gesture into stroke-level segments by using the minimummaximum-polarity algorithm and applies the selective bypass rules. The proposed model, when experimented on gestures patterns with premeditated speed variation, has a mean error rate of 0.0069 and 7.40% self co-articulations;individuals' natural gesticulation has a mean error rate of 0.0371 and 12.07% self co-articulations. Experimentation on each gesture of NITS hand gesture databases showed a relative improvement of 40% (accuracy 97%) over the existing baseline models.

show abstract

Section: Experiments 1: With a Reference Patternmentioning

confidence: 99%

Segregation of meaningful strokes, a pre‐requisite for self co‐articulation removal in isolated dynamic gestures

Monsley

Yadav

Misra

et al. 2020

IET Image Processing

View full text Add to dashboard Cite

show abstract

“…Recent work on in-air handwritten number recognition can be classified into two groups: image-based representation [4]- [13] and node-based representation [14]- [20]. In image-based representation, the numbers are saved as an image with two types of pixels: number path pixels and background pixels, as in static number recognition applications.…”

Section: In-air Handwritten Number Recognitionmentioning

confidence: 99%

“…Later, further improvements have been achieved by normalizing the hand-path by picking out a specific number of points [5]. Deep learning-based techniques employ one dimensional convolutional neural networks to learn the features from image-based representations [9]- [13]. It must be also noted that some methods were concerned with recognizing words written on a touch pad using finger strokes [9]- [11], as opposed to in-air drawing as in robotic applications.…”

Section: In-air Handwritten Number Recognitionmentioning

confidence: 99%

“…Recently, in-air hand-drawn number recognition has gained strong interest [4]- [20] due to applications-related importance and the interesting challenges in the problem. These methods are either based on image representation of numbers and shapes [4]- [13] followed by shape matching or the path representation of hand movement [14]- [20] followed by shape matching. Some methods are based on deep learning for shape matching [9]- [13].…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Graph Spectral Domain Feature Learning With Application to in-Air Hand-Drawn Number and Shape Recognition

Alwaely

Abhayaratne

2019

IEEE Access

View full text Add to dashboard Cite

This paper addresses the problem of recognition of dynamic shapes by representing the structure in a shape as a graph and learning the graph spectral domain features. Our proposed method includes pre-processing for converting the dynamic shapes into a fully connected graph, followed by analysis of the eigenvectors of the normalized Laplacian of the graph adjacency matrix for forming the feature vectors. The method proposes to use the eigenvector corresponding to the lowest eigenvalue for formulating the feature vectors as it captures the details of the structure of the graph. The use of the proposed graph spectral domain representation has been demonstrated in an in-air hand-drawn number and symbol recognition applications. It has achieved average accuracy rates of 99.56% and 99.44%, for numbers and symbols, respectively, outperforming the existing methods for all datasets used. It also has the added benefits of fast real-time operation and invariance to rotation and flipping, making the recognition system robust to different writing and drawing variations.

show abstract

“…Research and application of moving-target positioning is an important part of intelligent video perception, and is being studied in China and other parts of the world. Mukherjee et al [1] proposed a position-tracking method to distinguish and track multiple targets in a given scenario, based on the colors of the moving targets. Christiansen et al [2] used the moving target segmentation method based on the points on the contour of The associate editor coordinating the review of this manuscript and approving it for publication was Kun Mean Hou.…”

Section: Introductionmentioning

confidence: 99%

Moving-Object Tracking Algorithm Based on PCA-SIFT and Optimization for Underground Coal Mines

Jiang

Dai

et al. 2019

IEEE Access

View full text Add to dashboard Cite

In view of the complex and changeable environment in underground coal mines, an improved algorithm based on the principal component analysis-scale invariant feature transform (PCA-SIFT) and mean shift is proposed to address the issues for which existing tracking algorithms are not adequate; for example, when differentiating between moving targets and the background, the tracking in the case of moving objects (e.g., confusion between foreground and background) is not optimal. This results in poor resolution and the inability to deal with very dusty conditions, scale change, and rotation. The proposed feature target tracking model was developed using the scale invariance property of the PCA-SIFT feature-extraction algorithm. Finally, the mean-shift method was used to track moving objects. The experimental results showed that the optimized algorithm for tracking moving objects was significantly better and more robust than the existing algorithm. INDEX TERMS Target tracking, scale invariant feature transform, mean shift, target detection.

show abstract

Fingertip detection and tracking for recognition of air-writing in videos

Cited by 80 publications

References 58 publications

Segregation of meaningful strokes, a pre‐requisite for self co‐articulation removal in isolated dynamic gestures

Segregation of meaningful strokes, a pre‐requisite for self co‐articulation removal in isolated dynamic gestures

Graph Spectral Domain Feature Learning With Application to in-Air Hand-Drawn Number and Shape Recognition

Moving-Object Tracking Algorithm Based on PCA-SIFT and Optimization for Underground Coal Mines

Contact Info

Product

Resources

About