2022
DOI: 10.1007/s00138-022-01332-8
|View full text |Cite
|
Sign up to set email alerts
|

Optimized hand pose estimation CrossInfoNet-based architecture for embedded devices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…We opted to only compare the two baselines because of the similarity in our approaches. On the first few samples both our architecture and CrossInfoMobileNet [12] performed well with insignificant differences, however as the number of samples increased, our proposed method exhibited superior performance compared to the competing state-of-the-art architectures. For instance, when evaluating the test samples based on the maximum joint error below a specified threshold, our method achieved a remarkable 96% accuracy at a threshold level of 40 mm.…”
Section: Error Analysis On Nyu Datasetsmentioning
confidence: 90%
See 1 more Smart Citation
“…We opted to only compare the two baselines because of the similarity in our approaches. On the first few samples both our architecture and CrossInfoMobileNet [12] performed well with insignificant differences, however as the number of samples increased, our proposed method exhibited superior performance compared to the competing state-of-the-art architectures. For instance, when evaluating the test samples based on the maximum joint error below a specified threshold, our method achieved a remarkable 96% accuracy at a threshold level of 40 mm.…”
Section: Error Analysis On Nyu Datasetsmentioning
confidence: 90%
“…This work presents lightweight network (LightWeightNet) hand pose estimation (HPE), a hand pose estimation based on convolutional neural network finetuned and optimized to mobile phone processors to minimize the computational cost and allow mobile phone users enjoy an immersive experience. The first attempt was the work of [11], where a CrossInfoMobilenet was presented replacing a computational critical CrossinfoNet [12]. Herein, we present an improved version of CrossInfoMobileNet with an additional depth-wise separable convolutions which greatly lowers the computational cost of a general convolutional neural network (CNN) model used in MobileNet3 [13].…”
mentioning
confidence: 99%
“…It comes with a new block type Squeeze-and-Excitation (SE) that better take into account feature maps based on their channel dependencies. Also, instead of the ReLU activation function, there is a Hard-swish function, which reduces the number of multiply-accumulate operations (MAC) but preserves nonlinearity [52]. It also comes separately in a version for more powerful and weaker target devices, and both versions can additionally be made minimalist or full.…”
Section: A Mobilenetmentioning
confidence: 99%