2015
DOI: 10.1109/tip.2015.2487860
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Deep Autoencoder for Human Pose Recovery

Abstract: Video-based human pose recovery is usually conducted by retrieving relevant poses using image features. In the retrieving process, the mapping between 2D images and 3D poses is assumed to be linear in most of the traditional methods. However, their relationships are inherently non-linear, which limits recovery performance of these methods. In this paper, we propose a novel pose recovery method using non-linear mapping with multi-layered deep neural network. It is based on feature extraction with multimodal fus… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
90
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 538 publications
(91 citation statements)
references
References 30 publications
1
90
0
Order By: Relevance
“…Auto encoders are also used in some other computer vision tasks, such as human pose recovery and hand gesture recognition [31][32][33]. Authors in [31] proposed a three stage structure, consisting of two conventional auto encoders in the first and third stages and one multilayer neural network in the second stage for human pose recovery.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Auto encoders are also used in some other computer vision tasks, such as human pose recovery and hand gesture recognition [31][32][33]. Authors in [31] proposed a three stage structure, consisting of two conventional auto encoders in the first and third stages and one multilayer neural network in the second stage for human pose recovery.…”
Section: Related Workmentioning
confidence: 99%
“…Authors in [31] proposed a three stage structure, consisting of two conventional auto encoders in the first and third stages and one multilayer neural network in the second stage for human pose recovery. To train the system, two kinds of inputs are used: 2D images (obtained from video frames) and 3D poses.…”
Section: Related Workmentioning
confidence: 99%
“…An example of efficient usage of such autoencoder representations can be found for the task of human pose recovery. Hong et al [7] used an autoencoder architecture to reconstruct 3D poses from 2D silhouettes. Further algorithmic developments have introduced the denoising autoencoder [8] and the variational autoencoder [9], where the latter extends the autoencoder to a generative model.…”
Section: Introductionmentioning
confidence: 99%
“…However, proposed idea implements accuracies of 91% versus 86% and 93% versus 93% on the in vitro and in vivo datasets used in validating the US EPA method. Chaoqun Hong et al proposed a new pose retrieval technique which focuses on multimodal integration feature extraction and backpropagation deep neural network by using multilayered deep neural network with nonlinear mapping [9]. In [10] TzuHsi Song et al focused on bone marrow trepan biopsy images and proposed a hybrid deep autoencoder (HDA) network with Curvature Gaussian method for active and exact bone marrow hematopoietic stem cell detection via related highlevel feature correspondence.…”
Section: Introductionmentioning
confidence: 99%