Adapting MobileNets for mobile based upper body pose estimation

Debnath, Bappaditya; O'Brien, Mary; Yamaguchi, Motonori; Behera, Ardhendu

doi:10.1109/avss.2018.8639378

Cited by 15 publications

(10 citation statements)

References 20 publications

“…This network is similar to the idea of the Hourglass network while utilizing U-Net as each component with a more optimized global connection across each stage resulting in fewer parameters and small model size. Debnath et al (2018) adapted MobileNets (Howard et al, 2017) for pose estimation by designing a split stream architecture at the final two layers of the MobileNets. Feng et al (2019) designed a lightweight variant of Hourglass network and trained it with a full teacher Hourglass network by a Fast Pose Distillation (FPD) training strategy.…”

Section: Detection-based Methods (mentioning)

confidence: 99%

“…(1) Patch-based: (Jain et al, 2013; Chen and Yuille, 2014; Ramakrishna et al, 2014) (2) Network design: (Tompson et al, 2015; Bulat and Tzimiropoulos, 2016; Xiao et al, 2018), multi-scale inputs (Rafi et al, 2016), heatmap-based improvement (Papandreou et al, 2017), Hourglass (Newell et al, 2016), CPM (Wei et al, 2016), PRM (Yang et al, 2017), feed forward module (Belagiannis and Zisserman, 2017), HRNet (Sun et al, 2019), GAN (Chou et al, 2017; Peng et al, 2018) (3) Body structure constraint: (Tompson et al, 2014; Lifshitz et al, 2016; Yang et al, 2016; Gkioxari et al, 2016; Chu et al, 2016, 2017; Ning et al, 2018; Ke et al, 2018; Tang et al, 2018a; Tang and Wu, 2019) (4) Temporal constraint: (Jain et al, 2014; Pfister et al, 2015; Luo et al, 2018) (5) Network compression: (Tang et al, 2018b; Debnath et al, 2018; Feng et al, 2019) 2D Multiple…”

Section: Regression-based (mentioning)

confidence: 99%

Monocular human pose estimation: A survey of deep learning-based methods

Chen, Tian, He. 2020. Computer Vision and Image Understanding

“…The SPPE regression methods currently perform lower as compared to the body part detection methods. For example, the best accuracy was reported by Debnath et al [34] with a PCKh@0.2 of 96.4%. Although body part detection methods have shown excellent performance, however, they are prone to estimating false positives [143].…”

Section: Discussion (mentioning)

confidence: 94%

“…An alternative approach is to use heatmaps which provide richer supervision information compared to joint coordinates, by preserving spatial location information [220]. This information is ideal for training CNNs and has resulted in a growing interest in leveraging CNNs for the purpose of HPE [5,13,17,34,45,89,97,124,146,178,201,206,209]. Table 6 shows that the best performing body part detection-based method is achieved by Debnath et al [34] with a PCKh@0.2 of 96.4%.…”

Section: Body Part Detection (mentioning)

confidence: 99%
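
For readers unfamiliar with the PCKh figure quoted in the two statements above: PCKh is commonly defined as the fraction of predicted joints that fall within a threshold distance of the ground truth, where the threshold is a fraction (here 0.2) of the head segment length. The following is a minimal sketch under that common definition, not the evaluation code of any cited paper; the function name `pckh`, its arguments, and the toy coordinates are all made up for illustration.

```python
import math


def pckh(pred, gt, head_sizes, alpha=0.2):
    """Fraction of predicted joints within alpha * head size of ground truth.

    pred, gt: lists of poses, each pose a list of (x, y) joint coordinates.
    head_sizes: one head-segment length per pose, used as the normalizer.
    alpha: threshold as a fraction of the head size (0.2 for PCKh@0.2).
    """
    correct = 0
    total = 0
    for pose_pred, pose_gt, head in zip(pred, gt, head_sizes):
        for (px, py), (gx, gy) in zip(pose_pred, pose_gt):
            # Euclidean distance between prediction and ground truth.
            dist = math.hypot(px - gx, py - gy)
            correct += dist <= alpha * head
            total += 1
    return correct / total if total else 0.0


# Toy data (hypothetical): one pose, two joints, head size of 10 pixels.
gt = [[(50.0, 50.0), (80.0, 120.0)]]
pred = [[(51.0, 50.0), (85.0, 120.0)]]
score = pckh(pred, gt, head_sizes=[10.0], alpha=0.2)
print(score)
```

In the toy data the first joint is 1 pixel off and the second is 5 pixels off, so with a 2-pixel threshold only the first counts as correct.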

Human Body Pose Estimation for Gait Identification: A Comprehensive Survey of Datasets and Models

et al. 2022

Person identification is a problem that has received substantial attention, particularly in security domains. Gait recognition is one of the most convenient approaches enabling person identification at a distance without the need of high-quality images. There are several review studies addressing person identification such as the utilization of facial images, silhouette images, and wearable sensor. Despite skeleton-based person identification gaining popularity while overcoming the challenges of traditional approaches, existing survey studies lack the comprehensive review of skeleton-based approaches to gait identification. We present a detailed review of the human pose estimation and gait analysis that make the skeleton-based approaches possible. The study covers various types of related datasets, tools, methodologies, and evaluation metrics with associated challenges, limitations, and application domains. Detailed comparisons are presented for each of these aspects with recommendations for potential research and alternatives. A common trend throughout this paper is the positive impact that deep learning techniques are beginning to have on topics such as human pose estimation and gait identification. The survey outcomes might be useful for the related research community and other stakeholders in terms of performance analysis of existing methodologies, potential research gaps, application domains, and possible contributions in the future.

“…MobileNets reduce computation in the first few layers by embracing depthwise separable convolutions and inception models. The embedded pointwise convolution factorizes standard convolution into a 1×1 convolution and depth-wise convolution, which reduces computation and model size [30]. Therefore, MobileNets institute autonomous behavior into systems to reduce execution and cognitive burden on users by facilitating remote inspection and package delivery, besides effectively surveying hostile environments.…”

Section: Mobilenet (mentioning)

confidence: 99%
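
The computation saving mentioned in the statement above can be made concrete. In the usual description of MobileNets (Howard et al, 2017), a depthwise separable convolution replaces a standard convolution with a per-channel depthwise convolution followed by a 1×1 pointwise convolution. The sketch below counts multiply-adds for both cases; the function names and the layer dimensions are illustrative assumptions, not values taken from any cited paper.

```python
def standard_conv_mult_adds(k, m, n, f):
    """Multiply-adds of a standard k x k convolution.

    k: kernel size, m: input channels, n: output channels,
    f: spatial size of the (square) output feature map.
    """
    return k * k * m * n * f * f


def depthwise_separable_mult_adds(k, m, n, f):
    """Multiply-adds of a depthwise conv followed by a 1x1 pointwise conv."""
    depthwise = k * k * m * f * f  # one k x k filter per input channel
    pointwise = m * n * f * f      # 1x1 conv mixing channels
    return depthwise + pointwise


# Hypothetical layer: 3x3 kernel, 64 -> 128 channels, 56x56 feature map.
k, m, n, f = 3, 64, 128, 56
std = standard_conv_mult_adds(k, m, n, f)
sep = depthwise_separable_mult_adds(k, m, n, f)
ratio = sep / std
print(f"standard: {std}, separable: {sep}, ratio: {ratio:.4f}")
```

The ratio simplifies to 1/n + 1/k², so for 3×3 kernels the separable form needs roughly 8 to 9 times fewer multiply-adds when the number of output channels is large.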

Alignment control using visual servoing and mobilenet single-shot multi-box detection (SSD): a review

Rogelio, Dadios, Vicerra et al. 2022. Int. J. Adv. Intell. Informatics

The concept is highly critical for robotic technologies that rely on visual feedback. In this context, robot systems tend to be unresponsive due to reliance on pre-programmed trajectory and path, meaning the occurrence of a change in the environment or the absence of an object. This review paper aims to provide comprehensive studies on the recent application of visual servoing and DNN. PBVS and Mobilenet-SSD were chosen algorithms for alignment control of the film handler mechanism of the portable x-ray system. It also discussed the theoretical framework features extraction and description, visual servoing, and Mobilenet-SSD. Likewise, the latest applications of visual servoing and DNN was summarized, including the comparison of Mobilenet-SSD with other sophisticated models. As a result of a previous study presented, visual servoing and MobileNet-SSD provide reliable tools and models for manipulating robotics systems, including where occlusion is present. Furthermore, effective alignment control relies significantly on visual servoing and deep neural reliability, shaped by different parameters such as the type of visual servoing, feature extraction and description, and DNNs used to construct a robust state estimator. Therefore, visual servoing and MobileNet-SSD are parameterized concepts that require enhanced optimization to achieve a specific purpose with distinct tools.
