Human pose estimation has drawn extensive attention recently and there has been significant progress on it due to the rising popularity of convolutional neural networks (CNN). However, existing stateof-the-art approaches suffer from occlusion, complicated backgrounds, and substantial position fluctuations because of disregarding the human body form. Human parsing is a very pertinent activity that can provide crucial semantic data about bodily parts for position estimation. To overcome the aforesaid limitations, this paper introduces a human pose estimation method using a group-based convolutional neural network model. The proposed method adopts a bottom-up parsing strategy that yields features to extract skeletal key points in the human body. Moreover, it creates a grouping of anatomical key points for individuals by utilizing the non-parametric description for the key point association vector field. Experimental results indicate that the proposed method provides superior performance than the state-of-the-art algorithms in terms of accuracy. In addition, it optimizes its output and detects occluded as well as invisible key points by incorporating feature representation. The proposed method surpasses the recent methods, achieving 93% of the mean average accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.