Human pose estimation has drawn extensive attention recently and there has been significant progress on it due to the rising popularity of convolutional neural networks (CNN). However, existing stateof-the-art approaches suffer from occlusion, complicated backgrounds, and substantial position fluctuations because of disregarding the human body form. Human parsing is a very pertinent activity that can provide crucial semantic data about bodily parts for position estimation. To overcome the aforesaid limitations, this paper introduces a human pose estimation method using a group-based convolutional neural network model. The proposed method adopts a bottom-up parsing strategy that yields features to extract skeletal key points in the human body. Moreover, it creates a grouping of anatomical key points for individuals by utilizing the non-parametric description for the key point association vector field. Experimental results indicate that the proposed method provides superior performance than the state-of-the-art algorithms in terms of accuracy. In addition, it optimizes its output and detects occluded as well as invisible key points by incorporating feature representation. The proposed method surpasses the recent methods, achieving 93% of the mean average accuracy.