“…CNNs have proved successful in many machine learning tasks such as handwriting recognition [ 17 ], natural language processing [ 18 ], text classification [ 19 ], image classification [ 17 ], face recognition [ 20 ], face detection [ 21 ], object detection [ 22 ], video classification [ 23 ], object tracking [ 24 ], super resolution [ 25 ], human pose estimation [ 26 ], and so forth. CNNs, introduced by Lecun et al [ 17 ], combine three architectural concepts of local receptive fields, shared weights, and spatial or temporal subsampling in order to ensure some degree of shift, scale, and distortion invariance.…”