“…Test achievements for the fine-tuning of the ResNet50 model and end-to-end training of the developed CNN model were found to be 92.6% and 91.6%, respectively. For comparative purposes, various local texture descriptors were considered; namely, Local Binary Patterns (LBP) ( Ahonen, Hadid, & Pietikainen, 2006 ), Frequency Decoded LBP (FDLBP) ( Dubey, 2019 ), Quaternionic Local Ranking Binary Pattern (QLRBP) (Lan, Zhou, & Tang, 2015), Binary Gabor Pattern (BGP) ( Zhang, Zhou, & Li, 2012 ), Local Phase Quantization (LPQ) ( Ojansivu & Heikkilä, 2008 ), Binarized Statistical Image Features (BSIF) ( Kannala & Rahtu, 2012 ), CENsus TRansform hISTogram (CENTRIST) ( Wu & Rehg, 2010 ), and Pyramid Histogram of Oriented Gradients (PHOG) ( Bosch, Zisserman, & Munoz, 2007 ). From the local texture descriptors, the BSIF with SVM classifier produced a 90.5% accuracy score.…”