“…Thermal images was captured for the Coal/Gangue in certain conditions to increase the difference characters between the two types because of the variability of heat factor taking in account that the Coal/Gangue react by different degrees for the heating environment, a hypothesis has been proposed to say that putting the Coal/Gangue in hot environment and capture the thermal images of the surface will make the classification of the Coal/Gangue more efficient, this hypothesis has been discussed extensively in (SVM-YCbCr) [24], the Coal/Gangue samples have been collected from Bituminous coal, produced in Shanxi Province, western of China, they were put in thermal container until they reach 50 Celsius, after that 139 thermal images (70 coal, 69 gangue) have been captured using the thermal camera which generate thermal images of (.IS2) extension then using the Fluck SmartView 3.1 application the captured thermal images (.IS2) have been converted into PNG images with (680×480) pixels resolution as shown in Figure 1, in the experiment the dataset has been divided into three categorizes training (91), validation(28) and testing (20), Table 1 shows the data set divided between the three phases of the experiment. But number of images still not enough and inevitably will case over fitting problem, this problem has been addressed by many researchers in there works and to solve the scarcity of image resources an augmentation process performed in the small dataset in order to increase it with respect to generate different pixel values in the same position to make sure that a different image are generated beside the original image, Krizhevsky et al [26] in the Alexnet used the augmentation principles to increase the data set in there work, so in order to increase the dataset samples here the augmentation principle has been applied, first the images have been centered and cropped into (480 × 480) pixels resolution to be suitable for augmentation process, then three rotation processes with degrees (90,180,270) have been done and increased the data set from 139 into 556 after that a horizontal inverting has been done to create 1112 images divided in the three categories as explained in Table 2, Figure 4 shows the transformation done to an image and the new generated images and the differences between them.…”