Quality control of special steel is accomplished through visual inspection of its microstructure based on microscopic images. This study proposes an “automatic-quality-level-estimation system” based on machine learning. Visual inspection of this type is sensory-based, so training data may include variations in judgments and training errors due to individual differences between inspectors, which makes it easy for a drop in generalization performance to occur due to overfitting. To deal with this issue, we here propose the preprocessing of inspection images and a data augmentation technique. Preprocessing reduces variation in images by extracting features that are highly related to the level of quality from inspection images. Data augmentation, meanwhile, suppresses the problem of overfitting when training with a small number of images by taking into account information on variation in judgment values obtained from on-site experience. While the correct-answer rate for judging quality level by an inspector was about 90%, the proposed method achieved a correct-answer rate of 92.5%, which indicates that the method shows promise for practical application.