Background: Melanoma is a skin cancer that starts when the melanocytes that produce the skin color pigment start to grow out of control and form a cancer. Detecting melanoma early before it spreads to the lymph nodes and other parts of the body is very important because it makes a big difference to the patient's 5-year life expectancy. Screening is the process of conducting a skin examination to suspect a mole is melanoma using dermoscopic or macroscopic images. However, manual screening takes a long time. Therefore, automatic melanoma detection is needed to speed up the melanoma detection process. The previous studies still have weakness because it has low precision or recall, which means the model cannot predict melanoma accurately. The distribution of melanoma and moles datasets is imbalanced where the number of melanomas is less than moles. In addition, in previous study, comparisons of several CNN transfer learning architectures have not been carried out on dermoscopic and macroscopic images.
Objective: This study aims to detect melanoma using the Convolutional Neural Network (CNN) with transfer learning on dermoscopic and macroscopic melanoma images. CNN with Transfer learning is a popular method for classifying digital images with high accuracy.
Methods: This study compares four CNN with transfer learning architectures, namely MobileNet, Xception, VGG16, and ResNet50 on dermoscopic and macroscopic image. This research also uses black-hat filtering and inpainting at the preprocessing stage to remove hair from the skin image.
Results: MobileNet is the best model for classifying melanomas or moles in this experiment which has 83.86% of F1 score and 11 second of training time per epoch.
Conclusion: MobileNet and Xception have high average F1 scores of 84.42% and 80.00%, so they can detect melanoma accurately even though the number of melanoma datasets is less than moles. Therefore, it can be concluded that MobileNet and Xception are suitable models for classifying melanomas and moles. However, MobileNet has the fastest training time per epoch which is 11 seconds. In the future, oversampling method can be implemented to balance the number of datasets to improve the performance of the classification model.