Lung cancer is one of the most dangerous diseases and requires early diagnosis. Artificial intelligence has played an essential role in the medical field in general, and in analyzing medical images and diagnosing diseases in particular, as it can reduce the human errors that medical experts may make when analyzing medical images. In this study, we conducted a systematic survey of research published during the last 5 years on lung cancer diagnosis and lung nodule classification in 4 reliable databases (Science Direct, Scopus, Web of Science, and IEEE), and we selected 50 research papers using a systematic literature review. The goal of this review is to provide a concise overview of recent advances in lung cancer diagnosis using machine learning and deep learning algorithms. This article summarizes the present state of knowledge on the subject; discussing the findings of recent research publications gives researchers a better grasp of the topic. We analyzed in detail characteristics such as challenges and recommendations for future work, and we present the published datasets and their sources so that researchers can access them and use them to improve on previously achieved results.
Lung cancer is the fastest-growing and most dangerous type of cancer worldwide. It ranks first among cancers in the number of deaths, and diagnosis at a late stage makes treatment more difficult. Artificial intelligence has played an essential role in the medical field in general, and in early disease diagnosis and medical image analysis in particular, as it can reduce the human errors that medical experts may make when analyzing medical images. In this study, we propose a hybrid framework that combines deep learning, in the form of a proposed convolutional neural network, with multi-criteria decision-making techniques. The framework has two aims: to obtain an effective and accurate classification model for lung cancer diagnosis, and to select the best methodology for handling class-imbalanced datasets, a common problem in medical data that leads to prediction errors. The IQ-OTHNCCD dataset, which is class-imbalanced, was used. Three class-balancing techniques were applied separately, and the data produced by each one was fed to the proposed convolutional neural network for feature extraction and classification. The Fuzzy-Weighted Zero-Inconsistency algorithm and VIKOR were then used to rank the classification approaches and determine the best class-balancing technique. This increased the efficiency of the classification: the best model achieved an accuracy of 99.27 %, sensitivity of 99.33 %, specificity of 99 %, precision of 98.67 % and F1-score of 99 %. The approach can be applied to any dataset that suffers from class imbalance in order to find the balancing technique that gives the highest classification accuracy.
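The ranking step described above can be illustrated with a minimal sketch of the standard VIKOR procedure. This is not the authors' implementation. The alternative names, the metric values, and the criterion weights below are hypothetical placeholders; in the study the weights would come from the Fuzzy-Weighted Zero-Inconsistency algorithm, which is not reproduced here. All criteria are assumed to be benefit criteria (higher is better).

```python
def vikor(matrix, weights, v=0.5):
    """Compute VIKOR scores for a decision matrix of benefit criteria.

    matrix  : one row per alternative, one column per criterion
    weights : criterion weights (assumed to be given, e.g. by an expert-based method)
    v       : trade-off between group utility (S) and individual regret (R)
    Returns the lists S, R and Q; a lower Q means a better alternative.
    """
    columns = list(zip(*matrix))
    best = [max(col) for col in columns]    # ideal value per criterion
    worst = [min(col) for col in columns]   # anti-ideal value per criterion

    S, R = [], []
    for row in matrix:
        terms = []
        for j, w in enumerate(weights):
            span = best[j] - worst[j]
            # A criterion on which every alternative ties contributes no regret.
            terms.append(0.0 if span == 0 else w * (best[j] - row[j]) / span)
        S.append(sum(terms))   # weighted distance to the ideal, summed
        R.append(max(terms))   # largest single-criterion regret

    def scale(values):
        lo, hi = min(values), max(values)
        return [0.0 if hi == lo else (x - lo) / (hi - lo) for x in values]

    Q = [v * s + (1 - v) * r for s, r in zip(scale(S), scale(R))]
    return S, R, Q


def rank(names, Q):
    """Order alternative names from best (lowest Q) to worst."""
    return [name for _, name in sorted(zip(Q, names))]


# Hypothetical example: three balancing techniques scored on
# accuracy, sensitivity and specificity (illustrative numbers only).
names = ["balancing_A", "balancing_B", "balancing_C"]
matrix = [
    [0.95, 0.94, 0.96],
    [0.99, 0.99, 0.99],
    [0.97, 0.98, 0.95],
]
weights = [0.4, 0.3, 0.3]  # placeholder weights, not taken from the study

S, R, Q = vikor(matrix, weights)
ranking = rank(names, Q)
print(ranking)
```

In this toy input, `balancing_B` is best on every criterion, so its S, R and Q values are all zero and it is ranked first. With real results, the choice of `v` and of the weights can change the ordering, which is why the weighting step matters.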