In this paper, the Scale-Invariant Feature Transform (SIFT) and Fast Library for Approximate Nearest Neighbors (FLANN) based algorithm is used to detect the abnormalities in the National Lung Screening Trial (NLST) CT scans as the exact clinical nodule locations are not provided in the dataset. These identified nodules on NLST CT Scans are then annotated using LabelImg tool. This process consumes time and so furthermore, the automatic nodule detection, You Only Look Once version 4 (YOLOv4) object detection model is implemented. The YOLOv4 object detection model is provided with total of 4187 labelled images in form of training (70%), validation (20%), and test (10%) datasets. Our YOLOv4 model achieves precision of 95%, sensitivity of 81% and mean Average Precision (mAP) of 89.1%.
Background:
Lung cancer is among the major causes of death in the world. Early detection of lung cancer is a major challenge. These encouraged the development of Computer-Aided Detection (CAD) system.
Objectives:
We designed a CAD system for performance improvement in detecting and classifying pulmonary nodules. Though the system will not replace radiologists, it will be helpful to them in order to accurately diagnose lung cancer.
Methods:
The architecture comprises of two steps, among which in the first step CT scans are pre-processed and the candidates are extracted using the positive and negative annotations provided along with the LUNA16 dataset, and the second step consists of three different neural networks for classifying the pulmonary nodules obtained from the first step. The models in the second step consist of 2D-Convolutional Neural Network (2D-CNN), Visual Geometry Group-16 (VGG-16) and simplified VGG-16, which independently classify pulmonary nodules.
Results:
The classification accuracies achieved for 2D-CNN, VGG-16 and simplified VGG-16 were 99.12%, 98.17% and 99.60%, respectively.
Conclusion:
The integration of deep learning techniques along with machine learning and image processing can serve as a good means of extracting pulmonary nodules and classifying them with improved accuracy. Based on these results, it can be concluded that the transfer learning concept will improve system performance. In addition, performance improves proper designing of the CAD system by considering the amount of dataset and the availability of computing power.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.