COVID-19 has become a global disaster that has disturbed the socioeconomic fabric of the world. Efficient and cost-effective diagnosis methods are very much required for better treatment and eliminating false cases for COVID-19. COVID-19 disease is a type of respiratory syndrome, thus lung X-ray analysis has got the attention for an effective diagnosis. Hence, the proposed study introduces an Image processing based COVID-19 detection model C-COVIDNet, which is trained on a dataset of chest X-ray images belonging to three categories: COVID-19, Pneumonia, and Normal person. Image preprocessing pipeline is used for extracting the region of interest (ROI), so that the required features may be present in the input. This lightweight convolution neural network (CNN) based approach has achieved an accuracy of 97.5% and an F1-score of 97.91%. Model input images are generated in batches using a custom data generator. The performance of C-COVIDNet has outperformed the state-of-the-art. The promising results will surely help in accelerating the development of deep learning-based COVID-19 diagnosis tools using radiography.
Diabetes is a chronic disease which indicates the high level of body glucose level. As per the World Health Organization (WHO), 422 million people were diabetic until 2014. This paper develops an accurate classification machine learning model and an efficient usage of data pre-processing pipeline to improve overall accuracy. For the purpose, six algorithms: Support Vector Machine with Linear kernel (Linear-SVM), Support Vector Machine with RBF kernel (RBF-SVM), K-Nearest Neighbor (KNN), Artificial Neural Network (ANN), Decision Tree and Random Forest are used for classification purpose and their comparative accuracy is analyzed. Data Imputation, Oversampling and Feature scaling techniques are the constituents of Data preprocessing pipeline. Experiments are performed on a well-known dataset of National Institute of Diabetes and Digestive and Kidney Diseases, the PIMA diabetes dataset. The data preprocessing techniques, data imputation and Synthetic Minority Oversample Technique (SMOTE) analysis improved classification accuracy from 77% on raw data, to 88.12% (on Random Forest Classifier) and 91% (on ANN Classifier), respectively. Furthermore, a new feature generation approach is applied and performance is analyzed using the SVM model. Original data attributes BMI and Insulin are replaced with new features BMI_NORMAL and INSULIN_NORMAL, respectively. The significant improvement by proposed technique is confirmed by statistical testing followed by post-hoc analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.