According to the World Health Organization, cancer is the second leading cause of mortality. Breast cancer is the most prevalent cancer diagnosed in women around the world. Breast cancer diagnostics range from mammograms to CT scans and ultrasounds, but a biopsy is the only way to know for sure if the suspicious cells detected in the breast are cancerous or not. This paper's main contribution is multi-fold. First, it proposes a deep learning approach to detect breast cancer from biopsy microscopy images. Deep convolution nets of various types are used. Second, the paper examines the effects of different data preprocessing techniques on the performance of deep learning models. Third, the paper introduces an ensemble method for aggregating the best models in order to improve performance. The experimental results revealed that Densenet169, Resnet50, and Resnet101 are the three best models achieving accuracy scores of 62%, 68%, and 85%, respectively. without data preprocessing. With the help of data augmentation and segmentation, the accuracy of these models increased by 20%, 17%, and 6%, respectively. Additionally, the ensemble learning technique improves the accuracy of the models even further. The results show that the best accuracy achieved is 92.5%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.