Breast cancer is a major cause of death among women worldwide. Hematoxylin and eosin (H&E) stained breast tissue samples from biopsies are observed under microscopes for primary diagnosis of breast cancer. In this paper, we propose a deep learning-based method for classification of H&E stained breast tissue images released for BACH challenge [1] by fine-tuning Inception-v3 convolutional neural network (CNN) [2]. These images are to be classified into four classes -(i) normal tissue, (ii) benign lesion, (iii) in situ carcinoma and (iv) invasive carcinoma. Our strategy is to extract patches based on nuclear density and rejecting patches that are not rich in nuclei, e.g. from non-epithelial regions. This allowed us to discard uninformative regions of the images as compared to random or grid sampling, because visual signs of tumors are most evident in the epithelium. Every patch with high nuclear density in an image is classified in one of the four above mentioned categories. The class of the entire image is determined using majority voting over the nuclear classes. We obtained an average accuracy of 85% over the four classes and 93% for non-cancer (i.e. normal or benign) vs. malignant (in situ or invasive carcinoma), which significantly improves upon a previous benchmark [3].
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.