Medical images differ from natural images in significantly higher resolutions and smaller regions of interest. Because of these differences, neural network architectures that work well for natural images might not be applicable to medical image analysis. In this work, we extend the globally-aware multiple instance classifier, a framework we proposed to address these unique properties of medical images. This model first uses a low-capacity, yet memory-efficient, network on the whole image to identify the most informative regions. It then applies another higher-capacity network to collect details from chosen regions. Finally, it employs a fusion module that aggregates global and local information to make a final prediction. While existing methods often require lesion segmentation during training, our model is trained with only image-level labels and can generate pixel-level saliency maps indicating possible malignant findings. We apply the model to screening mammography interpretation: predicting the presence or absence of benign and malignant lesions. On the NYU Breast Cancer Screening Dataset, consisting of more than one million images, our model achieves an AUC of 0.93 in classifying breasts with malignant findings, outperforming ResNet-34 and Faster R-CNN. Compared to ResNet-34, our model is 4.1x faster for inference while using 78.4% less GPU memory. Furthermore, we demonstrate, in a reader study, that our model surpasses radiologist-level AUC by a margin of 0.11. The proposed model is available online.This paper is an extension of work originally presented at the 10th International Workshop on Machine Learning in Medical Imaging (Shen et al., 2019).mammogram per the American College of Radiology Breast Imaging Reporting and Data System (BI-RADS), with specific follow-up recommendations for each category (Liberman and Menell, 2002).Screening mammography interpretation is a particularly challenging task because mammograms are in very high resolutions while most asymptomatic cancer lesions are small, sparsely distributed over the breast and may present as subtle changes in the breast tissue pattern. While randomized clinical trials have shown that screening mammography has significantly reduced breast cancer mortality (Duffy et al., 2002;Kopans, 2002), it is associated with limitations such as false positive recalls for additional imaging and subsequent false positive biopsies which result in benign, non-cancerous findings. About 10% to 20% of women who have an abnormal screening mammogram are recommended to undergo a biopsy. Only 20% to 40% of these biopsies yield a diagnosis of cancer (Kopans, 2015).To tackle these limitations, convolutional neural networks (CNN) have been applied to assist radiologists in the analy-