High-definition (HD) Fourier transform infrared (FT-IR) spectroscopic imaging is an emerging technique that not only enables chemistry-based visualization of tissue constituents, and label free extraction of biochemical information but its higher spatial detail makes it a potentially useful platform to conduct digital pathology. This methodology, along with fast and efficient data analysis, can enable both quantitative and automated pathology. Here we demonstrate a combination of HD FT-IR spectroscopic imaging of breast tissue microarrays (TMAs) with data analysis algorithms to perform histologic analysis. The samples comprise four tissue states, namely hyperplasia, dysplasia, cancerous and normal. We identify various cell types which would act as biomarkers for breast cancer detection and differentiate between them using statistical pattern recognition tools i.e. Random Forest (RF) and Bayesian algorithms. Feature optimization is integrally carried out for the RF algorithm, reducing computation time as well as redundant spectral features. We achieved an order of magnitude reduction in the number of features with comparable prediction accuracy to that of the original feature set. Together, the demonstration of histology and selection of features paves the way for future applications in more complex models and rapid data acquisition.