The lack of specific and sensitive early diagnostic options for pancreatic cancer (PC) results in patients being largely diagnosed with late-stage disease, thus inoperable and burdened with high mortality. Molecular spectroscopic methodologies, such as Raman or infrared spectroscopies, show promise in becoming a leader in screening for early-stage cancer diseases, including PC. However, should such technology be introduced, the identification of differentiating spectral features between various cancer types is required. This would not be possible without the precise extraction of spectra without the contamination by necrosis, inflammation, desmoplasia, or extracellular fluids such as mucous that surround tumor cells. Moreover, an efficient methodology for their interpretation has not been well defined. In this study, we compared different methods of spectral analysis to find the best for investigating the biomolecular composition of PC cells cytoplasm and nuclei separately. Sixteen PC tissue samples of main PC subtypes (ductal adenocarcinoma, intraductal papillary mucinous carcinoma, and ampulla of Vater carcinoma) were collected with Raman hyperspectral mapping, resulting in 191,355 Raman spectra and analyzed with comparative methodologies, specifically, hierarchical cluster analysis, non-negative matrix factorization, T-distributed stochastic neighbor embedding, principal components analysis (PCA), and convolutional neural networks (CNN). As a result, we propose an innovative approach to spectra classification by CNN, combined with PCA for molecular characterization. The CNN-based spectra classification achieved over 98% successful validation rate. Subsequent analyses of spectral features revealed differences among PC subtypes and between the cytoplasm and nuclei of their cells. Our study establishes an optimal methodology for cancer tissue spectral data classification and interpretation that allows precise and cognitive studies of cancer cells and their subcellular components, without mixing the results with cancer-surrounding tissue. As a proof of concept, we describe findings that add to the spectroscopic understanding of PC.
Graphical Abstract