“…CPTAC RNA-seq and mass spectrometry datasets for breast (Krug et al, 2020), ovarian (Hu et al, 2020b; Zhang et al, 2016), colorectal (Vasaikar et al, 2019; Zhang et al, 2014), lung adenocarcinoma (Gillette et al, 2020), and endometrial (Dou et al, 2020) cancer discovery studies were retrieved in accordance with the CPTAC data use and embargo policies using the cptac v.0.9.1 package in Python 3.9. Statistical learning was performed using scikit-learn 0.24.2 (Lindgren et al, 2021). Transcriptomics data were standardized, after which data were split 80/20 into train and test sets.…”
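
The standardize-then-split step described in the excerpt can be sketched with scikit-learn roughly as follows. This is a minimal illustration, not the authors' code: the synthetic matrix `X` and target `y` stand in for the CPTAC transcriptomics and proteomics data, and per-gene scaling, the random seed, and the absence of stratification are all assumptions that the excerpt does not specify.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Hypothetical stand-in for a samples x genes transcript matrix and a
# per-sample target (e.g. one protein's abundance); not real CPTAC data.
rng = np.random.default_rng(0)
X = rng.normal(loc=5.0, scale=2.0, size=(100, 20))
y = rng.normal(size=100)

# Standardize each gene (column) to zero mean and unit variance across all
# samples, following the order stated in the text: standardize, then split.
X_std = StandardScaler().fit_transform(X)

# 80/20 train/test split. random_state is an assumption added here so the
# sketch is reproducible; the excerpt does not report a seed.
X_train, X_test, y_train, y_test = train_test_split(
    X_std, y, test_size=0.2, random_state=0
)
```

With 100 samples, this yields 80 training rows and 20 test rows; any scikit-learn estimator could then be fit on `X_train` and `y_train` and evaluated on the held-out 20%.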