e21148 Background: Protein glycosylation is the most abundant and complex form of post-translational protein modification. Glycosylation profoundly affects protein structure, conformation, and function. The elucidation of the potential role of differential protein glycosylation as biomarkers has so far been limited by the technical complexity of generating and interpreting this information. We have recently established a novel, powerful platform that combines liquid chromatography/mass spectrometry with a proprietary artificial-intelligence-based data processing engine that allows, for the first time, highly scalable interrogation of the glycoproteome. Methods: Using this platform, we interrogated 694 glycopeptide (GP) and non-glycosylated peptide transitions derived from 74 serum proteins in pre-treatment peripheral blood samples from a cohort of 316 individuals with non-small-cell lung cancer (NSCLC) (128 females, 187 males, 1 with unknown sex, median age 66 years, age range 31-89 years, stage 0-4 N’s: 1 / 99 / 80 / 84 / 49, 3 missing) and a comparison cohort of 194 healthy control samples (102 females, 92 males, median age 52 years, age range 30-63 years). Age- and sex-adjusted differential expression analysis for 596 normalized biomarkers were performed to evaluate statistically significant differential abundances using an FDR-adjusted q-value of 0.05 as a cutoff. Repeated five-fold cross-validated LASSO-regularized logistic regression was performed to create a multivariable classifier that predicts whether a serum sample belongs to the healthy or NSCLC cohort. Results: We identified 432 biomarkers with significant abundance differences at FDR ≤ 0.05 between samples with NSCLC and healthy controls. Using 70% of the complete cohort (balanced by case/control membership, NSCLC stage, sex, and age quartile) as a training set, we selected a total of 375 glycopeptide and non-glycosylated peptide biomarker features that remained differentially expressed at FDR-adjusted q-value ≤ 0.05 as input into a LASSO-regularized multivariable classifier. This resulting in a 19-biomarker model exhibiting an accuracy of 94.8% (96.9% sensitivity, 91.2% specificity) and AUC of 0.989. This classifier was validated in an independent test set comprising the remaining 30% of subjects, yielding an accuracy of 94.5% (95.5% sensitivity, 93.0% specificity) and AUC of 0.975. Sensitivity in the test set was 100% / 96% / 99% / 96% / 94% / 10%, in stages 0-4 and missing, respectively. Conclusions: Our results indicate that glycoproteomic biomarkers can be leveraged as a strong liquid biopsy-based screening tool for patients at high risk of NSCLC, as an alternative to imaging modalities.
9545 Background: Protein glycosylation is the most abundant and complex form of post-translational protein modification. Glycosylation profoundly affects protein structure, conformation, and function. The elucidation of the potential role of differential protein glycosylation as biomarkers has so far been limited by the technical complexity of generating and interpreting this information. We have recently established a novel, powerful platform that combines liquid chromatography/mass spectrometry with a proprietary artificial-intelligence-based data processing engine that allows, for the first time highly scalable interrogation of the glycoproteome. Methods: Using this platform, we interrogated 526 glycopeptide (GP) signatures derived from 75 serum proteins in pretreatment blood samples from a cohort of 205 individuals (66 females, 139 males, age range 24 to 97 years) with metastatic malignant melanoma treated either with nivolumab plus ipilimumab (95 patients) or pembrolizumab (110 patients) immune-checkpoint inhibitor (ICI) therapy. Results: In an optimized assay containing 27 glycopeptides and 20 non-glycosylated peptides, we identified 14 GPs with abundance differences at FDR q≤0.05 with regard to PFS. Using 40% of the cohort as a training set and selecting 12 glycopeptide and non-glycosylated peptide biomarker features of the 47 total by LASSO shrinkage, we created a multivariable-model-based classifier for PFS that yielded a hazard ratio (HR) for prediction of likely ICI benefit of 7.5 at p < 0.0001. This classifier was validated in the test set comprised of the held-out 60% of patients, yielding a HR of 4.7 at a similar p-value for separating patients likely benefiting from ICI therapy and those likely not benefiting from ICI therapy (50% PFS of 18 months vs. 3 months based on classifier score above/below cutoff). This classifier has a sensitivity of > 99% to predict likely ICI benefit, while still performing at a specificity of 26%, thus helping to safely reduce ultimately unnecessary and non-beneficial exposure to these agents of one in four who otherwise would unnecessarily be exposed to them. Conclusions: Our results indicate that glycoproteomics holds a strong promise as a predictor for checkpoint inhibitor treatment benefit that appears to significantly outperform other currently pursued biomarker approaches in this context.
e15529 Background: Excluding skin cancers, colorectal cancer is the third most common cancer diagnosed in both men and women in the United States. Colorectal cancer (CRC) affects men and women of all racial and ethnic groups and is most often found in people who are 50 years old or older. To aid diagnosis and improve screening for CRC, this study focuses on identifying glycoprotein biomarkers using blood serum. Methods: Novel methods including liquid-chromatography/mass-spectrometry (LC-MS) with in-house peak integration software PB-Net were used to identify glycoprotein biomarkers by analyzing blood serum. Samples were sourced from different biorepositories including 245 CRC, 38 adenoma and 196 healthy controls. The data were split into 75% training and 25% hold-out test set for multivariable predictions. Statistical analysis was performed on normalized data to identify potential biomarkers differentiating adenoma and different stages of CRC samples from the healthy controls. Results: There were 419 significantly differentially expressed glycopeptides/peptides from comparisons between CRC and adenoma samples against the healthy control samples with an FDR < 0.05. A subset of these biomarkers were assessed, generating a 21-biomarker multivariable classifier model. We observed a test set AUC of 0.926, and the sensitivity for all stages of CRC was 90% (87% early stage, 92% late stage). Notably, sensitivity for adenomas was 79%, a large improvement upon the state of the art in adenoma diagnosis. Conclusions: Identification of these key glycopeptides/peptides in blood serum could prove to be a promising non-invasive diagnostic tool that can help improve screening and aid in early detection of advanced adenomas and CRC.
e12545 Background: Breast cancer is the most common cancer among women worldwide.Traditional methods of cancer detection such as tissue biopsy are invasive, costly, time consuming and not amenable for repetition. As a result, minimally invasive liquid biopsies, especially blood-based biomarkers show potential value for breast cancer risk prediction and early detection. In this study, we investigated the use of serum glycoproteins circulating in blood to identify a panel of potential prognostic markers that may aid in predicting breast cancer in women. Methods: We applied a novel platform for characterizing blood glycoproteomic biomarkers, combining liquid-chromatography/mass spectrometry (LC-MS) with artificial intelligence/neural networks (AI-NN) to analyze serum samples from 279 breast cancer patients (median age 56 years, with stage 0-4 N’s: 1 / 83 / 114 / 56 / 25) and 102 healthy control samples (median age 52 years). A panel of 596 serum glycosylated and non-glycosylated peptides, representing 71 serum proteins, were analyzed. Age-adjusted differential expression analysis for 596 normalized biomarkers were performed to evaluate statistically significant differential abundances using an FDR q-value of 0.05 as a cutoff. Using the top differentially expressed markers as input, a LASSO penalized logistic regression model with 5-fold repeated cross validation was applied to identify the top biomarkers contributing to the separation between healthy controls and breast cancer patients. Results: We identified 243 out of 596 markers that were differentially expressed (FDR <<0.05) between breast cancer samples and healthy controls. Out of those, 11 markers were obtained as the top predictors in classifying breast cancer patients and healthy controls. The classification algorithm yielded an accuracy of 94% (95.9% sensitivity, 88.7% specificity) and an AUC of 0.983 on the training set. This classifier was validated on an independent test set with 30% of the subjects, yielding an accuracy of 93% (96.4% sensitivity, 83.9% specificity) and an AUC of 0.974. Test sensitivity was high across stages, at 96% / 90% / 95% / 90% in stages 1-4, respectively. Conclusions: Based on the results, we conclude that circulating glycoproteins in serum may be useful in screening applications in breast cancer, and strongly demonstrates the utility of glycoprotein profiles as a powerful non-invasive diagnostic tool.
e17604 Background: Ovarian cancer (OC) is the fifth- leading cause of cancer-related deaths among women, and the most lethal gynecological cancer. Currently available biomarkers, including CA-125 and HE4, show suboptimal diagnostic performance for differentiating among benign and malignant pelvic tumors, and the early recognition of OC. Differentiation of benign and malignant pelvic tumors is required for proper patient triaging and management, yet non-invasive methods remain a largely unmet medical need. Methods: We applied a novel platform for characterizing peripheral blood glycoproteomic biomarkers, combining liquid-chromatography/mass spectrometry (LC-MS) with artificial intelligence/neural networks (AI-NN) for the targeted quantification of serum protein glycosylation at intact glycopeptide level to analyze serum samples from 296 treatment-naïve women with histopathology-confirmed diagnosis of either benign (n = 151) or malignant (n = 145) tumors, and from 55 healthy control subjects, procured from a commercial biobank. Using data-dependent acquisition, a panel of 683 serum glycosylated and non-glycosylated peptides, representing 71 serum proteins, was interrogated. Samples were randomly divided into a training and a testing set for multivariable analysis. Data processing was performed using PB-Net, an in-house-developed high-throughput peak integration software. Raw data were normalized, processed by statistical analysis, and applied to machine learning models. Results: Comparison of glycopeptide abundances among patients with OC and benign pelvic tumors yielded 428 statistically significantly differentially expressed glycopeptides/peptides (at FDR < 0.05). A subpanel of these markers used to generate a score for predicting OC yielded areas under the receiver-operating-characteristic of 0.955 and 0.894 in the training and testing sets, respectively. The predicted probability of malignancy increased with cancer stage, and probability distributions were similar across training and test sets. Applying the model to healthy subjects, not utilized in training, resulted in few misclassifications and a spread nearly equivalent to that of the benign tumor cases. This indicates the signature robustly predicts malignancy and severity of disease. Conclusions: Our novel approach exhibited impressive levels of accuracy for the noninvasive differentiation of benign and malignant pelvic masses, compared with existing biomarkers and algorithms, thereby demonstrating the utility of glycoprotein profiles as a powerful, noninvasive new diagnostic modality.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.