Reanalysis of DIA Data Demonstrates the Capabilities of MS/MS-Free Proteomics to Reveal New Biological Insights in Disease-Related Samples

Ivanov, Mark V.; Kopeykina, Anna S.; Gorshkov, Mikhail V.

doi:10.1021/jasms.4c00134

J. Am. Soc. Mass Spectrom.

2024

DOI: 10.1021/jasms.4c00134

|View full text |Cite

Reanalysis of DIA Data Demonstrates the Capabilities of MS/MS-Free Proteomics to Reveal New Biological Insights in Disease-Related Samples

Mark V. Ivanov,

Anna S. Kopeykina,

Mikhail V. Gorshkov

Abstract: Data-independent acquisition (DIA) at the shortened data acquisition time is becoming a method of choice for quantitative proteomic applications requiring high throughput analysis of large cohorts of samples. With the advent of the combination of high resolution mass spectrometry with an asymmetric track lossless analyzer, these DIA capabilities were further extended with the recent demonstration of quantitative analyses at the speed of up to hundreds of samples per day. In particular, the proteomic data for t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2024

Publication Types

Select...

Preprint1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

References 68 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

A modified decision tree improves generalization across multiple brains proteomic data sets and reveals the role of ferroptosis in Alzheimer’s disease

Ivanov,

Kopeykina,

Kazakova

et al. 2024

Preprint

View full text Add to dashboard Cite

Low generalization to the patient cohort and variety of experimental conditions in the proteomic search for disease biomarkers are among the main reasons for the bumpy road of quantitative proteomics from discovery stage to clinical validation. Only a small fraction of biomarkers discovered so far by proteomic analysis reaches clinical trials. Here, we presented a machine learning-based workflow for proteomics data analysis, which partially solves some of these issues. In particular, we used a customized decision tree model, which was regulated using a newly introduced parameter, min_cohorts_leaf, that resulted in better generalization of trained models. Further, we analyzed the trend of feature importance’s curve as a function of min_cohorts_leaf parameter and found that it could be used for accurate feature selection to obtain a list of proteins with significantly improved generalization. Finally, we demonstrated that the recently introduced DirectMS1 search algorithm for protein identification and quantitation provides a simple, yet, a highly efficient solution for the problem of combining multiple data sets obtained using different experimental settings. The developed workflow was tested using five published LC-MS/MS data sets obtained in the large consortia studies of Alzheimer’s disease brain samples. The selected data sets consist of 535 files in total analyzed using label-free single-shot data-dependent or data-independent acquisitions. Using the proposed modified ExtraTrees model we found that the expressions of two proteins involved in ferroptosis Serotransferrin TRFE and DNA repair nuclease/redox regulator APEX1, are important for explaining a lack of dementia for patients with the presence of neuritic plaques and neurofibrillary tangles.

show abstract

A modified decision tree improves generalization across multiple brains proteomic data sets and reveals the role of ferroptosis in Alzheimer’s disease

Ivanov,

Kopeykina,

Kazakova

et al. 2024

Preprint

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Reanalysis of DIA Data Demonstrates the Capabilities of MS/MS-Free Proteomics to Reveal New Biological Insights in Disease-Related Samples

Cited by 1 publication

References 68 publications

A modified decision tree improves generalization across multiple brains proteomic data sets and reveals the role of ferroptosis in Alzheimer’s disease

A modified decision tree improves generalization across multiple brains proteomic data sets and reveals the role of ferroptosis in Alzheimer’s disease

Contact Info

Product

Resources

About