2021
DOI: 10.1021/jacsau.1c00449
|View full text |Cite
|
Sign up to set email alerts
|

Accurate Machine Learning Prediction of Protein Circular Dichroism Spectra with Embedded Density Descriptors

Abstract: A data-driven approach to simulate circular dichroism (CD) spectra is appealing for fast protein secondary structure determination, yet the challenge of predicting electric and magnetic transition dipole moments poses a substantial barrier for the goal. To address this problem, we designed a new machine learning (ML) protocol in which ordinary pure geometry-based descriptors are replaced with alternative embedded density descriptors and electric and magnetic transition dipole moments are successfully predicted… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
8

Relationship

2
6

Authors

Journals

citations
Cited by 25 publications
(18 citation statements)
references
References 47 publications
0
18
0
Order By: Relevance
“…A wide range of ML methodologies have been proposed to improve computational accuracy, cost, and speed in IR, Raman, VCD, and ECD spectra (Figure ). Similar advances must be possible for TCD spectroscopy, too. Although there are no (to the best of our knowledge) ML-based methodologies developed and published for TCD yet, special features of this spectroscopy and its implementation for multidimensional analysis of the biological and abiological structures discussed above make this spectroscopy particularly suitable for the application of ML.…”
Section: Future Directionsmentioning
confidence: 99%
See 1 more Smart Citation
“…A wide range of ML methodologies have been proposed to improve computational accuracy, cost, and speed in IR, Raman, VCD, and ECD spectra (Figure ). Similar advances must be possible for TCD spectroscopy, too. Although there are no (to the best of our knowledge) ML-based methodologies developed and published for TCD yet, special features of this spectroscopy and its implementation for multidimensional analysis of the biological and abiological structures discussed above make this spectroscopy particularly suitable for the application of ML.…”
Section: Future Directionsmentioning
confidence: 99%
“…(c) ML-based simulation of ECD spectra of peptides predicted based on 1000 MD configurations. Reprinted with permission from ref . Copyright 2021 Americal Chemical Society.…”
Section: Future Directionsmentioning
confidence: 99%
“…Carbon-based nanomaterials (e.g., carbon nanotubes, graphene, carbon quantum dots, and nanodiamonds), metals (e.g., gold and silver nanoparticles), and metal oxide nanoparticles (e.g., SiO 2 , ZnO, CeO 2 , and TiO 2 nanoparticles) exhibit different corona features driven by corresponding distinct nanoparticle–protein interactions. Specifically, the binding of pristine carbon nanotubes (CNTs) with blood proteins, including bovine fibrinogen, gamma globulin (Ig), Tf, and BSA, showed a positive correlation with the number of aromatic residues, suggesting the binding was governed by π–π stacking between protein aromatic residues (Trp, Phe, and Tyr) and carbon nanotubes in addition to hydrophobic interactions. , The proteins formed well-ordered rodlike structures on the CNT surfaces, with the amphiphilic α-helices possessing a high fraction of hydrophobic residues converted into coil or β-sheet structures. For graphene oxide, the oxygen-containing functional groups (e.g., hydroxyls, epoxides, and carboxyls) enabled the additional contributions of electrostatic attraction and hydrogen bonding to stabilize the protein corona and, consequently, denatured the ordered protein structures into unstructured random coils .…”
Section: Biophysical Signatures Of the Protein Coronamentioning
confidence: 99%
“…Recent years have witnessed a fast development of the usage of spectral information in chemical machine learning. The recognition of protein structures by machine learning methods using spectral information [32] as well as spectrum prediction from protein structure [33,34] have been developed and applied to protein molecular dynamics [35] and real‐time interaction simulations [36] . There are increasing attempts to apply data‐driven ML methods to automate the linking of the spectrum of molecules to structures [37–39] .…”
Section: Introductionmentioning
confidence: 99%