2022
DOI: 10.1039/d2dd00021k
|View full text |Cite
|
Sign up to set email alerts
|

3D chemical structures allow robust deep learning models for retention time prediction

Abstract: Chromatographic retention time (RT) is a powerful characteristic used to identify, separate, or rank molecules in a mixture. With accumulated RT data, it becomes possible to develop deep learning approaches...

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 43 publications
(90 reference statements)
0
4
0
Order By: Relevance
“…The prepared database was used for construction of 11 ML models after the generation of fingerprint descriptors using RDKit AllChem software. The RDKit AllChem software was chosen due to it being a freely available tool with the ability to generate a broad range of molecular descriptors, including structural, topological, electronic, and thermodynamic properties of molecules. , The molecular descriptors used in this study included molecular weight, number of rotatable bonds, topological polar surface area (TPSA), and Morgan fingerprints with radii of 2 and 1024 bits. K -fold cross-validation was used as the primary validation method with k set to 5.…”
Section: Resultsmentioning
confidence: 99%
“…The prepared database was used for construction of 11 ML models after the generation of fingerprint descriptors using RDKit AllChem software. The RDKit AllChem software was chosen due to it being a freely available tool with the ability to generate a broad range of molecular descriptors, including structural, topological, electronic, and thermodynamic properties of molecules. , The molecular descriptors used in this study included molecular weight, number of rotatable bonds, topological polar surface area (TPSA), and Morgan fingerprints with radii of 2 and 1024 bits. K -fold cross-validation was used as the primary validation method with k set to 5.…”
Section: Resultsmentioning
confidence: 99%
“…Recently, small molecule retention time (SMRT) dataset ( Domingo-Almenara et al 2019 ) containing 80 038 molecules was released to the public, stimulating deep learning-based retention time prediction methods, such as DLM ( Domingo-Almenara et al 2019 ), DNNpwa ( Ju et al 2021 ), and 1D-CNN ( Fedorova et al 2022 ). More deep learning-based methods, such as GNN-RT ( Yang et al 2021 ), CPORT ( Zaretckii et al 2022 ), MPNN ( Osipenko et al 2022 ), and Blender ( García et al 2022 ), apply transfer learning ( Weiss et al 2016 ) to predict the retention times in specific chromatographic separation systems. These methods alleviate the limitation of small training data by pre-training the neural networks on SMRT and further reusing some parameters in the pre-trained networks.…”
Section: Introductionmentioning
confidence: 99%
“…The representation of molecules in 3D grids of voxels encoding their atoms has been used to train deep learning neural networks. 18,19,20,21 Electrostatic potentials generated by point charges localized at the atom centers -another way of representing the 3D structure of molecules -has long since been used to establish relationships with molecular properties, namely in 3D-QSAR, 22 or to assess molecular similarity. 23 Direct 3D representations can be used for data sets of pre-aligned structures (e.g.…”
Section: Introductionmentioning
confidence: 99%