Emergent synthetic methods for the modular advancement of sp³-rich fragments

Caplin, Max J.; Foley, Daniel J.

doi:10.1039/d1sc00161b

Cited by 74 publications

(30 citation statements)

References 86 publications

“…Therefore, multicollinearity analysis will validate the presence of the larger substructure (containing CCCH and CCCCCH fragments) suggested in this study's results [47]. Should it exist, in-vitro experimentation can be performed to determine how the substructure affects ML performance in predicting binding affinity, revealing important information on the usefulness of such substructures in VS [49]. In addition, the protein-ligand models used in this study came from a single dataset, which introduces dataset bias and may affect the results of feature analysis.…”

Section: G Machine Learning Benchmarking (mentioning)

confidence: 58%

Unsupervised Machine Learning Approach for Identifying Biomechanical Influences on Protein-Ligand Binding Affinity

Singh

2021

IJACSA

Drug discovery is incredibly time-consuming and expensive, averaging over 10 years and $985 million per drug. Calculating the binding affinity between a target protein and a ligand through Virtual Screening is critical for discovering viable drugs. Although supervised machine learning (ML) can predict binding affinity accurately, models experience severe overfitting due to an inability to identify informative properties of protein-ligand complexes. This study used unsupervised ML to reveal underlying protein-ligand characteristics that strongly influence binding affinity. Protein-ligand 3D models were collected from the PDBBind database and vectorized into 2422 features per complex. Principal Component Analysis (PCA), t-Distributed Stochastic Neighbor Embedding (t-SNE), K-Means Clustering, and heatmaps were used to identify groups of complexes and the features responsible for the separation. ML benchmarking was used to determine the features' effect on ML performance. The PCA heatmap revealed groups of complexes with binding affinity of pKd<6 and pKd>8 and identified the number of CCCH and CCCCCH fragments in the ligand as the most responsible features. A high correlation of 0.8337, their ability to explain 18% of the binding affinity's variance, and an error increase of 0.09 in Decision Trees when trained without the two features suggest that the fragments exist within a larger ligand substructure that significantly influences binding affinity. This discovery is a baseline for informative ligand representations to be generated so that ML models overfit less and can more reliably identify novel drug candidates. Future work will focus on validating the ligand substructure's presence and discovering more informative intra-ligand relationships.
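The abstract above reports a correlation of 0.8337 in connection with the CCCH and CCCCCH fragment counts. As a minimal sketch of how a correlation between two such feature columns could be computed, the Python below implements the Pearson coefficient by hand. The function name `pearson` and the fragment counts are hypothetical and chosen only for illustration; they are not PDBBind data, and this is not the cited study's code.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations (unnormalised covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalised variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-ligand fragment counts (made-up values, not PDBBind data).
ccch_counts = [2, 4, 5, 7, 9, 12]
ccccch_counts = [1, 2, 2, 4, 5, 6]

r = pearson(ccch_counts, ccccch_counts)
print(f"Pearson r between the two fragment counts: {r:.4f}")
```

A coefficient close to 1 for two features is the kind of signal that would motivate the multicollinearity analysis mentioned in the citation statement, since strongly correlated features may be reporting on the same underlying substructure.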

“…[34] Pd(OAc)₂ (10 mol%) and Ag₂CO₃ in super-stoichiometric amount was found to be the best catalyst and oxidant combination, along with (n-BuO)₂PO₂H as additive in DCE at 120 °C for 24 h. This method was successfully applied to a wide range of olefins, such as terminal alkyl olefins, allyl alcohol derivatives, acrylaldehyde, allyl acetate, acrylates, and acrylonitrile (1–7). The scope of the reaction includes phenylpropylamine derivatives bearing different substitution patterns in the aromatic ring (8–16). In all cases studied, the methylene γ-C–H bonds, whose activation would involve five-membered cyclopalladation, remained unaltered.…”

Section: Formation of C–C Bonds (mentioning)

confidence: 99%

“…[1–10] Despite the extensive progress made, achieving high site selectivity control is still a prominent goal in the field. [11–16] A successful approach towards this goal is the use of directing groups (DGs) to assist the C-H metalation step. A DG is a Lewis basic entity that acts as a ligand for the metal and brings the active catalytic species into close proximity to the desired C-H bond.…”

Section: Introduction (mentioning)

confidence: 99%

Remote ortho-C–H functionalization via medium-sized cyclopalladation

et al. 2022

“…When these facts are considered it could be reasoned that the disproportionate occurrence of bonds formed with sp³-character [47] in relation to sp²-character could be attributed to synthetic challenges that their inclusion presents. A recent Perspective by Caplin and Foley [48] further emphasises the challenges associated with sp³-rich fragments as well as highlighting recent advances in C(sp³)–H functionalisation which are beginning to have an impact in this area.…”

Section: Outcome of the Analysis (mentioning)

confidence: 99%

C–H functionalisation tolerant to polar groups could transform fragment-based drug discovery (FBDD)

Chessari, Grainger, Holvey et al. 2021

Chem. Sci.

Emergent synthetic methods for the modular advancement of sp³-rich fragments

Abstract: Fragment-based drug discovery is an important and increasingly reliable technology for the delivery of clinical candidates. Notably, however, sp³-rich fragments are a largely untapped resource in molecular discovery, in part...
