Roadmap to Pharmaceutically Relevant Reactivity Models Leveraging High-Throughput Experimentation

Xu, Jessica; Kalyani, Dipannita; Struble, Thomas J.; Dreher, Spencer D.; Krska, Shane W.; Buchwald, Stephen L.; Jensen, Klavs F.

doi:10.26434/chemrxiv-2022-x694w

Cited by 17 publications

(14 citation statements)

References 41 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Therefore, it seems that labs that aim to integrate ML models need to gather their own data sets that are created with the required chemical diversity in mind. Recent reports by Xu et al 48 and Rinehart et al 49 also reinforce the viability of this route. The chemical and pharmaceutical industries would greatly benefit from pre-competitive collaboration where high quality and high-diversity reaction data are shared in an intellectual-property-preserving manner, such as federated learning.…”

Section: Discussionmentioning

confidence: 73%

Machine Learning C–N Couplings: Obstacles for a General-Purpose Reaction Yield Prediction

et al. 2023

View full text Add to dashboard Cite

Pd-catalyzed C−N couplings are commonplace in academia and industry. Despite their significance, finding suitable reaction conditions leading to a high yield, for instance, remains a challenging and time-consuming task which usually requires screening over many sets of conditions. To help select promising reaction conditions in the vast space of reagent combinations, machine learning is an emerging technique with a lot of promise. In this work, we assess whether the reaction yield of C−N couplings can be predicted from databases of chemical reactions. We test the generalizability of models both on challenging data splits and on a dedicated experimental test set. We find that, provided the chemical space represented by the training set is not left, the models perform well. However, the applicability domain is quickly left even for simple reactions of the same type, as, for instance, present in our plate test set. The results show that yield prediction for new reactions is possible from the algorithmic side but in practice is hindered by the available data. Most importantly, more data that cover the diversity in reagents are needed for a general-purpose prediction of reaction yields. Our findings also expose a challenge to this field in that it appears to be extremely deceiving to judge models based on literature data with test sets which are split off the same literature data, even when challenging splits are considered.

show abstract

Section: Discussionmentioning

confidence: 73%

Machine Learning C–N Couplings: Obstacles for a General-Purpose Reaction Yield Prediction

et al. 2023

View full text Add to dashboard Cite

show abstract

“…Following an examination of atypical bases for C−N cross-coupling with similar pK a H values, 27 we determined that a commercially available base, NaOTMS (pK a H = 11), 23 was optimal (entry 1). 28 While silanolate bases have rarely been utilized in C−N cross-coupling reactions, 29,30 our results suggest their wider adoption could increase yields for substrates bearing base-sensitive functional groups.…”

mentioning

confidence: 87%

Pd-Catalyzed Amination of Base-Sensitive Five-Membered Heteroaryl Halides with Aliphatic Amines

et al. 2023

Self Cite

View full text Add to dashboard Cite

We report a versatile and functional-group-tolerant method for the Pd-catalyzed C–N cross-coupling of five-membered heteroaryl halides with primary and secondary amines, an important but underexplored transformation. Coupling reactions of challenging, pharmaceutically relevant heteroarenes, such as 2-H-1,3-azoles, are reported in good-to-excellent yields. High-yielding coupling reactions of a wide set of five-membered heteroaryl halides with sterically demanding α-branched cyclic amines and acyclic secondary amines are reported for the first time. The key to the broad applicability of this method is the synergistic combination of (1) the moderate-strength base NaOTMS, which limits base-mediated decomposition of sensitive five-membered heteroarenes that ultimately leads to catalyst deactivation, and (2) the use of a GPhos-supported Pd catalyst, which effectively resists heteroarene-induced catalyst deactivation while promoting efficient coupling, even for challenging and sterically demanding amines. Cross-coupling reactions between a wide variety of five-membered heteroaryl halides and amines are demonstrated, including eight examples involving densely functionalized medicinal chemistry building blocks.

show abstract

“…If successful, this research could be transformative in reaction optimization and transition the field to more direct predictions of optimal reaction conditions. For a recent review of in-depth modeling of HTE data sets, also refer to Jensen and co-workers …”

Section: Data-driven Optimizationmentioning

confidence: 99%

A Brief Introduction to Chemical Reaction Optimization

et al. 2023

View full text Add to dashboard Cite

From the start of a synthetic chemist’s training, experiments are conducted based on recipes from textbooks and manuscripts that achieve clean reaction outcomes, allowing the scientist to develop practical skills and some chemical intuition. This procedure is often kept long into a researcher’s career, as new recipes are developed based on similar reaction protocols, and intuition-guided deviations are conducted through learning from failed experiments. However, when attempting to understand chemical systems of interest, it has been shown that model-based, algorithm-based, and miniaturized high-throughput techniques outperform human chemical intuition and achieve reaction optimization in a much more time- and material-efficient manner; this is covered in detail in this paper. As many synthetic chemists are not exposed to these techniques in undergraduate teaching, this leads to a disproportionate number of scientists that wish to optimize their reactions but are unable to use these methodologies or are simply unaware of their existence. This review highlights the basics, and the cutting-edge, of modern chemical reaction optimization as well as its relation to process scale-up and can thereby serve as a reference for inspired scientists for each of these techniques, detailing several of their respective applications.

show abstract

Roadmap to Pharmaceutically Relevant Reactivity Models Leveraging High-Throughput Experimentation

Cited by 17 publications

References 41 publications

Machine Learning C–N Couplings: Obstacles for a General-Purpose Reaction Yield Prediction

Machine Learning C–N Couplings: Obstacles for a General-Purpose Reaction Yield Prediction

Pd-Catalyzed Amination of Base-Sensitive Five-Membered Heteroaryl Halides with Aliphatic Amines

A Brief Introduction to Chemical Reaction Optimization

Contact Info

Product

Resources

About