A Comprehensive Discovery Platform for Organophosphorus Ligands for Catalysis

Gensch, Tobias; Gomes, Gabriel; Friederich, Pascal; Peters, E.‐M.; Gaudin, Théophile; Pollice, Robert; Jorner, Kjell; Nigam, Anukriti; M, Lindner D'Addario; Ms, Sigman; Aspuru‐Guzik, Alán

doi:10.26434/chemrxiv.12996665.v1

Cited by 9 publications

(7 citation statements)

References 31 publications

(33 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This mindset enables the identification of subtle trends within the data, even when a particular result may be unexpected from a chemical intuition standpoint, which can guide further screening, hypothesis development, and future optimization campaigns. The expansion and distribution of databases of physical organic features will help to increase the accessibility of the data science workflow to chemists in a variety of fields. , It should be noted that the incorporation of data science principles to project design goes hand-in-hand with modern advances in automation that streamline the data collection process. , …”

Section: Discussionmentioning

confidence: 99%

Data Science Meets Physical Organic Chemistry

et al. 2021

Self Cite

View full text Add to dashboard Cite

Metrics & MoreArticle Recommendations CONSPECTUS: At the heart of synthetic chemistry is the holy grail of predictable catalyst design. In particular, researchers involved in reaction development in asymmetric catalysis have pursued a variety of strategies toward this goal. This is driven by both the pragmatic need to achieve high selectivities and the inability to readily identify why a certain catalyst is effective for a given reaction.While empiricism and intuition have dominated the field of asymmetric catalysis since its inception, enantioselectivity offers a mechanistically rich platform to interrogate catalyst-structure response patterns that explain the performance of a particular catalyst or substrate.In the early stages of an asymmetric reaction development campaign, the overarching mechanism of the reaction, catalyst speciation, the turnover limiting step, and many other details are unknown or posited based on related reactions. Considering the unclear details leading to a successful reaction, initial enantioselectivity data are often used to intuitively guide the ultimate direction of optimization. However, if the conditions of the Curtin−Hammett principle are satisfied, then measured enantioselectivity can be directly connected to the ensemble of diastereomeric transition states (TSs) that lead to the enantiomeric products, and the associated free energy difference between competing TSs (ΔΔG ⧧ = −RT ln[(S)/(R)], where (S) and (R) represent the concentrations of the enantiomeric products). We, and others, speculated that this important piece of information can be leveraged to guide reaction optimization in a quantitative way.Although traditional linear free energy relationships (LFERs), such as Hammett plots, have been used to illuminate important mechanistic features, we sought to develop data science derived tools to expand the power of LFERs in order to describe complex reactions frequently encountered in modern asymmetric catalysis. Specifically, we investigated whether enantioselectivity data from a reaction can be quantitatively connected to the attributes of reaction components, such as catalyst and substrate structural features, to harness data for asymmetric catalyst design.In this context, we developed a workflow to relate computationally derived features of reaction components to enantioselectivity using data science tools. The mathematical representation of molecules can incorporate many aspects of a transformation, such as molecular features from substrate, product, catalyst, and proposed transition states. Statistical models relating these features to reaction outputs can be used for various tasks, such as performance prediction of untested molecules. Perhaps most importantly, statistical models can guide the generation of mechanistic hypotheses that are embedded within complex patterns of reaction continued...

show abstract

Section: Discussionmentioning

confidence: 99%

Data Science Meets Physical Organic Chemistry

et al. 2021

Self Cite

View full text Add to dashboard Cite

show abstract

“…SI-5 through SI-10, Supplementary Data 1 through 2, and Supplementary Movie 1 through 2. The computed molecular features utilized in this study have been made publicly available at https:// kraken.cs.toronto.edu/dashboard 45 .…”

Section: Data Availabilitymentioning

confidence: 99%

“…We employed a Chemspeed SWING robotic system for the experimental execution of parallel reaction loops in batch and employed the Phoenics and Gryffin algorithms for the proposal of parallel combinations continuous and categorical process parameter selections. Recognizing the impact of phosphine selection on the optimization outcome, we employed a variety of categorical parameter selection strategies, including chemical intuition and computed molecular descriptor clustering of 365 commercially available phosphines 45 . Here, we discuss the advantages and limitations of each phosphine selection strategy and their impacts on this challenging optimization problem.…”

mentioning

confidence: 99%

Data-science driven autonomous process optimization

et al. 2021

Self Cite

View full text Add to dashboard Cite

Autonomous process optimization involves the human intervention-free exploration of a range process parameters to improve responses such as product yield and selectivity. Utilizing off-the-shelf components, we develop a closed-loop system for carrying out parallel autonomous process optimization experiments in batch. Upon implementation of our system in the optimization of a stereoselective Suzuki-Miyaura coupling, we find that the definition of a set of meaningful, broad, and unbiased process parameters is the most critical aspect of successful optimization. Importantly, we discern that phosphine ligand, a categorical parameter, is vital to determination of the reaction outcome. To date, categorical parameter selection has relied on chemical intuition, potentially introducing bias into the experimental design. In seeking a systematic method for selecting a diverse set of phosphine ligands, we develop a strategy that leverages computed molecular feature clustering. The resulting optimization uncovers conditions to selectively access the desired product isomer in high yield.

show abstract

“…All too often these efforts fail, impeding access to potentially promising new medicines and materials. Emerging approaches in reactivity prediction that combine highthroughput experimentation [5][6][7][8] with molecular descriptor sets [9][10][11] and multivariate statistical analysis including machine learning [12][13][14][15][16] can accelerate this process and increase success rates; however, the predictions generated by these approaches are often limited to the specific reaction under investigation (Figure 1A). Developing and refining the next generation of organic chemistry tools, including computer-aided synthesis design, automated reaction optimization, and predictive algorithms, 17 requires the development of general and quantitative frameworks linking molecular structure to reactivity for many different reactants and catalysts.…”

mentioning

confidence: 99%

A reactivity map for oxidative addition enables quantitative predictions for multiple catalytic reaction classes

Donnecke

Paci

Leitch

2021

Preprint

View full text Add to dashboard Cite

Making accurate, quantitative predictions of chemical reactivity based on molecular structure is an unsolved problem in chemical synthesis, particularly for complex molecules. We report a generally applicable, mechanistically based structure-reactivity approach to create a quantitative model for the oxidative addition of (hetero)aryl electrophiles to palladium(0), which is a key step in myriad catalytic processes. This model links simple molecular descriptors to relative rates of oxidative addition for 79 substrates, including chloride, bromide and triflate leaving groups. Because oxidative addition often controls the rate and/or selectivity of palladium-catalyzed reactions, this model can be used to make quantitative predictions about catalytic reaction outcomes. Demonstrated applications include a multivariate linear model for the initial rate of Sonogashira coupling reactions, and successful site-selectivity predictions for a series of multihalogenated substrates relevant to the synthesis of pharmaceuticals and natural products.

show abstract

A Comprehensive Discovery Platform for Organophosphorus Ligands for Catalysis

Cited by 9 publications

References 31 publications

Data Science Meets Physical Organic Chemistry

Data Science Meets Physical Organic Chemistry

Data-science driven autonomous process optimization

A reactivity map for oxidative addition enables quantitative predictions for multiple catalytic reaction classes

Contact Info

Product

Resources

About