Synthesis yields of organic reactions are one of the most important factors in ranking synthesis routes created by synthesis route design systems such as Transform-Oriented Synthesis Planning and Knowledge base-Oriented Synthesis Planning. If it is possible to predict the yields of synthesis reactions before starting experiments, one can easily determine an order of synthesis routes for experimental works. In the present study, the reaction profiles of the Curtius rearrangement with different substituents were calculated to generate an equation predicting experimental yields of this reaction. Reactions followed by the formation of isocyanates were also analyzed to consider the relationship between reaction times and experimental yields. A partial least squares (PLS) regression was used to correlate the experimental yields with the calculated activation energies, E a (calc), together with experimental conditions such as dielectric constants of solvents, reaction times, and reaction temperatures as explanatory variables. Although the PLS regression using all the data gave very poor results, we succeeded in making a model equation with R 2 = 0.887 using a modified data set. However, there is a conflict between the predictability and the interpretability on the reaction time. This discrepancy mainly comes from unnecessarily long reaction times in the experiments for azides with calculated E a values of less than 33 kcal mol -1 . To construct a good model equation for the experimental yields of the Curtius reaction, we have to use data sets obtained from within 90 min of the reaction for the PLS regression.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.