Detailed chemical kinetic models offer valuable mechanistic
insights
into industrial applications. Automatic generation of reliable kinetic
models requires fast and accurate radical thermochemistry estimation.
Kineticists often prefer hydrogen bond increment (HBI) corrections
from a closed-shell molecule to the corresponding radical for their
interpretability, physical meaning, and facilitation of error cancellation
as a relative quantity. Tree estimators, used due to limited data,
currently rely on expert knowledge and manual construction, posing
challenges in maintenance and improvement. In this work, we extend
the subgraph isomorphic decision tree (SIDT) algorithm originally
developed for rate estimation to estimate HBI corrections. We introduce
a physics-aware splitting criterion, explore a bounded weighted uncertainty
estimation method, and evaluate aleatoric uncertainty-based and model
variance reduction-based prepruning methods. Moreover, we compile
a data set of thermochemical parameters for 2210 radicals involving
C, O, N, and H based on quantum chemical calculations from recently
published works. We leverage the collected data set to train the SIDT
model. Compared to existing empirical tree estimators, the SIDT model
(1) offers an automatic approach to generating and extending the tree
estimator for thermochemistry, (2) has better accuracy and R
2, (3) provides significantly more realistic
uncertainty estimates, and (4) has a tree structure much more advantageous
in descent speed. Overall, the SIDT estimator marks a great leap in
kinetic modeling, offering more precise, reliable, and scalable predictions
for radical thermochemistry.