The theoretical calculation of pKa values for Brønsted acids is a challenging task that involves sophisticated and time-consuming methods. Therefore, heuristic approaches are efficient and appealing methodologies to approximate these values. Herein, we used the maximum surface electrostatic potential (VS,max) on the acidic hydrogen atoms of carboxylic acids to describe the H-bond interaction with water (the same descriptor that is used to characterize σ-bonded complexes) and correlate the results with experimental pKa values to obtain a predictive model for other carboxylic acids. We benchmarked six different methods, all including an implicit solvation model (water): Five density functionals and the Møller–Plesset second order perturbation theory in combination with six different basis sets for a total of thirty-six levels of theory. The ωB97X-D/cc-pVDZ level of theory stood out as the best one for consistently reproducing the reported pKa values, with a predictive power of 98% correlation in a test set of ten other carboxylic acids.
Theoretical calculation of equilibrium
dissociation constants is a very computationally demanding and time-consuming
process since it requires an extremely accurate computation of the
solvation free energy changes for each of the species involved. By
correlating the minimum surface electrostatic potential (V
S,min) on the nitrogen atom of several aliphatic amino
groupscalculated at the density functional theory (DFT) ωB97X-D/cc-pVDZ
level of theorywe obtained regression models for each kind
of substitution pattern from which we interpolate their corresponding
pK
b values with remarkable accuracy: primary R
2 = 0.9519; secondary R
2 = 0.9112; and tertiary R
2 = 0.8172
(N = 20 for each family). These models were validated
with tests sets (N = 5) with mean absolute error
(MAE) values of 0.1213 (primary), 0.4407 (secondary), and 0.3057 (tertiary).
Combining this ansatz with another previously reported by our group
to estimate pK
a values [Caballero-García,
G.; et al. Molecules 2019, 24(1), 79] we are able
to reproduce the isoelectric points of 13 amino acids with no titrable
side chains with MAE = 0.4636 pI units.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.