Nanodiamonds have a wide range of applications including
catalysis,
sensing, tribology, and biomedicine. To leverage nanodiamond design
via machine learning, we introduce the new data set ND5k, consisting
of 5089 diamondoid and nanodiamond structures and their frontier orbital
energies. ND5k structures are optimized via tight-binding density
functional theory (DFTB) and their frontier orbital energies are computed
using density functional theory (DFT) with the PBE0 hybrid functional.
From this data set we derive a qualitative design suggestion for nanodiamonds
in photocatalysis. We also compare recent machine learning models
for predicting frontier orbital energies for similar structures as
they have been trained on (interpolation on ND5k), and we test their
abilities to extrapolate predictions to larger structures. For both
the interpolation and extrapolation task, we find the best performance
using the equivariant message passing neural network PaiNN. The second
best results are achieved with a message passing neural network using
a tailored set of atomic descriptors proposed here.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.