Two molecular similarity methods have been used to select nearest neighbors from four different sets of chemicals. One of the methods is based on the Euclidean distance of chemicals in the ten dimensional principal components space derived from 97 graph invariants. The second approach is based on the count of atom pairs common to a pair of molecules. Two probe chemicals were selected, and neighbors of each were determined by the two methods for the following four sets of molecules: (a) a combined set of octane and nonane isomers, (b) a relatively more diverse set of 382 chemicals, (c) a diverse set of 3692 chemicals, and (d) the STARLIST data base of log P consisting of 4067 structures. The results show that the measures reflect an intuitive notion of chemical similarity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.