A widely practiced approach to structure elucidation is based on the chemical and/or the spectral properties of the compound of unknown structure. As the power and sophistication of spectrometers improved, spectral data assumed a more central role. With increasing amounts of such data to process, it was natural to look to the computer to enhance productivity. This paper traces the development of computer-based tools for structure elucidation.
A problem common to computer programs for structure elucidation is the efficient and prospective use of the input information to constrain the structure generation process. The input may consist of potentially overlapping substructure requirements and alternative substructure interpretations of spectral data. Other useful information may be structural features that must not be present in the output structures. All of these may interact in a complex manner that is impossible to determine by use of a bond-by-bond structure assembly algorithm. A new method is described called structure reduction. In contrast to structure assembly, this method begins with a set of all bonds and removes inconsistent bonds as structure generation progresses. This results in a more efficient use of the input information and the ability to use potentially overlapping required substructures. Several examples illustrate the application of our computer program COCOA, which uses this method to solve real-world structure elucidation problems.
The simple linear neural network model was investigated as a method for automated interpretation of infrared spectra. The model was trained using a database of infrared spectra of organic compounds of known structure. The model was able to learn, without any prior input of spectrum-structure correlations, to recognize and identify 76 functional groupings with accuracies ranging from fair to excellent. The effect of network input parameters and of training set composition were studied, and several sources of spurious correlations were identified and corrected.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.