A QSAR model for predicting the blood brain barrier permeability (BBBP) in a large and heterogeneous variety of compounds (136 compounds) has been developed using approximate similarity (AS) matrices as predictors and PLS as multivariate regression technique. AS values fuse information of both the isomorphic similarity and nonisomorphic dissimilarity with the purpose of achieving an accurate predictive space. In addition to the fact of applying AS values to heterogeneous data sets, a new concept on graph isomorphism based on the extended maximum common subgraph (EMCS) is defined for the building of AS spaces considering the atoms and bonds, which are bridges between the isomorphic and nonisomorphic substructures. This new isomorphism detection has as objective to take into account the position and nature of the nucleus substituents, thus allowing the development of accurate models for large and diverse sets of compounds. After an outliers study, the training and test stages were made and the results obtained using several AS approaches were compared. Several validation processes were carried out by means of employing several test sets, and high predictive ability was obtained for all the cases (Q(2) = 0.81 and standard error in prediction, SEP = 0.29).
ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 200 leading journals. To access a ChemInform Abstract, please click on HTML or PDF.
A new QSAR method based on approximate similarity measurements is described in this paper. Approximate similarity is calculated using both the classical similarity based on the graph isomorphism and a distance computation between nonisomorphic subgraphs. The latter is carried out through a parametric function where different topological invariants can be considered. After optimizing the contribution of nonisomorphic distance to the new graph similarity, predictive models built with approximate similarity matrixes show higher predictive ability than those using traditional similarity matrixes. The new method has been applied to the prediction of steroids binding to the corticosteroid globulin receptor. The proposed model allows us to obtain valuable external predictions (r=0.82 and SEP=0.30) after training the model by cross-validation (Q2=0.84 and SECV=0.47). Slope and bias parameters are also given.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.