Rigorous Incorporation of Tautomers, Ionization Species, and Different Binding Modes into Ligand-Based and Receptor-Based 3D-QSAR Methods

Natesan, Senthil; Baláž, Štefan

doi:10.2174/1381612811319230013

Cited by 2 publications

(1 citation statement)

References 71 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In his studies on the Yamazaki dataset, Gleeson pointed out that PPB is closely related to both the ionization state and the liphophilicity of a molecule ( 3 ). Dealing with different representations of molecules (i.e., ionization states and tautomerism) is often a mandatory process especially when using ligand-receptor based models ( 47 , 48 ). Different SMILES representations of the same molecule lead to different descriptor values ( 49 ).…”

Section: Discussionmentioning

confidence: 99%

QSAR Development for Plasma Protein Binding: Influence of the Ionization State

et al. 2018

View full text Add to dashboard Cite

PurposeThis study explored several strategies to improve the performance of literature QSAR models for plasma protein binding (PPB), such as a suitable endpoint transformation, a correct representation of chemicals, more consistency in the dataset, and a reliable definition of the applicability domain.MethodsWe retrieved human fraction unbound (Fu) data for 670 compounds from the literature and carefully checked them for consistency. Descriptors were calculated taking account of the ionization state of molecules at physiological pH (7.4), in order to better estimate the affinity of molecules to blood proteins. We used different algorithms and chemical descriptors to explore the most suitable strategy for modeling the endpoint. SMILES (simplified molecular input line entry system)-based string descriptors were also tested with the CORAL software (CORelation And Logic). We did an outlier analysis to establish the models to use (or not to use) in case of well recognized families.ResultsInternal validation of the selected models returned Q2 values close to 0.60. External validation also gave r2 values always greater than 0.60. The CORAL descriptor based model for √fu was the best, with r2 0.74 in external validation.ConclusionsPerformance in prediction confirmed the robustness of all the derived models and their suitability for real-life purposes, i.e. screening chemicals for their ADMET profiling. Optimization of descriptors can be useful in order to obtain the correct results with a ionized molecule.Electronic supplementary materialThe online version of this article (10.1007/s11095-018-2561-8) contains supplementary material, which is available to authorized users.

show abstract