Tutorials in Chemoinformatics 2017
DOI: 10.1002/9781119161110.ch9
|View full text |Cite
|
Sign up to set email alerts
|

QSPR Models on Fragment Descriptors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6

Relationship

4
2

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 19 publications
0
6
0
Order By: Relevance
“…MLR calculations : The ISIDA QSPR, program combines forward and backward stepwise variable selection techniques. It generates large numbers of linear models starting from the training set, automatically scanning over specified ISIDA descriptor types (see section 2.2.1) and applying, for each descriptor space, algorithms of forward stepwise variable selection.…”
Section: Computational Proceduresmentioning
confidence: 99%
“…MLR calculations : The ISIDA QSPR, program combines forward and backward stepwise variable selection techniques. It generates large numbers of linear models starting from the training set, automatically scanning over specified ISIDA descriptor types (see section 2.2.1) and applying, for each descriptor space, algorithms of forward stepwise variable selection.…”
Section: Computational Proceduresmentioning
confidence: 99%
“…As shown in Table S4 (see the Supporting Information), we can assume that the structural and activity information encoded on the initial (raw) full set was kept as much as possible on the final (curated) data set. Additionally, we could confirm that the curated data set is free of duplicates using the “Find duplicate structures” option of the EdiSDF program included on the ISIDA project. , A schematic representation of the training set optimization process applied in this work is shown in Figure .…”
Section: Resultsmentioning
confidence: 99%
“…The NCI diversity set used for comparison is comprised by 1635 unique compounds included in the Diversity sets III, IV, and V provided in . Duplicate compounds in the three diversity sets were identified and removed using the EdiSDF program included on the ISIDA project. , The dissimilarity matrix for the MDS was obtained using the pdist function implemented in MatLab . The Jaccard coefficient was employed as the proximity metric.…”
Section: Resultsmentioning
confidence: 99%
“…In this step, the descriptors are selected one by one or in twos with the help of FVS‐1 and FVS‐2 (Forward stepwise Variable Selection) algorithms , respectively, each of which collected 0.4 n , 0.5 n , …, 0.9 n descriptors to increase the diversity of the resulting models, where n is the number of the compounds in the training set. Step (c) eliminates the variables with low t i =a i /Δa i values for the models, where Δ a i is a standard deviation for the coefficient a i at the i ‐th variable in the model . The algorithm selects the variable with the minimal t min < t 0 and builds a new model excluding this variable.…”
Section: Methodsmentioning
confidence: 99%