2017
DOI: 10.1021/acs.jcim.7b00274
|View full text |Cite
|
Sign up to set email alerts
|

Interpretation of Quantitative Structure–Activity Relationship Models: Past, Present, and Future

Abstract: This paper is an overview of the most significant and impactful interpretation approaches of quantitative structure-activity relationship (QSAR) models, their development, and application. The evolution of the interpretation paradigm from "model → descriptors → (structure)" to "model → structure" is indicated. The latter makes all models interpretable regardless of machine learning methods or descriptors used for modeling. This opens wide prospects for application of corresponding interpretation approaches to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
154
0
5

Year Published

2018
2018
2021
2021

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 197 publications
(160 citation statements)
references
References 121 publications
(245 reference statements)
1
154
0
5
Order By: Relevance
“…Random forest (RF) [36] is a supervised learning algorithm with an ensemble of decision trees generated from a bootstrapped (bagged) sampling of compounds and features. It is widely used in the traditional structure-property relation research [37], and was considered as a "gold standard" according to its robustness, easy usage and high prediction accuracy in structure-property relationship research [38]. Here, the ECFP with a fixed length of 1024 [12] was used with the RF model, which was implemented in Python 3.6.3 [39] with the package Scikit-learn, version 0.21.2 [40].…”
Section: Random Forestmentioning
confidence: 99%
“…Random forest (RF) [36] is a supervised learning algorithm with an ensemble of decision trees generated from a bootstrapped (bagged) sampling of compounds and features. It is widely used in the traditional structure-property relation research [37], and was considered as a "gold standard" according to its robustness, easy usage and high prediction accuracy in structure-property relationship research [38]. Here, the ECFP with a fixed length of 1024 [12] was used with the RF model, which was implemented in Python 3.6.3 [39] with the package Scikit-learn, version 0.21.2 [40].…”
Section: Random Forestmentioning
confidence: 99%
“…Most of them share a common drawback: failure to interpret the underlying causal relationships between the inputs and the response treating the ANN models essentially as a black box . Coefficient of correlation/determination ( R / R 2 ) values calculated for a linear fit between predicted and measured values are often erroneously (as noted by Héberger) used as performance metrics invalidating the models.…”
Section: Introductionmentioning
confidence: 99%
“…An emerging technology is explainable AI which tries to open the black box. There are many already existing possibilities to explain model prediction, as shown by Polishchuk . Newer technologies are emerging, especially with regard to neural networks.…”
Section: Where Are We Headed?mentioning
confidence: 99%