2021
DOI: 10.1016/j.ailsci.2021.100004
|View full text |Cite
|
Sign up to set email alerts
|

Quantifying sources of uncertainty in drug discovery predictions with probabilistic models

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(8 citation statements)
references
References 37 publications
0
8
0
Order By: Relevance
“…In particular, the median value of all compounds in the generated dataset defines the threshold, but the range of allowed thresholds are fixed to be 4 ≤ pValue ≤ 6, where pValue = − log 10 (Activity Value). If the median is outside this range, a fixed threshold pValue = 5 is applied, which follows the common practice (Mayr et al, 2018;Stanley et al, 2021) in drug discovery. In this way, we can try our best to keep the dataset balanced while making the generated tasks meaningful.…”
Section: Binary Classification Task With Adaptive Thresholdmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, the median value of all compounds in the generated dataset defines the threshold, but the range of allowed thresholds are fixed to be 4 ≤ pValue ≤ 6, where pValue = − log 10 (Activity Value). If the median is outside this range, a fixed threshold pValue = 5 is applied, which follows the common practice (Mayr et al, 2018;Stanley et al, 2021) in drug discovery. In this way, we can try our best to keep the dataset balanced while making the generated tasks meaningful.…”
Section: Binary Classification Task With Adaptive Thresholdmentioning
confidence: 99%
“…Additionally, current OOD research focus almost exclusively on the single-instance prediction tasks while overlook multi-instance prediction tasks, and how to better handle the distribution shift in multiple instance domains (e.g., molecule and protein inputs in the SBAP task) remains an open problem. Lastly, while not explored in this paper, the large-scale realistic datasets always come with non-negligible inherent noise, both aleatoric and epistemic (Lazic & Williams, 2021). And how to incorporate noise learning with OOD generalization to boost model's robustness and generalization in the meantime is an important research direction.…”
Section: Domain Generalization Algorithmsmentioning
confidence: 99%
“…Although there are ML methods that provide inherent prediction uncertainties such as Gaussian process modeling [ 20 ], most ML/DL methods including DNNs produce numerical end points without uncertainty estimates. Approaches for uncertainty quantification of predictions complementing ML/DL include probabilistic [ 21 ] and ensemble methods [ 22 ]. The latter methods determine prediction variance on the basis of differently trained models generated with the same algorithm.…”
Section: Quantifying Prediction Uncertaintymentioning
confidence: 99%
“…The latter methods determine prediction variance on the basis of differently trained models generated with the same algorithm. For probabilistic approaches, Bayesian DNNs provide a prime example [ 21 ]. However, due to their computational costs, Bayesian DNNs are only applicable to large data sets if approximations are introduced.…”
Section: Quantifying Prediction Uncertaintymentioning
confidence: 99%
“…For this to be realised, a statistical methodology must be adopted through which the outputs of the various approaches can be integrated to produce updated judgments. Bayesian inference represents a powerful technique for achieving this, permitting as it does the generation of probabilistic distributions which may be related to severity of toxicity (Lazic and Williams 2021 ). Its application within predictive toxicology constitutes an emerging field of interest, and as such it has been drawn upon in recent studies aimed towards development of models describing endpoints spanning skin sensitivity (Reynolds et al 2019 ), drug-induced liver injury (Semenova et al 2020 ; Williams et al 2020 ) and cardiotoxicity (Felli and Leishman 2020 ).…”
Section: Introductionmentioning
confidence: 99%