2020
DOI: 10.1016/j.jmgm.2019.107516
|View full text |Cite
|
Sign up to set email alerts
|

A classification model for blood brain barrier penetration

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
22
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 33 publications
(25 citation statements)
references
References 31 publications
0
22
0
Order By: Relevance
“…External estimates of probability that a compound will undergo active efflux mediated by P-glycoprotein (P-gp) can also be included [21,24,37]. In other approaches, large pools of various 1D, 2D (including molecular fingerprints), and 3D molecular descriptors calculated by different methods [26,[38][39][40][41][42][43][44][45][46][47][48][49][50][51][52] are analyzed by various statistical learning techniques, e.g., multiple linear regression, linear discriminant analysis, partial least squares regression, support vector machines, artificial neural networks, random forests, etc., often in combination with some descriptor selection protocols [23,24,26,[43][44][45][46][47][48][49][50][51][52]. However, the limited size of the training sets, use of unverified data, and too-small modeling errors for such an inherently noisy endpoint often give rise to the concerns of possible model overfitting [16].…”
Section: Introductionmentioning
confidence: 99%
“…External estimates of probability that a compound will undergo active efflux mediated by P-glycoprotein (P-gp) can also be included [21,24,37]. In other approaches, large pools of various 1D, 2D (including molecular fingerprints), and 3D molecular descriptors calculated by different methods [26,[38][39][40][41][42][43][44][45][46][47][48][49][50][51][52] are analyzed by various statistical learning techniques, e.g., multiple linear regression, linear discriminant analysis, partial least squares regression, support vector machines, artificial neural networks, random forests, etc., often in combination with some descriptor selection protocols [23,24,26,[43][44][45][46][47][48][49][50][51][52]. However, the limited size of the training sets, use of unverified data, and too-small modeling errors for such an inherently noisy endpoint often give rise to the concerns of possible model overfitting [16].…”
Section: Introductionmentioning
confidence: 99%
“…While such models assign speci c logBB/PR values for each drug, binary models have so far reached a higher prediction accuracy and provide a preliminary insight regarding the behavior of candidate drugs which is su cient in early drug discovery stages. Predominantly, binarization of drug permeability across the BBB is performed by setting empirical thresholds to logBB values [5][6][7][8][9]. However, S. Kunwittaya et al [6] have shown that varying logBB thresholds lead to a difference in the prediction accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…In this context, different types of classi ers were trained in the literature including Support Vector Machines (SVM) [6,8,11,12], Linear Discriminant Analysis (LDA) [13], Arti cial Neural Networks (ANN) [6] and Multi-Layer Perceptron (MLP) [8,9], k-Nearest Neighbors (k-NN) [8], Decision Trees (DT) [6,7] and Random Forests (RF) [5,8,9]. Other studies apply consensus models, by training and combining multiple classi ers [8,9]. While consensus models mitigate the over tting problem of single classi ers, they naturally require high computational power, especially when dealing with high dimensional data.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations