2020
DOI: 10.1111/cbdd.13657
|View full text |Cite
|
Sign up to set email alerts
|

Fingerprint‐based computational models of 5‐lipo‐oxygenase activating protein inhibitors: Activity prediction and structure clustering

Abstract: Inflammatory diseases can be treated by inhibiting 5-lipo-oxygenase activating protein (FLAP). In this study, a data set containing 2,112 FLAP inhibitors was collected. A total of 25 classification models were built by five machine learning algorithms with five different types of fingerprints. The best model, which was built by support vector machine algorithm with ECFP_4 fingerprint had an accuracy and a Matthews correlation coefficient of 0.862 and 0.722 on the test set, respectively. The predicted results w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 65 publications
0
4
0
Order By: Relevance
“…Currently, research has been proposed to combine GNN features with different ML algorithms for the classification or regression tasks. Besides, ECFP fingerprints were also extensively used for the representation of molecules because of their easier interpretation and faster computation compared to other descriptors, and moreover, they have been successfully applied in various prediction tasks. Here, four classic supervised learners were also considered to be coupled with GNN neural features and ECFP fingerprints, including SVR, RF, XGBoost, and LightGBM. Since GAT-10 yields the optimal performance among 36 GNN models, the neural features extracted by GAT-10 were used in the subsequent comparative study.…”
Section: Resultsmentioning
confidence: 99%
“…Currently, research has been proposed to combine GNN features with different ML algorithms for the classification or regression tasks. Besides, ECFP fingerprints were also extensively used for the representation of molecules because of their easier interpretation and faster computation compared to other descriptors, and moreover, they have been successfully applied in various prediction tasks. Here, four classic supervised learners were also considered to be coupled with GNN neural features and ECFP fingerprints, including SVR, RF, XGBoost, and LightGBM. Since GAT-10 yields the optimal performance among 36 GNN models, the neural features extracted by GAT-10 were used in the subsequent comparative study.…”
Section: Resultsmentioning
confidence: 99%
“…In terms of the best model, XGBoost outperformed all and the accuracy and MCC values were observed as 0.90 and 0.81, respectively. Additionally, a pervious fingerprint-based ML study on FLAP modulators stated that that the reliability of predicted results depends mainly on the compounds themselves rather than algorithms or fingerprints ( Tu et al, 2020 ).…”
Section: Resultsmentioning
confidence: 99%
“…The formula is where y ( J ) and σ­( J ) are the average and standard deviation of the probability of n models predicted for compound J , respectively. N ( x , y ( J ), σ­( J )) is the normal distribution density function of probability …”
Section: Methodsmentioning
confidence: 99%
“…N(x, y(J), σ(J)) is the normal distribution density function of probability. 51 On the basis of the prediction accuracy and standard deviation of multiple models for the same compound, we can obtain a normal distribution curve. The d STD-PRO value represents the area under the curve and also the probability when the compound is predicted to be of the opposite type.…”
Section: Model Evaluationmentioning
confidence: 99%