2016
DOI: 10.1186/s13321-016-0124-8
|View full text |Cite
|
Sign up to set email alerts
|

GPURFSCREEN: a GPU based virtual screening tool using random forest classifier

Abstract: BackgroundIn-silico methods are an integral part of modern drug discovery paradigm. Virtual screening, an in-silico method, is used to refine data models and reduce the chemical space on which wet lab experiments need to be performed. Virtual screening of a ligand data model requires large scale computations, making it a highly time consuming task. This process can be speeded up by implementing parallelized algorithms on a Graphical Processing Unit (GPU).ResultsRandom Forest is a robust classification algorith… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2016
2016
2022
2022

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 22 publications
(14 citation statements)
references
References 21 publications
0
14
0
Order By: Relevance
“…To address this challenge, several statistical measures that require less computational expense, such as the maximal information coefficient (MIC), have been developed to promote the efficient handling of big data. 124 In parallel, development of computational infrastructure that can be rapidly expanded, such as the Hadoop file system, 125 crowdsourcing, 126 and massively parallel processing hardware (including the recruitment of graphical processing units 127 ), are being actively explored and adopted.…”
Section: Big Data-driven Techniques For Drug Discoverymentioning
confidence: 99%
“…To address this challenge, several statistical measures that require less computational expense, such as the maximal information coefficient (MIC), have been developed to promote the efficient handling of big data. 124 In parallel, development of computational infrastructure that can be rapidly expanded, such as the Hadoop file system, 125 crowdsourcing, 126 and massively parallel processing hardware (including the recruitment of graphical processing units 127 ), are being actively explored and adopted.…”
Section: Big Data-driven Techniques For Drug Discoverymentioning
confidence: 99%
“…For AID778, the Random Forest gave the AUC of 62 % with all data sets (about 95,000 compounds) [39] and we obtained the AUC of 71.28 % using the WL method with 4000 data sets. Firstly, we compared our results for AS with the previous results.…”
Section: Resultsmentioning
confidence: 80%
“…To achieve negatives, the positive MMRSs and unlabeled MMRSs in each dataset were used as input to train classifier respectively. Eight classification algorithms (Kstar [37], BN [38], IBK [39], J48 [40], RF [41], SVM [42], AdaBoost [43], Bagging [44]) were adopted. These algorithms have been shown effective in various domains of bioinformatics and medicinal chemistry.…”
Section: Methodsmentioning
confidence: 99%