Karima Sid scite author profile

Karima Sid

5 Publications

9 Citation Statements Received

49 Citation Statements Given


Affiliations

Larbi Ben M'hidi University of Oum El Bouaghi, Université Constantine 2

Publications


Ensemble Learning for Large Scale Virtual Screening on Apache Spark

Sid, Batouche (2018)


Virtual screening (VS) is an in silico tool for drug discovery that aims to identify candidate drugs by computationally screening large libraries of small molecules. Various ligand-based and structure-based virtual screening approaches have been proposed in recent decades. Machine learning (ML) techniques have been widely applied in the drug discovery and development process, predominantly in ligand-based virtual screening. Ensemble learning is a common paradigm in ML in which many models are trained on the same problem's data and their results are combined into one improved prediction. Applying VS to massive molecular libraries (Big Data) is computationally intensive, so it has become necessary to split the data into chunks in order to parallelize and distribute the task. For many years, MapReduce has been successfully applied on clusters to solve problems involving very large datasets, but with some limitations. Apache Spark is an open-source framework for Big Data processing that overcomes the shortcomings of MapReduce. In this paper, we propose a new approach based on the ensemble learning paradigm in Apache Spark to improve the execution time and precision of large-scale virtual screening. We generate a new training dataset to evaluate our approach. The experimental results show good predictive performance, up to 92% precision, with an acceptable execution time.
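The two ideas the abstract combines, splitting a library into chunks and merging several models' predictions into one, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation and does not use Apache Spark. The threshold "models", the descriptor tuples, and all function names are hypothetical stand-ins chosen only for illustration.

```python
from collections import Counter
from typing import Callable, List, Sequence

Molecule = Sequence[float]          # hypothetical descriptor vector
Model = Callable[[Molecule], int]   # returns 1 (active) or 0 (inactive)


def majority_vote(models: List[Model], molecule: Molecule) -> int:
    """Combine the base models' predictions by simple majority vote."""
    votes = Counter(model(molecule) for model in models)
    return votes.most_common(1)[0][0]


def split_into_chunks(library: List[Molecule], n_chunks: int) -> List[List[Molecule]]:
    """Split the library into roughly equal chunks (a stand-in for data partitions)."""
    return [library[i::n_chunks] for i in range(n_chunks)]


def screen(library: List[Molecule], models: List[Model], n_chunks: int = 2) -> List[Molecule]:
    """Screen each chunk independently and collect the predicted actives.

    Here the chunks are processed in a plain loop; on a cluster, each chunk
    could be handed to a separate worker.
    """
    hits: List[Molecule] = []
    for chunk in split_into_chunks(library, n_chunks):
        hits.extend(m for m in chunk if majority_vote(models, m) == 1)
    return hits


# Three toy threshold "models" (assumed for illustration; real base
# learners would be trained on labelled data).
models = [
    lambda m: int(m[0] > 0.5),
    lambda m: int(m[1] > 0.5),
    lambda m: int(sum(m) > 1.0),
]
library = [(0.9, 0.8), (0.1, 0.2), (0.7, 0.2), (0.6, 0.6)]
hits = screen(library, models)
```

Using an odd number of binary voters avoids ties in the vote. A weighted or probability-averaging combiner would be an equally plausible choice.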


Big Data Analytics Techniques in Virtual Screening for Drug Discovery

Sid, Batouche (2017)


DeepD_DrugC: Deep and distributed workflow to predict drug-candidates

Sid, Zertal, Mezioud (2022)


Distributed heterogeneous ensemble learning on Apache Spark for ligand-based virtual screening

Sid, Batouche (2021), IJDMMM


Distributed heterogeneous ensemble learning on Apache Spark for ligand-based virtual screening

Sid, Batouche (2021), IJDMMM

