Akhil Sanker scite author profile

Madaj³

et al. 2020

Preprint

Our work comprises a Python program that programmatically mines PubChem to collect the data needed by a machine learning based AutoQSAR algorithm, which generates drug leads for the flaviviruses Dengue and West Nile. The drug leads generated by the program are fed as programmatic inputs to the AutoDock Vina package for automated in silico modelling of the interaction between the lead compounds and the chosen Dengue and West Nile drug target, methyltransferase, whose inhibition leads to the control of viral replication. The machine learning based AutoQSAR algorithm involves feature selection, QSAR modelling, validation and prediction. The drug leads generated each time the program is run reflect the constantly growing PubChem database; this is an important dynamic feature of the program, which facilitates fast drug lead generation against the West Nile and Dengue viruses. The program prints out the top drug leads after screening the PubChem library of over a billion compounds. The interaction of the top drug lead compounds with the drug targets of West Nile and Dengue virus was modelled in an automated way through programmatic commands. Thus our program ushers in a new age of automatic ease in virtual drug screening and drug identification through programmatic data mining of chemical data libraries and drug lead generation through machine learning based AutoQSAR algorithm and an automated in silico

Target2DeNovoDrug: a novel programmatic tool for in silico-deep learning based de novo drug design for any target of interest

Madaj

S²,

Journal of Biomolecular Structure and Dynamics

et al. 2021

The past decade has seen a surge in the range of applications of data science, machine learning, deep learning, and AI methods to drug discovery. The presented work assembles a variety of AI methods for drug discovery and incorporates in silico techniques to provide a holistic tool for automated drug discovery. To identify drug candidates for a particular drug target of interest, the user provides the tool with target signatures in the form of an amino acid sequence or its corresponding nucleotide sequence. The tool collects the data registered on PubChem that is required to perform an automated QSAR and, with the validated QSAR model, carries out prediction and drug lead generation. We call this protocol Target2Drug. It is followed by a protocol we call Target2DeNovoDrug, wherein novel molecules with likely activity against the target are generated de novo using a generative LSTM model. Drug discovery often requires that the generated molecules possess certain properties such as drug-likeness; to optimize the de novo molecules toward the required drug-like property we use a deep learning model called DeepFMPO, and we call this protocol Target2DeNovoDrugPropMax. This is followed by fast, automated AutoDock-Vina based in silico modeling and profiling of the interaction between the optimized drug leads and the drug target. Finally, a Molecular Dynamics protocol is executed automatically for the complex identified with the best protein-ligand interaction in the AutoDock-Vina based virtual screening. The results are stored in the working folder of the user. The code is maintained, supported, and provided for use in the following GitHub repository
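The abstract above mentions automated AutoDock-Vina docking driven by programmatic commands but does not say how those commands are assembled. As a minimal sketch under stated assumptions, the Python below builds a command line from the standard AutoDock Vina flags (`--receptor`, `--ligand`, `--center_x/y/z`, `--size_x/y/z`, `--exhaustiveness`, `--out`). The file names, the search box coordinates and the helper names `SearchBox` and `build_vina_command` are hypothetical and are not taken from the authors' repository.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class SearchBox:
    """Docking search box in Angstroms (values here are hypothetical)."""
    center: Tuple[float, float, float]
    size: Tuple[float, float, float]


def build_vina_command(receptor: str, ligand: str, box: SearchBox,
                       out_path: str, exhaustiveness: int = 8) -> List[str]:
    """Assemble an AutoDock Vina argument list; nothing is executed here."""
    cmd = ["vina", "--receptor", receptor, "--ligand", ligand]
    for axis, value in zip("xyz", box.center):
        cmd += [f"--center_{axis}", str(value)]
    for axis, value in zip("xyz", box.size):
        cmd += [f"--size_{axis}", str(value)]
    cmd += ["--exhaustiveness", str(exhaustiveness), "--out", out_path]
    return cmd


# Hypothetical inputs: a prepared target and one prepared lead compound.
box = SearchBox(center=(10.0, 12.5, -3.0), size=(20.0, 20.0, 20.0))
command = build_vina_command("target.pdbqt", "lead_001.pdbqt", box,
                             "lead_001_docked.pdbqt")
print(" ".join(command))
```

The resulting list could be passed to `subprocess.run(command, check=True)` on a machine where the `vina` executable is installed; looping over many ligand files is one plausible way to screen a batch of leads.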

Automated identification of small drug molecules for Hepatitis C virus through a novel programmatic tool and extensive Molecular Dynamics studies of select drug candidates

Madaj

et al. 2020

Preprint

We report a novel Python based programmatic tool that automates the dry lab drug discovery workflow for Hepatitis C virus. First, the Python program automates the process of data mining the PubChem database to collect the data required by a machine learning based AutoQSAR algorithm, through which drug leads for Hepatitis C virus are generated. The workflow of the machine learning based AutoQSAR involves feature learning and descriptor selection, QSAR modelling, validation and prediction. The drug leads generated by the program are required to satisfy Lipinski's drug-likeness criteria. Fifty of the drug leads generated by the program are fed as programmatic inputs to an in silico modelling package for fast virtual screening and computer modelling of the interaction between the lead compounds and the drug target, a viral helicase of Hepatitis C. The program stores the results automatically in the working folder of the user. The program also generates protein-ligand interaction profiles and stores the visualized images in the working folder of the user. Protein-ligand complexes with structurally diverse ligands having the lowest binding energy were selected for extensive molecular dynamics simulation studies and subsequently for molecular mechanics generalized-born surface area (MMGBSA) calculations with pairwise decomposition. The molecular mechanics studies predict in silico that the compounds generated by the program inhibit the viral helicase of Hepatitis C and prevent the replication of the virus. Thus our programmatic tool ushers in a new age of automatic ease in drug identification for Hepatitis C virus by completely automating the dry lab drug discovery workflow. The program is hosted, maintained and supported at the GitHub repository link given below https://github.com/bengeof/Automated-drug-identification-programmatic-tool-for-Hepatitis-C-virus
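The abstract states that leads must satisfy Lipinski's drug-likeness criteria without describing how the check is performed. The sketch below illustrates the commonly cited rule of five (molecular weight at most 500, logP at most 5, at most 5 hydrogen-bond donors, at most 10 hydrogen-bond acceptors) applied to precomputed descriptors. The `Descriptors` class, the function names, the strict zero-violation default and the descriptor values are assumptions made for illustration; the second compound is invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptors:
    """Precomputed descriptors for one compound (illustrative values only)."""
    name: str
    mol_weight: float        # g/mol
    logp: float              # octanol-water partition coefficient
    h_bond_donors: int
    h_bond_acceptors: int


def lipinski_violations(d: Descriptors) -> int:
    """Count how many of the four rule-of-five thresholds are exceeded."""
    checks = (
        d.mol_weight > 500,
        d.logp > 5,
        d.h_bond_donors > 5,
        d.h_bond_acceptors > 10,
    )
    return sum(checks)


def passes_lipinski(d: Descriptors, max_violations: int = 0) -> bool:
    """Strict by default; some workflows tolerate one violation (assumption)."""
    return lipinski_violations(d) <= max_violations


candidates = [
    Descriptors("aspirin (approximate values)", 180.16, 1.2, 1, 4),
    Descriptors("hypothetical_large_lead", 812.0, 6.3, 7, 14),
]
leads = [c.name for c in candidates if passes_lipinski(c)]
print(leads)
```

In practice the descriptors would come from a cheminformatics toolkit or from PubChem's computed properties rather than being typed in by hand.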

Compound2Drug – a Machine/deep Learning Tool for Predicting the Bioactivity of PubChem Compounds

Valluri²,

Sanker³

et al. 2020

Preprint

Network data is composed of nodes and edges. Machine learning/deep learning algorithms have been applied successfully to network data for node classification and link prediction in social networks, where they offer highly customized suggestions to users. Similarly, one can apply machine learning/deep learning algorithms to biological network data to generate scientifically useful predictions. In the present work, a compound-drug target interaction data set from BindingDB has been used to train machine learning/deep learning algorithms that predict the drug targets for any PubChem compound queried by the user. The user inputs the PubChem Compound ID (CID) of the compound whose biological activity is to be predicted, and the tool outputs the RCSB PDB IDs of the predicted drug targets. The tool also incorporates a feature to perform automated in silico modelling of the compound and the predicted drug targets to uncover their protein-ligand interaction profiles. The program fetches the structures of the compound and the predicted drug targets, prepares them for molecular docking using the standard AutoDock scripts that are part of MGLTools, performs molecular docking and protein-ligand interaction profiling of the targets and the compound, and stores the visualized results in the working folder of the user. The program is hosted, supported and maintained at the following GitHub repository https://github.com/bengeof/Compound2Drug
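The abstract says the program fetches structures for a PubChem CID and for the predicted RCSB PDB IDs, but it does not show how. As a hedged sketch, the Python below only builds download URLs from the public PubChem PUG REST and RCSB file-download URL patterns; it makes no network request. The helper names, the example CID 2244 and the placeholder PDB ID `1abc` are chosen here for illustration and are not taken from the Compound2Drug code.

```python
PUBCHEM_PUG = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"
RCSB_FILES = "https://files.rcsb.org/download"


def pubchem_sdf_url(cid: int, three_d: bool = True) -> str:
    """URL of the SDF record for a PubChem CID (3D conformer if requested)."""
    if cid <= 0:
        raise ValueError("PubChem CIDs are positive integers")
    url = f"{PUBCHEM_PUG}/compound/cid/{cid}/SDF"
    return url + "?record_type=3d" if three_d else url


def rcsb_pdb_url(pdb_id: str) -> str:
    """URL of the PDB-format file for a four-character RCSB PDB ID."""
    pdb_id = pdb_id.strip().upper()
    if len(pdb_id) != 4 or not pdb_id.isalnum():
        raise ValueError("expected a four-character alphanumeric PDB ID")
    return f"{RCSB_FILES}/{pdb_id}.pdb"


# Example inputs chosen only for illustration.
ligand_url = pubchem_sdf_url(2244)
target_url = rcsb_pdb_url("1abc")
print(ligand_url)
print(target_url)
```

The URLs could then be downloaded with `urllib.request.urlretrieve` or a similar client before the structures are converted for docking.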

A program to automate the discovery of drugs for West Nile and Dengue virus—programmatic screening of over a billion compounds on PubChem, generation of drug leads and automated in silico modelling

Journal of Biomolecular Structure and Dynamics

Madaj

et al. 2020