Collaborative drug discovery for More Medicines for Tuberculosis (MM4TB)

Ekins, Sean; Spektor, Anna; Clark, Alex M.; Dole, Krishna; Bunin, Barry A.

doi:10.1016/j.drudis.2016.10.009

Cited by 12 publications

(10 citation statements)

References 101 publications

(105 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The discovery of new TB drug candidates with novel mechanisms of action and a shortened length of treatment is of fundamental importance. Much of the effort has resorted to large high-throughput screens in academia, industry and efforts funded by the NIH and the Bill and Melinda Gates Foundation 3–5 . However, the translation of in vitro active compounds coming from these screens and moving them in vivo is fraught with difficulty in terms of finding molecules that balance activity versus good physicochemical and pharmacokinetic properties.…”

Section: Introductionmentioning

confidence: 99%

Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery

et al. 2018

Self Cite

View full text Add to dashboard Cite

Tuberculosis is a global health dilemma. In 2016, the WHO reported 10.4 million incidences and 1.7 million deaths. The need to develop new treatments for those infected with Mycobacterium tuberculosis ( Mtb) has led to many large-scale phenotypic screens and many thousands of new active compounds identified in vitro. However, with limited funding, efforts to discover new active molecules against Mtb needs to be more efficient. Several computational machine learning approaches have been shown to have good enrichment and hit rates. We have curated small molecule Mtb data and developed new models with a total of 18,886 molecules with activity cutoffs of 10 μM, 1 μM, and 100 nM. These data sets were used to evaluate different machine learning methods (including deep learning) and metrics and to generate predictions for additional molecules published in 2017. One Mtb model, a combined in vitro and in vivo data Bayesian model at a 100 nM activity yielded the following metrics for 5-fold cross validation: accuracy = 0.88, precision = 0.22, recall = 0.91, specificity = 0.88, kappa = 0.31, and MCC = 0.41. We have also curated an evaluation set ( n = 153 compounds) published in 2017, and when used to test our model, it showed the comparable statistics (accuracy = 0.83, precision = 0.27, recall = 1.00, specificity = 0.81, kappa = 0.36, and MCC = 0.47). We have also compared these models with additional machine learning algorithms showing Bayesian machine learning models constructed with literature Mtb data generated by different laboratories generally were equivalent to or outperformed deep neural networks with external test sets. Finally, we have also compared our training and test sets to show they were suitably diverse and different in order to represent useful evaluation sets. Such Mtb machine learning models could help prioritize compounds for testing in vitro and in vivo.

show abstract

Section: Introductionmentioning

confidence: 99%

Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery

et al. 2018

Self Cite

View full text Add to dashboard Cite

show abstract

“…A highly relevant issue that can strongly benefit from novel procedures standing on big data and classical ML methods is drug discovery for neglected diseases. Cheminformatics tools have been assembled into a web-based platform in the project More Medicines for Tuberculosis (MM4TB), funded by the European Union [ 104 ]. The project relies on classical ML methods ( Bayesian modelling , SVMs, random forest , and bootstrapping ), collaboratively working on data acquired from the screening of natural products and synthetic compounds against the microorganism Mycobacterium tuberculosis .…”

Section: Materials Discoverymentioning

confidence: 99%

Big data and machine learning for materials science

et al. 2021

View full text Add to dashboard Cite

Herein, we review aspects of leading-edge research and innovation in materials science that exploit big data and machine learning (ML), two computer science concepts that combine to yield computational intelligence. ML can accelerate the solution of intricate chemical problems and even solve problems that otherwise would not be tractable. However, the potential benefits of ML come at the cost of big data production; that is, the algorithms demand large volumes of data of various natures and from different sources, from material properties to sensor data. In the survey, we propose a roadmap for future developments with emphasis on computer-aided discovery of new materials and analysis of chemical sensing compounds, both prominent research fields for ML in the context of materials science. In addition to providing an overview of recent advances, we elaborate upon the conceptual and practical limitations of big data and ML applied to materials science, outlining processes, discussing pitfalls, and reviewing cases of success and failure.

show abstract

“…This technique is already proving beneficial to investigations into drug repurposing, a relatively low-risk strategy where small molecules that are known to have therapeutic benefits and acceptable safety profiles are investigated to see if they can be applied to other conditions by exploiting drug repurposing databases such as the NCGC Pharmaceutical Collection [30]. A number of drug repurposing studies have already been published to demonstrate the potential of machine learning for exploiting information to identify novel molecular disease targets [31] and repurpose existing medications for the treatment of a range of conditions including lupus [32], neurodegenerative disorders [33] and tuberculosis [34]. As it becomes easier to collate and analyse more data it is expected that this technology can also make significant in-roads into combatting orphan diseases, which although rare still affect up to 350 million people worldwide [35].…”

Section: Body Of the Textmentioning

confidence: 99%

“…[73], utilising not only standard methods and ideas, but also innovative concepts, approaches and algorithms, there are intrinsic problems that are related to how the research in this area is funded and how these efforts are rewarded. Whilst the pharmaceutical industry is engaged in data sharing and there are excellent examples of developing collaborative platforms for opensource drug discovery [34], their need to protect their intellectual property and not share all the data and software applications is understandable. However, due to the lack of appropriate funding, the efforts of the academic community often result in projects that are short-lived delivering sometimes ingenious software solutions that are frequently not finished and/or difficult to integrate into other relevant software platforms as the file formats and data storage are not standardized.…”

Section: Body Of the Textmentioning

confidence: 99%

The Benefits of In Silico Modeling to Identify Possible Small-Molecule Drugs and Their Off-Target Interactions

Zloh

Kirton

2018

Future Med. Chem.

View full text Add to dashboard Cite

The research into the use of small molecules as drugs continues to be a key driver in the development of molecular databases, computer-aided drug design software and collaborative platforms. The evolution of computational approaches is driven by the essential criteria that a drug molecule has to fulfill, from the affinity to targets to minimal side effects while having adequate absorption, distribution, metabolism, and excretion (ADME) properties. A combination of ligand- and structure-based drug development approaches is already used to obtain consensus predictions of small molecule activities and their off-target interactions. Further integration of these methods into easy-to-use workflows informed by systems biology could realize the full potential of available data in the drug discovery and reduce the attrition of drug candidates.

show abstract

Collaborative drug discovery for More Medicines for Tuberculosis (MM4TB)

Cited by 12 publications

References 101 publications

Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery

Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery

Big data and machine learning for materials science

The Benefits of In Silico Modeling to Identify Possible Small-Molecule Drugs and Their Off-Target Interactions

Contact Info

Product

Resources

About