Quantitative Structure-Activity Relationship modeling is one of the major computational tools employed in medicinal chemistry. However, throughout its entire history it has drawn both praise and criticism concerning its reliability, limitations, successes, and failures. In this paper, we discuss: (i) the development and evolution of QSAR; (ii) the current trends, unsolved problems, and pressing challenges; and (iii) several novel and emerging applications of QSAR modeling. Throughout this discussion, we provide guidelines for QSAR development, validation, and application, which are summarized in best practices for building rigorously validated and externally predictive QSAR models. We hope that this Perspective will help communications between computational and experimental chemists towards collaborative development and use of QSAR models. We also believe that the guidelines presented here will help journal editors and reviewers apply more stringent scientific standards to manuscripts reporting new QSAR studies, as well as encourage the use of high quality, validated QSARs for regulatory decision making.
Despite an abundance of online databases providing access to chemical data, there is increasing demand for high-quality, structure-curated, open data to meet the various needs of the environmental sciences and computational toxicology communities. The U.S. Environmental Protection Agency’s (EPA) web-based CompTox Chemistry Dashboard is addressing these needs by integrating diverse types of relevant domain data through a cheminformatics layer, built upon a database of curated substances linked to chemical structures. These data include physicochemical, environmental fate and transport, exposure, usage, in vivo toxicity, and in vitro bioassay data, surfaced through an integration hub with link-outs to additional EPA data and public domain online resources. Batch searching allows for direct chemical identifier (ID) mapping and downloading of multiple data streams in several different formats. This facilitates fast access to available structure, property, toxicity, and bioassay data for collections of chemicals (hundreds to thousands at a time). Advanced search capabilities are available to support, for example, non-targeted analysis and identification of chemicals using mass spectrometry. The contents of the chemistry database, presently containing ~ 760,000 substances, are available as public domain data for download. The chemistry content underpinning the Dashboard has been aggregated over the past 15 years by both manual and auto-curation techniques within EPA’s DSSTox project. DSSTox chemical content is subject to strict quality controls to enforce consistency among chemical substance-structure identifiers, as well as list curation review to ensure accurate linkages of DSSTox substances to chemical lists and associated data. The Dashboard, publicly launched in April 2016, has expanded considerably in content and user traffic over the past year. It is continuously evolving with the growth of DSSTox into high-interest or data-rich domains of interest to EPA, such as chemicals on the Toxic Substances Control Act listing, while providing the user community with a flexible and dynamic web-based platform for integration, processing, visualization and delivery of data and resources. The Dashboard provides support for a broad array of research and regulatory programs across the worldwide community of toxicologists and environmental scientists. Electronic supplementary materialThe online version of this article (10.1186/s13321-017-0247-6) contains supplementary material, which is available to authorized users.
The U.S. Environmental Protection Agency (EPA) is developing methods for utilizing computational chemistry, high-throughput screening (HTS), and various toxicogenomic technologies to predict potential for toxicity and prioritize limited testing resources toward chemicals that likely represent the greatest hazard to human health and the environment. This chemical prioritization research program, entitled "ToxCast," is being initiated with the purpose of developing the ability to forecast toxicity based on bioactivity profiling. The proof-of-concept phase of ToxCast will focus upon chemicals with an existing, rich toxicological database in order to provide an interpretive context for the ToxCast data. This set of several hundred reference chemicals will represent numerous structural classes and phenotypic outcomes, including tumorigens, developmental and reproductive toxicants, neurotoxicants, and immunotoxicants. The ToxCast program will evaluate chemical properties and bioactivity profiles across a broad spectrum of data domains: physical-chemical, predicted biological activities based on existing structure-activity models, biochemical properties based on HTS assays, cell-based phenotypic assays, and genomic and metabolomic analyses of cells. These data will be generated through a series of external contracts, along with collaborations across EPA, with the National Toxicology Program, and with the National Institutes of Health Chemical Genomics Center. The resulting multidimensional data set provides an informatics challenge requiring appropriate computational methods for integrating various chemical, biological, and toxicological data into profiles and models predicting toxicity.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.