Perfluoroalkyl and polyfluoroalkyl substances (PFASs)
are a class
of chemicals widely used in industrial applications due to their exceptional
properties and stability. However, they do not readily degrade in
the environment and are linked to contamination and adverse health
effects in humans and wildlife. To find alternatives for the most
commonly used PFAS molecules that maintain their desirable chemical
properties but are not adverse to biological lifeforms, a novel approach
based upon machine learning is utilized. The machine learning model
is trained on an existing set of PFAS molecules to generate over 260,000
novel PFAS molecules, which we dub PFAS-AI-Gen. Using molecular descriptors
with known relationships to toxicity and industrial suitability followed
by molecular docking and molecular dynamics simulations, this set
of molecules is screened. In this manner, increasingly complex calculations
are performed only for candidate molecules that are most likely to
yield the desired properties of low binding affinity toward two selected
protein receptors, the human pregnane x receptor (hPXR) and peroxisome
proliferator-activated receptor γ (PPAR-γ), and high industrial
suitability, defined by critical micelle concentration (CMC). The
selection criteria of low binding affinity and high industrial suitability
are relative to the popular PFAS alternative GenX. hPXR and PPAR-γ
are selected as they are PFAS targets and facilitate a variety of
functions, such as drug metabolism and glucose regulation, respectively.
Through this approach, 22 promising new PFAS substitutes that may
warrant experimental investigation are identified. This integrated
approach of molecular screening and toxicity estimation may be applicable
to other chemical classes.
Machine learning and artificial intelligence are increasingly gaining in prominence through image analysis, language processing, and automation, to name a few applications. Machine learning is also making profound changes in chemistry. From revisiting decades-old analytical techniques for the purpose of creating better calibration curves, to assisting and accelerating traditional in silico simulations, to automating entire scientific workflows, to being used as an approach to deduce underlying physics of unexplained chemical phenomena, machine learning and artificial intelligence are reshaping chemistry, accelerating scientific discovery, and yielding new insights. This review provides an overview of machine learning and artificial intelligence from a chemist’s perspective and focuses on a number of examples of the use of these approaches in computational chemistry and in the laboratory.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.