Understanding historical chemical usage is crucial for
assessing
current and past impacts on human health and the environment and for
informing future regulatory decisions. However, past monitoring data
are often limited in scope and number of chemicals, while suitable
sample types are not always available for remeasurement. Data-driven
cheminformatics methods for patent and literature data offer several
opportunities to fill this gap. The
chemical stripes
were developed as an interactive, open source tool for visualizing
patent and literature trends over time, inspired by the global warming
and biodiversity stripes. This paper details the underlying code and
data sets behind the visualization, with a major focus on the patent
data sourced from PubChem, including patent origins, uses, and countries.
Overall trends and specific examples are investigated in greater detail
to explore both the promise and caveats that such data offer in assessing
the trends and patterns of chemical patents over time and across different
geographic regions. Despite a number of potential artifacts associated
with patent data extraction, the integration of cheminformatics, statistical
analysis, and data visualization tools can help generate valuable
insights that can both illuminate the chemical past and potentially
serve toward an early warning system for the future.