A new strategy for the identification of known compounds in Streptomyces extracts that can be applied in the discovery of natural products is presented. The strategy incorporates screening a database of 5,553 natural products including 5,102 structures from Streptomyces sp. alone, using a high throughput LCMS data processing algorithm that utilises HRMS data and predicted LC retention times (tR) as filters for rapid identification of compounds in the natural product extract. The database named StrepDB contains for each compound, the structure, molecular formula, molecular mass, and LC predicted retention time. All identified compounds are annotated and color coded for easier visualization. It is an indirect approach to quickly assess masses (which are not annotated) that may potentially lead to the discovery of new or novel structures. In addition, a spectral database named MbcDB was generated using ACD/Spectrus DB Platform. MbcDB contains 665 natural products, each with structure, experimental HRESIMS, MS/MS, UV, and NMR spectra. StrepDB was used to screen a mutant Streptomyces albus extract that led to the identification and isolation of two new compounds: legonmaleimides A and B, the structures of which were elucidated with the aid of MbcDB and spectroscopic techniques. The structures were confirmed by computer assisted structure elucidation (CASE) methods using ACD/Structure Elucidator Suite. The developed methodology suggests a pipeline approach to the dereplication of extracts and discovery of novel natural products.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.