Vaccination is generally considered to be the most effective method of preventing infectious diseases. All vaccinations work by presenting a foreign antigen to the immune system in order to evoke an immune response. The active agent of a vaccine may be intact but inactivated (‘attenuated’) forms of the causative pathogens (bacteria or viruses), or purified components of the pathogen that have been found to be highly immunogenic. The increased understanding of antigen recognition at molecular level has resulted in the development of rationally designed peptide vaccines. The concept of peptide vaccines is based on identification and chemical synthesis of B-cell and T-cell epitopes which are immunodominant and can induce specific immune responses. The accelerating growth of bioinformatics techniques and applications along with the substantial amount of experimental data has given rise to a new field, called immunoinformatics. Immunoinformatics is a branch of bioinformatics dealing with in silico analysis and modelling of immunological data and problems. Different sequence- and structure-based immunoinformatics methods are reviewed in the paper.
With this application note we aim to offer the community a production-ready tool for de novo design. It can be effectively applied on drug discovery projects that are striving to resolve either exploration or exploitation problems while navigating the chemical space. By releasing the code we are aiming to facilitate the research on using generative methods on drug discovery problems and to promote the collaborative efforts in this area so that it can be used as an interaction point for future scientific collaborations. File list (2) download file view on ChemRxiv REINVENT 2.0-an AI tool for de novo drug design.pdf (409.34 KiB) download file view on ChemRxiv REINVENT 2.0-an AI tool for de novo drug design sup... (846.46 KiB)
Molecular generative models trained with small sets of molecules represented as SMILES strings can generate large regions of the chemical space. Unfortunately, due to the sequential nature of SMILES strings, these models are not able to generate molecules given a scaffold (i.e., partially-built molecules with explicit attachment points). Herein we report a new SMILES-based molecular generative architecture that generates molecules from scaffolds and can be trained from any arbitrary molecular set. This approach is possible thanks to a new molecular set pre-processing algorithm that exhaustively slices all possible combinations of acyclic bonds of every molecule, combinatorically obtaining a large number of scaffolds with their respective decorations. Moreover, it serves as a data augmentation technique and can be readily coupled with randomized SMILES to obtain even better results with small sets. Two examples showcasing the potential of the architecture in medicinal and synthetic chemistry are described: First, models were trained with a training set obtained from a small set of Dopamine Receptor D2 (DRD2) active modulators and were able to meaningfully decorate a wide range of scaffolds and obtain molecular series predicted active on DRD2. Second, a larger set of drug-like molecules from ChEMBL was selectively sliced using synthetic chemistry constraints (RECAP rules). In this case, the resulting scaffolds with decorations were filtered only to allow those that included fragment-like decorations. This filtering process allowed models trained with this dataset to selectively decorate diverse scaffolds with fragments that were generally predicted to be synthesizable and attachable to the scaffold using known synthetic approaches. In both cases, the models were already able to decorate molecules using specific knowledge without the need to add it with other techniques, such as reinforcement learning. We envision that this architecture will become a useful addition to the already existent architectures for de novo molecular generation.
With this application note we aim to offer the community a production-ready tool for de novo design. It can be effectively applied on drug discovery projects that are striving to resolve either exploration or exploitation problems while navigating the chemical space. By releasing the code we are aiming to facilitate the research on using generative methods on drug discovery problems and to promote the collaborative efforts in this area so that it can be used as an interaction point for future scientific collaborations.
Due to the strong relationship between desired molecular activity to its structural core, screening of focused, core sharing chemical libraries is a key step in lead optimisation. Despite the plethora of current research focused on in silico methods for molecule generation, to our knowledge, no tool capable of designing such libraries has been proposed. In this work, we present a novel tool for de novo drug design called Lib-INVENT. This is capable of rapidly proposing chemical libraries of compounds sharing the same core while maximising a range of desirable properties. To further help the process of designing focused libraries, the user can list specific chemical reactions that can be used for the library creation. Lib-INVENT is therefore a flexible tool for generating virtual chemical libraries for lead optimisation in a broad range of scenarios. Additionally, the shared core ensures that the compounds in the library are similar, possessing desirable properties and can be also synthesized under the same or similar conditions. File list (2) download file view on ChemRxiv Lib-INVENT.pdf (1.42 MiB) download file view on ChemRxiv Supporting information.pdf (265.77 KiB)
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.