BackgroundThe venom of predatory marine cone snails mainly contains a diverse array of unique bioactive peptides commonly referred to as conopeptides or conotoxins. These peptides have proven to be valuable pharmacological probes and potential drugs because of their high specificity and affinity to important ion channels, receptors and transporters of the nervous system. Most previous studies have focused specifically on the conopeptides from piscivorous and molluscivorous cone snails, but little attention has been devoted to the dominant vermivorous species.ResultsThe vermivorous Chinese tubular cone snail, Conus betulinus, is the dominant Conus species inhabiting the South China Sea. The transcriptomes of venom ducts and venom bulbs from a variety of specimens of this species were sequenced using both next-generation sequencing and traditional Sanger sequencing technologies, resulting in the identification of a total of 215 distinct conopeptides. Among these, 183 were novel conopeptides, including nine new superfamilies. It appeared that most of the identified conopeptides were synthesized in the venom duct, while a handful of conopeptides were identified only in the venom bulb and at very low levels.ConclusionsWe identified 215 unique putative conopeptide transcripts from the combination of five transcriptomes and one EST sequencing dataset. Variation in conopeptides from different specimens of C. betulinus was observed, which suggested the presence of intraspecific variability in toxin production at the genetic level. These novel conopeptides provide a potentially fertile resource for the development of new pharmaceuticals, and a pathway for the discovery of new conotoxins.Electronic supplementary materialThe online version of this article (doi:10.1186/s13742-016-0122-9) contains supplementary material, which is available to authorized users.
Marine drugs have developed rapidly in recent decades. Cone snails, a group of more than 700 species, have always been one of the focuses for new drug discovery. These venomous snails capture prey using a diverse array of unique bioactive neurotoxins, usually named as conotoxins or conopeptides. These conotoxins have proven to be valuable pharmacological probes and potential drugs due to their high specificity and affinity to ion channels, receptors, and transporters in the nervous systems of target prey and humans. Several research groups, including ours, have examined the venom gland of cone snails using a combination of transcriptomic and proteomic sequencing, and revealed the existence of hundreds of conotoxin transcripts and thousands of conopeptides in each Conus species. Over 2000 nucleotide and 8000 peptide sequences of conotoxins have been published, and the number is still increasing quickly. However, more than 98% of these sequences still lack 3D structural and functional information. With the rapid development of genomics and bioinformatics in recent years, functional predictions and investigations on conotoxins are making great progress in promoting the discovery of novel drugs. For example, ω-MVIIA was approved by the U.S. Food and Drug Administration in 2004 to treat chronic pain, and nine more conotoxins are at various stages of preclinical or clinical evaluation. In short, the genus Conus, the big family of cone snails, has become an important genetic resource for conotoxin identification and drug development.
Although there are various Conus species with publicly available transcriptome and proteome data, no genome assembly has been reported yet. Here, using Chinese tubular cone snail (C. betulinus) as a representative, we sequenced and assembled the first Conus genome with original identification of 133 genome-widely distributed conopeptide genes. After integration of our genomics, transcriptomics, and peptidomics data in the same species, we established a primary genetic central dogma of diverse conopeptides, assuming a rough number ratio of ~1:1:1:10s for the total genes: transcripts: proteins: post-translationally modified peptides. This ratio may be special for this worm-hunting Conus species, due to the high diversity of various Conus genomes and the big number ranges of conopeptide genes, transcripts, and peptides in previous reports of diverse Conus species. Only a fraction (45.9%) of the identified conotopeptide genes from our achieved genome assembly are transcribed with transcriptomic evidence, and few genes individually correspond to multiple transcripts possibly due to intraspecies or mutation-based variances. Variable peptide processing at the proteomic level, generating a big diversity of venom conopeptides with alternative cleavage sites, post-translational modifications, and N-/C-terminal truncations, may explain how the 133 genes and ~123 transcripts can generate thousands of conopeptides in the venom of individual C. betulinus. We also predicted many conopeptides with high stereostructural similarities to the putative analgesic ω-MVIIA, addiction therapy AuIB and insecticide ImI, suggesting that our current genome assembly for C. betulinus is a valuable genetic resource for high-throughput prediction and development of potential pharmaceuticals.
Understanding and describing three-dimensional (3D) protein structures have dominated biological and biochemistry research for many years. A comprehensive description of protein folding structure is essential for the advancement of protein research. In this study, a novel description method is developed to generate a set of folding patterns with specific shape features, as well as vector characteristics in space. To accomplish the goal, this method embeds features from geometry, morphology and topology together into an algorithmic approach to achieve a full description for proteins. A set of 27 vectors is derived mathematically from an enclosed space, and each vector represents a 3D folding shape of five successive C(alpha) atoms in the protein backbone. The 27 vectors are represented by 27 symbols, which are called as the protein folding shape code (PFSC). The PFSC method offers a digital description of folding shapes along a protein backbone, which facilitates protein structure analysis. The PFSC method provides a tool to study the similarity and dissimilarity for protein or protein conformers. The PFSC results show overall agreement with structural assignments from the protein data bank, as well as results from other methods. All results show that the PFSC method is a reliable tool with explicit meaning for protein folding shape description.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.