Background: To preliminarily elucidate the possible molecular basis of scorpion efficacy through the methods of transcriptomic-proteomic-peptidomics.
Methods: The transcriptome data of scorpion were obtained by high-throughput sequencing and analyzed by bioinformatics to establish the protein prediction database of scorpion. Furthermore, the proteomics and peptidomics of scorpion were analyzed by LC-MS analysis, and the GO annotation and KEGG pathway function enrichment analysis were performed for the identified proteins to explore the material basis of the efficacy of scorpion.
Results: A total of 17605 gene sequences were obtained by transcriptome sequencing, and 12368 proteins could be predicted. 278 transcription factors and 51 family information of transcription factors were obtained through database comparison. From GO and KEGG annotation results, it was found that scorpion had functions such as regulating cell cycle, amino acid metabolism and signal transduction, which may be closely related to the analgesic and anti-cancer functions of scorpion. In the process of proteome identification, 10,072 peptides were found, and 1917 protein components were deduced from these peptides. For the identification of peptides, 6758 peptides were matched in the transcriptome database and 33,834 peptides were identified by de novo sequencing. The results showed that the scorpion proteins were mainly enriched in protein translation, tricarboxylic acid cycle and glycolysis, and the KEGG pathway was mainly enriched in oxidative phosphorylation, amino acid metabolism and ribosome.
Conclusions: A total of 33834 peptides and 1917 proteins were identified from scorpion by integrated transcriptomic-proteomics-peptidomics association analysis. GO annotation and KEGG pathway functional enrichment analysis showed that the macromolecular components of scorpion were mainly related to oxidative phosphorylation, amino acid metabolism and immune regulation.