Please cite this article as: Prashanth, J.R., Lewis, R.J., An efficient transcriptome analysis pipeline to accelerate venom peptide discovery and characterisation, Toxicon (2015), doi: 10.1016/ j.toxicon.2015.09.012. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
M A N U S C R I P T A C C E P T E D ACCEPTED MANUSCRIPT
AbstractTranscriptome sequencing is now widely adopted as an efficient means to study the chemical diversity of venoms. To improve the efficiency of analysis of these large datasets, we have optimised an analysis pipeline for cone snail venom gland transcriptomes. The pipeline combines ConoSorter with sequence architecture-based elimination and similarity searching using BLAST to improve the accuracy of sequence identification and classification, while reducing requirements for manual intervention. As a proof-of-concept, we used this approach reanalysed three previously published cone snail transcriptomes from diverse dietary groups. Our pipeline method generated similar results to the published studies with significantly less manual intervention. We additionally found undiscovered sequences in the piscovorous C. geographus and vermivorous C. miles and identified sequences in incorrect superfamilies in the molluscivorus C. marmoreus and C. geographus transcriptomes. Our results indicate that this method can improve toxin detection without extending analysis time. While this method was evaluated on cone snail transcriptomes it can be easily optimised to retrieve toxins from other venomous animals.