Multigene and genomic data sets have become commonplace in the field of phylogenetics, but many existing tools are not designed for such data sets, which often makes the analysis time‐consuming and tedious. Here, we present PhyloSuite, a (cross‐platform, open‐source, stand‐alone Python graphical user interface) user‐friendly workflow desktop platform dedicated to streamlining molecular sequence data management and evolutionary phylogenetics studies. It uses a plugin‐based system that integrates several phylogenetic and bioinformatic tools, thereby streamlining the entire procedure, from data acquisition to phylogenetic tree annotation (in combination with iTOL). It has the following features: (a) point‐and‐click and drag‐and‐drop graphical user interface; (b) a workplace to manage and organize molecular sequence data and results of analyses; (c) GenBank entry extraction and comparative statistics; and (d) a phylogenetic workflow with batch processing capability, comprising sequence alignment (mafft and macse), alignment optimization (trimAl, HmmCleaner and Gblocks), data set concatenation, best partitioning scheme and best evolutionary model selection (PartitionFinder and modelfinder), and phylogenetic inference (MrBayes and iq‐tree). PhyloSuite is designed for both beginners and experienced researchers, allowing the former to quick‐start their way into phylogenetic analysis, and the latter to conduct, store and manage their work in a streamlined way, and spend more time investigating scientific questions instead of wasting it on transferring files from one software program to another.
18Multi-gene and genomic datasets have become commonplace in the field of 19 phylogenetics, but many of the existing tools are not designed for such datasets, 20 which makes the analysis time-consuming and tedious. We therefore present 21 PhyloSuite, a user-friendly workflow desktop platform dedicated to streamlining 22 molecular sequence data management and evolutionary phylogenetics studies. It 23 employs a plugin-based system that integrates a number of useful phylogenetic and 24 bioinformatic tools, thereby streamlining the entire procedure, from data acquisition 25 to phylogenetic tree annotation, with the following features: (i) point-and-click and 26 drag-and-drop graphical user interface, (ii) a workspace to manage and organize 27 molecular sequence data and results of analyses, (iii) GenBank entries extraction and 28 comparative statistics, (iv) a phylogenetic workflow with batch processing capability, 29(v) elaborate bioinformatic analysis for mitochondrial genomes. The aim of 30 PhyloSuite is to enable researchers to spend more time playing with scientific 31 questions, instead of wasting it on conducting standard analyses. The compiled binary 32 of PhyloSuite is available under the GPL license at 33 https://github.com/dongzhang0725/PhyloSuite/releases, implemented in Python and 34 runs on Windows, Mac OSX and Linux. 35 36 37 Advancements in next-generation sequencing technologies (Metzker, 2009) have 38 resulted in a huge increase in the amount of genetic data available through public 39 databases. While this opens a multitude of research possibilities, retrieving and 40 managing such large amounts of data may be difficult and time-consuming for 41 researchers who are not computer-savvy. A standard analytical procedure for 42 phylogenetic analysis is: selecting and downloading GenBank entries, extracting 43 target genes (for multi-gene datasets, such as organelle genomes) and/or mining other 44 data, sequence alignment, alignment optimization, concatenation of alignments (for 45 multi-gene datasets), selection of best-fit partitioning schemes and evolutionary 46 models, phylogeny reconstruction, and finally visualization and annotation of the 47 phylogram. This can be very time-consuming if different programs have to be 48 employed for different steps, especially as they often have different input file format 49 requirements, and sometimes even require manual file tweaking. Therefore, 50 multifunctional, workflow-type software packages are becoming increasingly needed 51 by a broad range of evolutionary biologists (Smith, 2015). Specifically, as single-gene 52 datasets are rapidly being replaced by multi-gene or genomic datasets as a tool of 53 choice for phylogenetic reconstruction (Degnan and Rosenberg, 2009; Rivera-Rivera 54 and Montoya-Burgos, 2016), automated gene extraction from genomic data and batch 55 manipulation in some of the above steps, like alignment, are becoming a necessity. 56 Although there are several tools in existence, designed to streamline this process 57 by incorporat...
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.