The Angiosperms353 gene set (AGS) consists of a set of 353 universal low-copy nuclear genes that were selected by examining more than 600 angiosperm species. These genes can be used for phylogenetic studies and population genetics at multiple taxonomic scales. However, current pipelines are not able to recover Angiosperms353 genes efficiently and accurately from high-throughput sequences. Here, we developed Easy353, a reference-guided assembly tool to recover the AGS from high-throughput sequencing (HTS) data (including genome skimming, RNA-seq, and target enrichment). Easy353 is an open-source user-friendly assembler for diverse types of high-throughput data. It has a graphical user interface and a command-line interface that is compatible with all widely-used computer systems. Evaluations based on both simulated and empirical data suggest that Easy353 yields low rates of assembly errors.
The advancement of Next-generation Sequencing (NGS) technologies has led to a revolution in the field of evolutionary biology. With the increasing number of available genomes and transcriptomes, researchers can mine various types of molecular markers that are vital for phylogenetic, evolutionary, and ecological studies. Numerous tools have been developed to extract these molecular markers from NGS data. However, due to a limited number of well-annotated reference genomes for non-model organisms, it is still challenging to obtain these markers accurately and efficiently. Here, we present GeneMiner, an improved and expanded version of our previous tool, Easy353. GeneMiner involves the reference-guided de Bruijn Graph assembly with seed self-discovery and greedy extension. Additionally, it includes a verification step using a parameter-bootstrap method to reduce the negative impact on assembly caused by the relatively distant reference. Our results using both experimental and simulation data showed GeneMiner can accurately acquire phylogenetic molecular markers for plants using transcriptomic, genomic, and other NGS data. GeneMiner is designed to be user-friendly, fast, and memory-efficient, and is compatible with Linux, Windows, and macOS. All source codes are publicly available on GitHub for easy accessibility and transparency (https://github.com/happywithxpl/GeneMiner).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.