2016
DOI: 10.3389/fmicb.2016.00269
|View full text |Cite
|
Sign up to set email alerts
|

GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data

Abstract: This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMM). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle, consequently the program demands low computational resources. As a proof-of-concept and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
33
0
3

Year Published

2017
2017
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 32 publications
(37 citation statements)
references
References 57 publications
(104 reference statements)
1
33
0
3
Order By: Relevance
“…Microviridae HMM-profiles (Alves et al, 2016) https://doi.org/10.17632/zb9swhpv3y.1 MVP database (Gao et al, 2018) http://mvp.medgenius.info…”
Section: Star+methods Key Resources Tablementioning
confidence: 99%
See 1 more Smart Citation
“…Microviridae HMM-profiles (Alves et al, 2016) https://doi.org/10.17632/zb9swhpv3y.1 MVP database (Gao et al, 2018) http://mvp.medgenius.info…”
Section: Star+methods Key Resources Tablementioning
confidence: 99%
“…Finally, the HMM-crAsslike profile was built using hmmbuild from the HMMER package v.3.1b2 (http://hmmer.org/) (Eddy, 1998). For the Microviridae case, all HMM-profiles for the viral protein 1 (VP1) developed by Alves et al (2016) were adopted.…”
Section: Hmm Annotationmentioning
confidence: 99%
“…Thus, it allows for an unbiased diagnostic analysis. There is a variety of tool able to address NGS-based pathogen related questions with different focuses: either aiming to discover yet unknown genomes [5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20][21][22] or to detect known species in a sample [23][24][25][26][27][28][29][30][31][32][33][34][35][36][37][38][39][40]. Among both groups, there are different underlying algorithms, the main distinction running between alignment-based [15-17, 19, 23, 25, 26, 28-31, 33, 35-37, 39, 40] and alignment-free methods [6,9,12,21,32,38].…”
Section: Introductionmentioning
confidence: 99%
“…Among both groups, there are different underlying algorithms, the main distinction running between alignment-based [15-17, 19, 23, 25, 26, 28-31, 33, 35-37, 39, 40] and alignment-free methods [6,9,12,21,32,38]. Many tools of course combine both approaches [5,7,8,10,11,13,14,18,20,22,27,34]. While being faster in most cases, alignment-free methods are limited to the detection of sequences, whereas alignment-based methods potentially allow for a more detailed characterization of genomes.…”
Section: Introductionmentioning
confidence: 99%
“…These can include GC bias, highly repetitive sequences, sheer scale or even financial limitation. Recently, Keinath and colleagues generated 20x short read Illumina coverage of similar to previous technologies designed to fill gaps within genome assemblies (such as Gapfiller and IMAGE) (Boetzer and Pirovano 2012;Tsai et al 2010), assemble flanking data (GenSeed, Tracembler) (Alves et al 2016;Dong et al 2007) and assemble exons (Lamichhaney et al 2012). A similar protocol has been independently proposed by Aluome and colleagues, although no pipeline was made available (Aluome et al 2016).…”
Section: Introductionmentioning
confidence: 99%