2015
DOI: 10.1186/1471-2164-16-s10-s11
|View full text |Cite
|
Sign up to set email alerts
|

Ancestral gene synteny reconstruction improves extant species scaffolding

Abstract: We exploit the methodological similarity between ancestral genome reconstruction and extant genome scaffolding. We present a method, called ARt-DeCo that constructs neighborhood relationships between genes or contigs, in both ancestral and extant genomes, in a phylogenetic context. It is able to handle dozens of complete genomes, including genes with complex histories, by using gene phylogenies reconciled with a species tree, that is, annotated with speciation, duplication and loss events. Reconstructed ancest… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
4
3
1

Relationship

3
5

Authors

Journals

citations
Cited by 17 publications
(8 citation statements)
references
References 40 publications
0
8
0
Order By: Relevance
“…The synteny-based adjacencies were used to define well-supported consensus sets, which were then validated with and complemented by physical mapping and/or RNAseq and/or re-sequencing data for 14 assemblies. This followed a reconciliation workflow to integrate the different sets of scaffold adjacencies from synteny, physical mapping, RNAseq, or alignment data for each assembly (see the “Methods” section; Additional file 1: Figure S1) [2950]. Applying this integrative approach produced updated reference assemblies with increased scaffold N50 values (a median-like metric where half the genome is assembled into scaffolds of length N50 or longer) and reduced scaffold counts (Table 1).…”
Section: Resultsmentioning
confidence: 99%
“…The synteny-based adjacencies were used to define well-supported consensus sets, which were then validated with and complemented by physical mapping and/or RNAseq and/or re-sequencing data for 14 assemblies. This followed a reconciliation workflow to integrate the different sets of scaffold adjacencies from synteny, physical mapping, RNAseq, or alignment data for each assembly (see the “Methods” section; Additional file 1: Figure S1) [2950]. Applying this integrative approach produced updated reference assemblies with increased scaffold N50 values (a median-like metric where half the genome is assembled into scaffolds of length N50 or longer) and reduced scaffold counts (Table 1).…”
Section: Resultsmentioning
confidence: 99%
“…ADSEQ builds upon a family of methods aimed at reconstructing the evolutionary history of gene adjacencies introduced with the DECO algorithm [ 50 ] and extended along several lines in [ 51 , 52 ]. It is implemented within the package DECOSTAR [ 53 ].…”
Section: Methodsmentioning
confidence: 99%
“…In the present work, the prior scores of extant adjacencies are either 1.0 for adjacencies that are observed in a contig or scaffold, or a scaffolding score obtained from sequencing data using the scaffolding software BESST . Pairs of genes located at the extremities of contigs and for which sequencing data do not provide any evidence for a scaffolding adjacency receive a small prior score as described in [ 52 ] (see also Additional file 1 ). The posterior scores are defined as the frequency out of a sample of 100 solutions with temperatures kT = kT 0 =0.1 that skews the Gibbs-Boltzmann distribution toward optimal solutions.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…These methods are based on jumping libraries [2][3][4][5][6][7][8], long error-prone reads (such as PacBio or MinION reads) [9][10][11][12][13], homology relationship between multiple genomes [14][15][16], wet-lab experiments such as the fluorescence in situ hybridization (FISH) [17,18], genome maps [19][20][21], higher order chromatin interactions [22], and so on. Depending on the nature and accuracy of utilized information and techniques, assemblies produced by different methods may still be incomplete and contain errors, thus deviating from each other.…”
Section: Introductionmentioning
confidence: 99%