2021
DOI: 10.1101/2021.11.11.468134
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Assembly of complete diploid phased chromosomes from draft genome sequences

Abstract: De novo genome assembly is essential for genomic research. High-quality genomes assembled into phased pseudomolecules are challenging to produce and often contain assembly errors caused by repeats, heterozygosity, or the chosen assembly strategy. Although algorithms exist that produce partially phased assemblies, haploid draft assemblies that may lack biological information remain favored because they are easier to generate and use. We developed HaploSync, a suite of tools that produces fully phased, chromosom… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 80 publications
(126 reference statements)
0
6
0
Order By: Relevance
“…Several approaches have been used to try and correct haplotype switching and produce an accurate fully phased genome. Prior to HiFi reads, haplotype phasing was often highly involved and relied on single nucleotide polymorphism (SNP) data or germ cell sequencing (Minio et al, 2017;Shi et al, 2019;Campoy et al, 2020;Minio et al, 2022). Falcon and Falcon-Unzip assemblers corrected the high error rate of PacBio Continuous Long Read Sequencing (CLR) and used differences in SNPs to partition haplotypes (Chin et al, 2016).…”
Section: Discussionmentioning
confidence: 99%
“…Several approaches have been used to try and correct haplotype switching and produce an accurate fully phased genome. Prior to HiFi reads, haplotype phasing was often highly involved and relied on single nucleotide polymorphism (SNP) data or germ cell sequencing (Minio et al, 2017;Shi et al, 2019;Campoy et al, 2020;Minio et al, 2022). Falcon and Falcon-Unzip assemblers corrected the high error rate of PacBio Continuous Long Read Sequencing (CLR) and used differences in SNPs to partition haplotypes (Chin et al, 2016).…”
Section: Discussionmentioning
confidence: 99%
“…Trio binning approach differs from the applied for assemblies of other grapevine cultivars generated as pseudo-haplotypes, which exhibit inflated primary assemblies compared to the haplotigs 6,9,1114,42 . Nonetheless, recently developed bioinformatic tools help to improve the proper assignment of primary contigs to the two haplophases in those cases 16 .…”
Section: Discussionmentioning
confidence: 99%
“…In addition to the PN40024 reference genomes, several additional cultivars have seen their genomes assembled from Pacific Biosciences (PacBio) long reads using the haplotype-aware assemblers FALCON-unzip 6,9,1114 . FALCON-unzip outputs pseudo-haplotypes, which usually represent a single allele at each position without preserving phase information across the genome 15 , although tools such as HaploSync have been developed to completely phase this type of assemblies 16 . In this context, the combination of multiple sequencing technologies such as Oxford Nanopore Technologies (ONT) and short reads 21 , the use of PacBio HiFi and newer algorithms such as HiFiasm 17 , as well as information from parent-offspring trios 15 have notably improved assembly phasing and quality of grapevine genomes 4,18 19 .…”
Section: Introductionmentioning
confidence: 99%
“…BRAKER was used in default mode for RNASeq data, with an additional option for softmasked reference assemblies (--softmasking). Finally, we excluded (as possible pseudogenes) any genes with exons that overlapped annotated repeats and filtered for specific criterion related to gene integrity (Minio et al ., 2021).…”
Section: Methodsmentioning
confidence: 99%