2022
DOI: 10.1101/2022.12.01.518658
Preprint

Assembly of 43 diverse human Y chromosomes reveals extensive complexity and variation

Abstract: The prevalence of highly repetitive sequences within the human Y chromosome has led to its incomplete assembly and systematic omission from genomic analyses. Here, we present long-read de novo assemblies of 43 diverse Y-chromosomes, three contiguously assembled including two from deep-rooted African Y lineages. Examination of the full extent of genetic variation between Y chromosomes across 180,000 years of human evolution reveals its remarkable complexity and diversity in size and structure, in contrast with …

Cited by 22 publications (35 citation statements)
References 129 publications
“…In contrast, our T2T-Y assembly resolved 46 protein-coding TSPY copies, including TSPY2, which was found in the distal part of the IR3, downstream of the TSPY array (at ~10 Mb). The distal positioning of TSPY2 in HG002 was confirmed among all other Y haplogroups except R and Q, which match the proximal positioning of GRCh38-Y 33. All 45 protein-coding copies in the TSPY array were embedded in an array of composite repeat units, with one composite unit (~20.2 kb in size) per gene, such that an array of composite units includes multiple TSPY gene copies in tandem (Fig.…”
Section: Structure of the TSPY Ampliconic Gene Family
Citation type: supporting
confidence: 62%
“…Because the validation signal at the three HSat positions was ambiguous, these regions were noted but left unchanged. The P5 inversion error was discovered only after the T2T-Y assembly had been fully annotated and released, and because this inversion appears as a true recurrent inversion in other individuals 33, it was noted but left uncorrected in this release. The described T2T-Y assembly is 62,460,029 bases in length with no gaps or model sequences and an estimated error rate of less than 1 error per 10 Mb (Phred Q73.8), as measured by Merqury using a hybrid k-mer set from Illumina and HiFi reads 14,15 (Table 1, Supplementary Table 3).…”
Section: Assembly Validation and Annotation of T2T-Y
Citation type: mentioning
confidence: 89%
“…On the Y chromosome, deletions within the ampliconic regions have been previously linked to infertility 109,110. Additional intraspecific studies, comparing the complete sex chromosomes of multiple individuals within each species (as was recently done for humans 111), are now needed to reveal the full landscape of ape sex chromosome evolution and function.…”
Section: Discussion
Citation type: mentioning
confidence: 99%
“…However, the Y chromosome has long been a thorn in the side of human geneticists: More than half of the Y chromosome is absent from GRCh38 76. Two recent papers used combinations of multiple long-read next-generation sequencing technologies to generate much more complete sequence of the Y chromosome, and they also described a high degree of heterogeneity in chromosome length and content between individuals 76,77.…”
Section: The Y Chromosome
Citation type: mentioning
confidence: 99%