2018
DOI: 10.1038/s41588-018-0273-y
|View full text |Cite|
|
Sign up to set email alerts
|

Assembly of a pan-genome from deep sequencing of 910 humans of African descent

Abstract: We used a deeply sequenced dataset of 910 individuals, all of African descent, to construct a set of DNA sequences present in these individuals but missing from the reference human genome. We aligned 1.19 trillion reads from the 910 individuals to the reference genome (GRCh38), collected all reads that failed to align, and assembled these reads into contiguous sequences (contigs). We then compared all contigs to one another to identify a set of unique sequences representing regions of the African pan-genome mi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

10
309
2

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
5

Relationship

1
9

Authors

Journals

citations
Cited by 302 publications
(341 citation statements)
references
References 36 publications
10
309
2
Order By: Relevance
“…Since these insertions are present and homozygous in all 100 genotyped individuals, the reference sequence reflects either rare deletion or errors in GRCh38 36 . Specifically, the 1,638 bp exonic insertion in UBE2QL1 was also reported at high frequency in two previous studies 37,38 .…”
Section: Genotyping In Tandem Repeatssupporting
confidence: 51%
“…Since these insertions are present and homozygous in all 100 genotyped individuals, the reference sequence reflects either rare deletion or errors in GRCh38 36 . Specifically, the 1,638 bp exonic insertion in UBE2QL1 was also reported at high frequency in two previous studies 37,38 .…”
Section: Genotyping In Tandem Repeatssupporting
confidence: 51%
“…In spite of the growing number of reports of pan-genomes in eukaryotes and the recent extension of such investigations to human populations (32), so far the impact of PAV on the intraspecific diversity of animals has been presumed to be minimal, and sometimes linked to the development of pathologies (38). To the best of our knowledge, PAV has been only marginally explored in bivalves as a phenomenon linked to gene families involved in immune functions, i.e., big defensins in Crassostrea gigas (39) and myticalins in M. galloprovincialis (40).…”
Section: The Mussel Pan-genomementioning
confidence: 99%
“…Hundreds of thousands of human genomes have been sequenced worldwide and ongoing consortiums and individual projects are sequencing many more 1,2 . The exponential growth of human whole-genome sequencing (WGS) data have already allowed the creation of population or cohort-specific human sequences to catalogue the genomic variation across different populations [3][4][5] . One of the main aims of pan-genome studies is to uncover novel non-reference sequences (NRSs) or variants not represented in the human reference genome (GRCh38).…”
Section: Introductionmentioning
confidence: 99%