2021
DOI: 10.2139/ssrn.3967671
|View full text |Cite
|
Sign up to set email alerts
|

High Coverage Whole Genome Sequencing of the Expanded 1000 Genomes Project Cohort Including 602 Trios

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
59
1

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 37 publications
(60 citation statements)
references
References 46 publications
0
59
1
Order By: Relevance
“…This over-merging means that sequence differences between individuals are getting lost. Previous publications are indeed hinting towards a potential over-merge, but further investigations are needed as the number of studies investigating insertions, particularly those using long-reads, are small but rising [26][27][28][29][30]. Related to this is the overall number of SVs that one might assume in healthy human genomes (Figure 3a, Supplementary Figure 5).…”
Section: Discussionmentioning
confidence: 97%
See 1 more Smart Citation
“…This over-merging means that sequence differences between individuals are getting lost. Previous publications are indeed hinting towards a potential over-merge, but further investigations are needed as the number of studies investigating insertions, particularly those using long-reads, are small but rising [26][27][28][29][30]. Related to this is the overall number of SVs that one might assume in healthy human genomes (Figure 3a, Supplementary Figure 5).…”
Section: Discussionmentioning
confidence: 97%
“…Truvari anno trf incorporates a wrapper around tandem-repeat finder (TRF) [36]. We ran Truvari anno trf to annotate all SVs on GRCh38 that intersected the SimpleRepeats track procured from UCSC Table Browser [29]. Each intersecting variant is used to alter the SimpleRepeat reference region to reconstruct the sample's haplotype.…”
Section: Tandem Repeat Experimentsmentioning
confidence: 99%
“…We accessed VCFs from the 1000 Genomes Project ( (accessed on 10 December 2021)) [ 56 ] for our natural human population sample ( Table S1 ). These data include 2504 individuals (i.e., 5008 COL1A1 allele copies) with no known bone abnormalities (i.e., a random sample with respect to phenotypic diversity), representing 26 geographically and ethnically diverse populations (“1000G”, hereafter).…”
Section: Methodsmentioning
confidence: 99%
“…We ran ImmunoTyper-SR on 12 publicly available high-coverage WGS of 1kGP samples, sequenced at ∼30X on the Illumina NovaSeq 6000 Platform [29]. These samples have had independent de novo IGH haplotyping performed by Rodriguez et al (unpublished) using a targeted long read sequencing and assembly protocol called IGenotyper [8] tailor-made for the IGH region.…”
Section: Validation Using 1kgp Samplesmentioning
confidence: 99%
“…We have validated ImmunoTyper-SR on 12 individuals with diverse genetic backgrounds from the 1000 Genomes Project [28] (1kGP), by comparing ImmunoTyper-SR genotype calls on Illumina WGS data from the NYGC [29] against targeted long read-based IGH assemblies generated using IGenotyper [8]. We then applied ImmunoTyper-SR to WGS data from a cohort of 585 individuals from the NIAID COVID Consortium (from here on the "NIAID cohort") to investigate associations between IGHV genotypes, type I IFN autoantibodies and COVID-19 disease severity.…”
Section: Introductionmentioning
confidence: 99%