The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Biobanks and archived datasets collecting samples and data have become crucial engines of genetic and genomic research. Unresolved, however, is what responsibilities biobanks should shoulder to manage incidental findings (IFs) and individual research results (IRRs) of potential health, reproductive, or personal importance to individual contributors (using “biobank” here to refer to both collections of samples and collections of data). This paper reports recommendations from a 2-year, NIH-funded project. The authors analyze responsibilities to manage return of IFs and IRRs in a biobank research system (primary research or collection sites, the biobank itself, and secondary research sites). They suggest that biobanks shoulder significant responsibility for seeing that the biobank research system addresses the return question explicitly. When re-identification of individual contributors is possible, the biobank should work to enable the biobank research system to discharge four core responsibilities: to (1) clarify the criteria for evaluating findings and roster of returnable findings, (2) analyze a particular finding in relation to this, (3) re-identify the individual contributor, and (4) recontact the contributor to offer the finding. The authors suggest that findings that are analytically valid, reveal an established and substantial risk of a serious health condition, and that are clinically actionable should generally be offered to consenting contributors. The paper specifies 10 concrete recommendations, addressing new biobanks and biobanks already in existence.