Maika Malig scite author profile

The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

show abstract

An integrated map of structural variation in 2,504 human genomes

Sudmant¹,

Rausch²,

Gardner³

et al. 2015

Nature

2,155

124

2,439

View full text Add to dashboard Cite

Summary Structural variants (SVs) are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight SV classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype-blocks in 26 human populations. Analyzing this set, we identify numerous gene-intersecting SVs exhibiting population stratification and describe naturally occurring homozygous gene knockouts suggesting the dispensability of a variety of human genes. We demonstrate that SVs are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of SV complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex SVs with multiple breakpoints likely formed through individual mutational events. Our catalog will enhance future studies into SV demography, functional impact and disease association.

show abstract

Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations

O’Roak

Vives

Girirajan

et al. 2012

Nature

2,001

1,864

View full text Add to dashboard Cite

It is well established that autism spectrum disorders (ASD) have a strong genetic component. However, for at least 70% of cases, the underlying genetic cause is unknown1. Under the hypothesis that de novo mutations underlie a substantial fraction of the risk for developing ASD in families with no previous history of ASD or related phenotypes—so-called sporadic or simplex families2,3, we sequenced all coding regions of the genome, i.e. the exome, for parent-child trios exhibiting sporadic ASD, including 189 new trios and 20 previously reported4. Additionally, we also sequenced the exomes of 50 unaffected siblings corresponding to these new (n = 31) and previously reported trios (n = 19)4, for a total of 677 individual exomes from 209 families. Here we show de novo point mutations are overwhelmingly paternal in origin (4:1 bias) and positively correlated with paternal age, consistent with the modest increased risk for children of older fathers to develop ASD5. Moreover, 39% (49/126) of the most severe or disruptive de novo mutations map to a highly interconnected beta-catenin/chromatin remodeling protein network ranked significantly for autism candidate genes. In proband exomes, recurrent protein-altering mutations were observed in two genes, CHD8 and NTNG1. Mutation screening of six candidate genes in 1,703 ASD probands identified additional de novo, protein-altering mutations in GRIN2B, LAMC3, and SCN1A. Combined with copy number variant (CNV) data, these results suggest extreme locus heterogeneity but also provide a target for future discovery, diagnostics, and therapeutics.

show abstract

Great ape genetic diversity and population history

Prado-Martinez

Sudmant

Kidd

et al. 2013

Nature

822

1,210

View full text Add to dashboard Cite

Most great ape genetic variation remains uncharacterized; however,\ud its study is critical for understanding population history, recombination,\ud selection and susceptibility to disease.Herewe sequence\ud to high coverage a total of 79 wild- and captive-born individuals\ud representing all six great ape species and seven subspecies and report\ud 88.8 million single nucleotide polymorphisms. Our analysis provides\ud support for genetically distinct populations within each species,\ud signals of gene flow, and the split of common chimpanzees\ud into two distinct groups: Nigeria–Cameroon/western and central/\ud eastern populations.We find extensive inbreeding in almost all wild\ud populations, with eastern gorillas being the most extreme. Inferred\ud effective population sizes have varied radically over timein different\ud lineages and this appears to have a profound effect on the genetic\ud diversity at, or close to, genes in almost all species. We discover and\ud assign 1,982 loss-of-function variants throughout the human and\ud great ape lineages, determining that the rate of gene loss has not\ud been different in the human branch compared to other internal\ud branches in the great ape phylogeny. This comprehensive catalogue\ud of great ape genomediversity provides a framework for understanding\ud evolution and a resource for more effective management of wild\ud and captive great ape populations

show abstract

Mapping and sequencing of structural variation from eight human genomes

Kidd

Cooper²,

Donahue³

et al. 2008

Nature

997

1,065

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Maika Malig

A global reference for human genetic variation

An integrated map of structural variation in 2,504 human genomes

Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations

Great ape genetic diversity and population history

Mapping and sequencing of structural variation from eight human genomes

Contact Info

Product

Resources

About