The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
Assessment and characterization of gut microbiota has become a major research area in human disease, including type 2 diabetes, the most prevalent endocrine disease worldwide. To carry out analysis on gut microbial content in patients with type 2 diabetes, we developed a protocol for a metagenome-wide association study (MGWAS) and undertook a two-stage MGWAS based on deep shotgun sequencing of the gut microbial DNA from 345 Chinese individuals. We identified and validated approximately 60,000 type-2-diabetes-associated markers and established the concept of a metagenomic linkage group, enabling taxonomic species-level analyses. MGWAS analysis showed that patients with type 2 diabetes were characterized by a moderate degree of gut microbial dysbiosis, a decrease in the abundance of some universal butyrate-producing bacteria and an increase in various opportunistic pathogens, as well as an enrichment of other microbial functions conferring sulphate reduction and oxidative stress resistance. An analysis of 23 additional individuals demonstrated that these gut microbial markers might be useful for classifying type 2 diabetes.
We performed a genome-wide association study (GWAS) of systemic lupus erythematosus (SLE) in a Chinese Han population by genotyping 1,047 cases and 1,205 controls using Illumina Human610-Quad BeadChips and replicating 78 SNPs in two additional cohorts (3,152 cases and 7,050 controls). We identified nine new susceptibility loci (ETS1, IKZF1, RASGRP3, SLC15A4, TNIP1, 7q11.23, 10q11.22, 11q23.3 and 16p11.2; 1.77 x 10(-25) < or = P(combined) < or = 2.77 x 10(-8)) and confirmed seven previously reported loci (BLK, IRF5, STAT4, TNFAIP3, TNFSF4, 6q21 and 22q11.21; 5.17 x 10(-42) < or = P(combined) < or = 5.18 x 10(-12)). Comparison with previous GWAS findings highlighted the genetic heterogeneity of SLE susceptibility between Chinese Han and European populations. This study not only advances our understanding of the genetic basis of SLE but also highlights the value of performing GWAS in diverse ancestral populations.
Therapeutic proteins and antibodies represent a $125 billion annual market. Chinese Hamster Ovary (CHO) derived cell lines are the preferred host cells for the production of therapeutic proteins. Here, we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45Gb genomic sequence with 24,383 predicted genes. We associate most scaffolds to 21 microfluidically-isolated chromosomes to identify chromosomal locations of genes. Furthermore, we investigate genes involved in glycosylation, which affects therapeutic protein quality, and viral susceptibility genes, which affect cell engineering and regulatory concerns. Specifically, homologs for most human glycosylation-associated genes are identified in the CHO-K1 genome, although 141 are not expressed under exponential growth. In addition, many important viral entry genes are present in the genome but not expressed, which may explain the unusual viral resistance property of CHO cell lines. We demonstrate how the availability of this genome sequence may facilitate genome-scale science for biopharmaceutical protein production.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.