We present the DNA sequence of 17,367 protein-coding genes in two Neandertals from Spain and Croatia and analyze them together with the genome sequence recently determined from a Neandertal from southern Siberia. Comparisons with present-day humans from Africa, Europe, and Asia reveal that genetic diversity among Neandertals was remarkably low, and that they carried a higher proportion of amino acid-changing (nonsynonymous) alleles inferred to alter protein structure or function than present-day humans. Thus, Neandertals across Eurasia had a smaller long-term effective population than present-day humans. We also identify amino acid substitutions in Neandertals and present-day humans that may underlie phenotypic differences between the two groups. We find that genes involved in skeletal morphology have changed more in the lineage leading to Neandertals than in the ancestral lineage common to archaic and modern humans, whereas genes involved in behavior and pigmentation have changed more on the modern human lineage.ancient DNA | exome capture | site frequency spectra | paleogenetics A nalyses of the coding regions of multiple present-day human individuals have uncovered many amino acid-changing SNPs segregating at low frequency in present-day human populations (1-7). It also has been shown that Europeans carry a larger proportion than Africans of amino acid-changing SNPs inferred to alter the structure or function of proteins and thus to be potentially deleterious (4), a fact attributed to the bottleneck and subsequent expansion of human populations during and after the migration out of Africa (4). In contrast, little is known about the coding variation in Neandertals, an extinct hominin group closely related to presentday humans. The main reasons for this are the rarity of Neandertal remains and the fact that >99% of the DNA extracted from typical Neandertal bones is derived from microbes (8, 9), making shotgun sequencing of the endogenous DNA impractical.Here, we use a hybridization approach to enrich and sequence the protein-coding fractions of the genomes of two Neandertals from Spain and Croatia. We analyze them together with the genome sequence recently determined from a Neandertal from southern Siberia (10) and show that the genetic diversity, pattern of coding variation, and genes that may underlie phenotypic changes in Neandertals are remarkably different from those in present-day humans.
Results and DiscussionWe used densely tiled oligonucleotide probes (9) and a recently described protocol (11) to capture the protein-coding exons from 17,367 genes in a ∼49,000-y-old (12) (uncalibrated radiocarbon date) Neandertal from Spain (SD1253, El Sidrón Cave) and a ∼44,000-y-old (uncalibrated radiocarbon date) Neandertal from Croatia (Vi33.15, Vindija Cave). The DNA libraries from these two Neandertals contain 0.2% and 0.5% endogenous DNA, respectively, and the capture approach enriched the endogenous DNA 325-fold and 153-fold, resulting in an average coverage of 12.5-fold and 42.0-fold for the El Sidrón and Vindi...