The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
We report the draft genome of the black cottonwood tree,
Populus trichocarpa
. Integration of shotgun sequence assembly with genetic mapping enabled chromosome-scale reconstruction of the genome. More than 45,000 putative protein-coding genes were identified. Analysis of the assembled genome revealed a whole-genome duplication event; about 8000 pairs of duplicated genes from that event survived in the
Populus
genome. A second, older duplication event is indistinguishably coincident with the divergence of the
Populus
and
Arabidopsis
lineages. Nucleotide substitution, tandem gene duplication, and gross chromosomal rearrangement appear to proceed substantially more slowly in
Populus
than in
Arabidopsis. Populus
has more protein-coding genes than
Arabidopsis
, ranging on average from 1.4 to 1.6 putative
Populus
homologs for each
Arabidopsis
gene. However, the relative frequency of protein domains in the two genomes is similar. Overrepresented exceptions in
Populus
include genes associated with lignocellulosic wall biosynthesis, meristem development, disease resistance, and metabolite transport.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.