Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G1C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.
Nucleolytic processing of chromosomal DNA is required in operations such as DNA repair, recombination and replication. We have identified a human gene, named HEX1 forhumanexonuclease 1, by searching the EST database for cDNAs that encode a homolog to the Saccharomyces cerevisiae EXO1 gene product. Based on its homology to this and other DNA repair proteins of the Rad2 family, most notably Schizosaccharomyces pombe exonuclease 1 (Exo1), Hex1 presumably functions as a nuclease in aspects of recombination or mismatch repair. Similar to the yeast proteins, recombinant Hex1 exhibits a 5'-->3' exonuclease activity. Northern blot analysis revealed that HEX1 expression is highest in fetal liver and adult bone marrow, suggesting that the encoded protein may operate prominently in processes specific to hemopoietic stem cell development. HEX1 gene equivalents were found in all vertebrates examined. The human gene includes 14 exons and 13 introns that span approximately 42 kb of genomic DNA and maps to the chromosomal position 1q42-43, a region lost in some cases of acute leukemia and in several solid tumors.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.