About 25% of the genes in the fully sequenced and annotated Arabidopsis genome have structures that are predicted solely by computer algorithms with no support from either nucleic acid or protein homologs from other species or expressed sequence matches from Arabidopsis. These are referred to as "hypothetical genes." On chromosome 2, sequenced by The Institute for Genomic Research, there are approximately 800 hypothetical genes among a total of approximately 4,100 genes. To test their expression under various growth conditions and in specific tissues, we used six cDNA populations prepared from cold-treated, heat-treated, and pathogen (Xanthomonas campestris pv campestris)-infected plants, callus, roots, and young seedlings. To date, 169 hypothetical genes were tested, and 138 of them are found to be expressed in one or more of the six cDNA populations. By sequencing multiple clones from each 5Ј-and 3Ј-rapid amplification of cDNA ends (RACE) product and assembling the sequences, we generated full-length sequences for 16 of these genes. For 14 genes, there was one full-length assembly that precisely supported the intron-exon boundaries of their gene predictions, adding only 5Ј-and 3Ј-untranslated region sequences. However, for three of these genes, the other assemblies represent additional exons and alternatively spliced or unspliced introns. For the remaining two genes, the cDNA sequences reveal major differences with predicted gene structures. In addition, a total of six genes displayed more than one polyadenylation site. These data will be used to update gene models in The Institute for Genomic Research annotation database ATH1.With the combined efforts of scientists from Europe, Japan, and the United States, the first higher plant genome-sequencing project, whole-genome sequencing of Arabidopsis has been completed. The sequences of chromosome 2 and 4 were first released in 1999 (Lin et al., 1999;Mayer et al., 1999), and the remaining three chromosomes were sequenced by the end of 2000 (Salanoubat et al., 2000;Tabata et al., 2000;Theologis et al., 2000). This provides scientists a wealth of information and knowledge with which to understand plant biology from a genomic perspective. The whole Arabidopsis genome encodes approximately 25,000 genes (The Arabidopsis Genome Initiative, 2000) and the functional analysis of these genes is a major challenge in this post-sequencing era. One approach to this, taken by several groups (Ceres, Stanford Genome Center, Salk Institute, Plant Gene Expression Center [in collaboration with RIKEN Genomic Sciences Center], and Institut National de la Recherche Agronomique/Genoplante) is to produce full-length cDNAs for all of the 25,000ϩ genes in the Arabidopsis genome, because these complete sequences are essential to fully understand their structure and function (Seki et al., 2001a(Seki et al., , 2001b. Recent comparison of full-length cDNAs from Ceres and SSP with previous genomic annotation revealed that the structures of about one-third of the genes could be improved based on c...