An important step toward improving the annotation of the human genome is to identify cis-acting regulatory elements from primary DNA sequence. One approach is to compare sequences from multiple, divergent species. This approach distinguishes multispecies conserved sequences (MCS) in noncoding regions from more rapidly evolving neutral DNA. Here, we have analyzed a region of Ϸ238kb containing the human ␣ globin cluster that was sequenced and͞or annotated across the syntenic region in 22 species spanning 500 million years of evolution. Using a variety of bioinformatic approaches and correlating the results with many aspects of chromosome structure and function in this region, we were able to identify and evaluate the importance of 24 individual MCSs. This approach sensitively and accurately identified previously characterized regulatory elements but also discovered unidentified promoters, exons, splicing, and transcriptional regulatory elements. Together, these studies demonstrate an integrated approach by which to identify, subclassify, and predict the potential importance of MCSs.conserved noncoding elements ͉ gene regulation ͉ globin gene ͉ comparative genomics W ith the development of multiple computational algorithms and appropriate biological data, it is now possible to detect coding sequences relatively efficiently and accurately from primary DNA sequence (1). By contrast, it is still difficult to identify and assign function to noncoding sequences that include critical cisacting regulatory elements such as promoters, enhancers, silencers, locus control elements, nuclear matrix attachment sites, and origins of replication. A promising approach is to search for conserved orthologous noncoding sequences from multiple, evolutionarily diverse species (2-8), so-called multispecies conserved sequences (MCSs). However, at present, it is not established how best to design and carry out such searches. Furthermore, even when complete, it is not clear how to evaluate and prioritize MCSs for functional analysis.An important issue is to know which species to include in searches for MCSs. When comparing the sequences of distantly related species [e.g., birds and mammals, which diverged 310 million years (my) ago], cis elements may have diverged too much to be easily identified. By contrast, when comparing more closely related species (e.g., rodents and primates, which diverged 64-74 my ago), it may be difficult to distinguish between sequence homology resulting from a slow rate of evolution and the conservation of functionally important cis elements. Nevertheless, it is becoming clear that analyzing multiple species in different combinations adds significant power to the identification of functionally important cis-acting sequences (2, 3).Even when MCS can be unequivocally identified, there is often no way to judge how sensitive the searches have been. Do these routines overcall or undercall regulatory elements? Furthermore, the true functional significance of MCSs often remains untested and͞or unknown because there is u...