We present a mass spectrometry (MS)-based nucleoside-specific mass-tagging method to validate genomic DNA sequences containing ambiguities not resolved by gel electrophoresis. Selected types of 13 C/ 15 N-labeled dNTPs are used in PCR amplification of target regions followed by matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF)-MS analysis. From the mass difference between the PCR products generated with unlabeled nucleosides and products containing 13 C/ 15 N-labeled nucleosides, we determined the base composition of the genomic regions of interest. Two approaches were used to verify the target regions: The first approach used nucleosides partially enriched with stable isotopes to identify a single uncalled base in a gel electrophoresis-sequenced region. The second approach used mass tags with 100% heavy nucleosides to examine a GC-rich region of a polycytidine string with an unknown number of cytidines. By use of selected 13 C/ 15 N-labeled dNTPs (dCTPs) in PCR amplification of the target region in tandem with MALDI-TOF-MS, we determined precisely that this string contains 11 cytidines. Both approaches show the ability of our MS-based mass-tagging strategy to solve critical questions of sequence identities that might be essential in determining the proper reading frames of the targeted regions.The major challenge in biology is to understand the function of living cells at the molecular level. To address this challenge, a major effort is now underway to sequence the genomes of a range of eukaryotic and prokaryotic organisms. Through these efforts, the draft sequence of the human genome has been completed very recently (Genome Consortium 2001). This draft sequence provides the foundation for the identification of gene sequences and the ultimate understanding of the functions of their protein products in both human diversity and disease development (Godovac-Zimmermann and Brown 2001). To accomplish these objectives, an accurate genome sequence is essential for the subsequent studies of gene functions; however, the current draft of the human genome sequence contains many short regions that are still unresolved mainly because of difficulties in sequencing these regions or artifacts caused by gel electrophoresis. These regions are found throughout the genome and are either primarily GC rich or consist of long repeats (Genome Consortium 2001). They form "hot spots" in the genome that are difficult to sequence and often result in inconsistent data on resequencing. It is essential that these inconsistencies be resolved accurately to prevent problems in interpreting the expression of genes in these regions.In the past decade, mass spectrometry (MS) has become an essential tool in both DNA and protein analyses, as well as the key technology in the emerging fields of proteomics and functional genomics (Godovac-Zimmermann and Brown 2001). Developed in the late 1980s, matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF)-MS (Karas and Hillenkamp 1988) provided fast and accurate measureme...