The simplicity and sensitivity of matrix-assisted laser desorption/ionization time-of-flight mass spectrometry have increased its application in recent years. The most common method of "peptide mass fingerprint" analysis often does not provide robust identification. Additional sequence information, obtained by post-source decay or collision induced dissociation, provides additional constraints for database searches. However, de novo sequencing by mass spectrometry is not yet common practice, most likely because of the difficulties associated with the interpretation of high and low energy CID spectra. Success with this type of sequencing requires full sequence coverage and demands better quality spectra than those typically used for data base searching. In this report we show that full-length de novo sequencing is possible using MALDI TOF/TOF analysis. The interpretation of MS/MS data is facilitated by N-terminal sulfonation after protection of lysine side chains (Keough et al., Proc. Natl. Acad. Sci. U.S. A. 1999, 96, 7131-7136). Reliable de novo sequence analysis has been obtained using sub-picomol quantities of peptides and peptide sequences of up to 16 amino acid residues in length have been determined. The simple, predictable fragmentation pattern allows routine de novo interpretation, either manually or using software. Characterization of the complete primary structure of a peptide is often hindered because of differences in fragmentation efficiencies and in specific fragmentation patterns for different peptides. These differences are controlled by various structural parameters including the nature of the residues present. The influence of the presence of internal Pro, acidic and basic residues on the TOF/TOF fragmentation pattern will be discussed, both for underivatized and guanidinated/sulfonated peptides. (J Am Soc Mass Spectrom 2004, 15, 1838 -1852) © 2004 American Society for Mass Spectrometry P rotein identification protocols in proteomics currently rely on the peptide mass fingerprinting (PMF) technique in which a (mostly gel-purified) protein is digested with an endoprotease of known cleavage specificity. The masses of the resulting peptides are measured by mass spectrometry, usually matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF), and matched to peptide masses that have been generated theoretically from proteins in databases. Notwithstanding the fact that the PMF approach is useful for identifying proteins in simple mixtures (e.g., 2D-PAGE gel spots), the identification of proteins in more complex mixtures often requires partial peptide sequence data obtained by tandem mass spectrometry (MS/MS) methods. If accurate genome sequence information is available, database search algorithms can be used in conjunction with MS/MS to readily identify proteins. For proteins not contained within sequence databases, it is necessary to determine partial or complete amino acid sequences using either manual or automated de novo peptide sequence analysis methods. This genera...