The novel transcription system of bacteriophage T7 was used to express Escherichia coli genes preferentially with a new low-copy-number plasmid vector, pFN476, to minimize toxic gene effects. Selected E. coli chromosomal fragments from an ordered genomic library (Y. Kohara, K. Ikiyama, and K. Isono, Cell 50: [495][496][497][498][499][500][501][502][503][504][505][506][507][508] 1987) were recloned into this vector, and their genes were preferentially expressed in vivo utilizing its T7 promoter. The protein products were analyzed by two-dimensional gel electrophoresis. By using DNA sequence information, the gel migration was predicted for the protein products of open reading frames from these segments, and this information was used to identify gene products visualized as spots on two-dimensional gels. Even in the absence of DNA sequence information, this approach offers the opportunity to identify all gene products of E. coli and map their genes to within 10 kb on the E. coli genome; with sequence information, this approach can produce a definitive expression map of the E. coil genome.A central goal in biology is achieving a cellular paradigm, i.e., solving the structure of some particular cell and describing how it functions. This intellectual feat will require the best efforts of reductionist approaches to learn the parts of a cell and how they function and the best efforts of systems analysis to discover how the parts are welded together into a functioning unit.There is little question that the bacterium Escherichia coli offers the best opportunity for achieving this paradigm. This is true, first, because of the vast amount of information gained from half a century of intensive study of this organism. From mapping, sequencing, and identification of genes to investigation of gene products, metabolic pathways, gene regulation, and global regulatory networks, the wealth of our acquired knowledge of this organism far surpasses that of any other (11). The second advantage of E. coli is its unmatched accessibility by genetic, biochemical, and molecular techniques that facilitate both in vivo and in vitro analysis of its genome, its metabolism, and its higher-order functions of chemotaxis, stress reactions, genetic exchange, cell division, and molecular transport and translocation.Our laboratory has been committed to developing a cellular protein data base for E. coli as a system of organizing and storing the vast amount of information about this cell. This data base is genomically linked, and the goal is ultimately to trace every protein to its structural gene and to account for every gene of the cell (23).Recent advances in DNA cloning and gene expression, in combination with two-dimensional (2D) gel electrophoresis to resolve the cell's complement of proteins, now make it feasible to create a complete linkage between a cellular protein data base and the genome of E. coli. In 1987 Kohara * Corresponding author.t Present address: