Membrane proteins are key molecules in the cell, and are important targets for pharmaceutical drugs. Few three-dimensional structures of membrane proteins have been obtained, which makes computational prediction of membrane proteins crucial for studies of these key molecules. Here, seven membrane protein topology prediction methods based on different underlying algorithms, such as hidden Markov models, neural networks and support vector machines, have been used for analysis of the protein sequences from the 21,416 annotated genes in the human genome. The number of genes coding for a protein with predicted alpha-helical transmembrane region(s) ranged from 5508 to 7651, depending on the method used. Based on a majority decision method, we estimate 5539 human genes to code for membrane proteins, corresponding to approximately 26% of the human protein-coding genes. The largest fraction of these proteins has only one predicted transmembrane region, but there are also many proteins with seven predicted transmembrane regions, including the G-protein coupled receptors. A visualization tool displaying the topologies suggested by the eight prediction methods, for all predicted membrane proteins, is available on the public Human Protein Atlas portal (www.proteinatlas.org).
Here, we present an antigen selection strategy based on a whole-genome bioinformatics approach, which is facilitated by an interactive visualization tool displaying protein features from both public resources and in-house generated data. The web-based bioinformatics platform has been designed for selection of multiple, non-overlapping recombinant protein epitope signature tags by display of predicted information relevant for antigens, including domain- and epitope sized sequence similarities to other proteins, transmembrane regions and signal peptides. The visualization tool also displays shared and exclusive protein regions for genes with multiple splice variants. A genome-wide analysis demonstrates that antigens for approximately 80% of the human protein-coding genes can be selected with this strategy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.