Identification of protein structural neighbors to a query is fundamental in structure and function prediction. Here we present BS-align, a systematic method to retrieve backbone string neighbors from primary sequences as templates for protein modeling. The backbone conformation of a protein is represented by the backbone string, as defined in Ramachandran space. The backbone string of a query can be accurately predicted by two innovative technologies: a knowledge-driven sequence alignment and encoding of a backbone string element profile. Then, the predicted backbone string is employed to align against a backbone string database and retrieve a set of backbone string neighbors. The backbone string neighbors were shown to be close to native structures of query proteins. BS-align was successfully employed to predict models of 10 membrane proteins with lengths ranging between 229 and 595 residues, and whose high-resolution structural determinations were difficult to elucidate both by experiment and prediction. The obtained TM-scores and root mean square deviations of the models confirmed that the models based on the backbone string neighbors retrieved by the BS-align were very close to the native membrane structures although the query and the neighbor shared a very low sequence identity. The backbone string system represents a new road for the prediction of protein structure from sequence, and suggests that the similarity of the backbone string would be more informative than describing a protein as belonging to a fold. Molecular & Cellular Proteomics 11: 10.1074/mcp. M111.016808, 1-7, 2012.Determining the structures of membrane proteins remains a relatively unexplored frontier in structural biology (1). Current computational methods include de novo protein modeling and comparative modeling. Compared with de novo modeling, the comparative modeling is more successful when sequence homologies are available. However, because relatively few membrane proteins have been identified through experimentation, building membrane protein conformations remain an extremely difficult and daunting undertaking.In comparative modeling or other methods based on known protein structures, the identification of the best structure neighbor (template), if indeed any are available, is critical. The typical method of template identification relies on serial pairwise sequence alignments aided by database search engines such as FASTA (2) and BLAST (3). More sensitive methods based on multiple sequence alignments, including. PSI-BLAST (4), CLUSTALW (5), and HMMER (6), are available. MSAs have been shown to produce a greater number of potential templates and to better identify templates for sequences that have homologue relationships to other solved structures. However, when there is no significant homology found, most of the target-template pairs are evolutionarily too distant to be detected with the current threading approaches (7). If no other information about the target is known, aside from the sequence, it becomes difficult to identify possi...