2013
DOI: 10.1093/bib/bbt057
|View full text |Cite
|
Sign up to set email alerts
|

Bioinformatics approaches for improved recombinant protein production in Escherichia coli: protein solubility prediction

Abstract: The solubility of recombinant protein expressed in Escherichia coli often represents the production yield. However, up-to-date, instances of successful production of soluble recombinant proteins in E. coli expression system with high yield remain scarce. This is mainly due to the difficulties in improving the overall production capacity, as most of the well-established strategies usually involve a series of trial and error steps with unguaranteed success. One way to concurrently improve the production yield an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
94
2

Year Published

2014
2014
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 73 publications
(96 citation statements)
references
References 38 publications
0
94
2
Order By: Relevance
“…size of the recombinant gene, composition of nucleotides), but due to the existence of several parameters and their complicated relationships, which are partially known, it is not possible to know the expected level of overexpression in advance; thus necessitating a resource intensive "trial and error" approach. The provision of a tool to conduct theoretical prediction of protein overexpression level will facilitate the development of large-scale proteomics studies [4]. To the best of our knowledge, no previous study has addressed the problem of recombinant protein overexpression level prediction.…”
Section: Recombinant Prmentioning
confidence: 99%
“…size of the recombinant gene, composition of nucleotides), but due to the existence of several parameters and their complicated relationships, which are partially known, it is not possible to know the expected level of overexpression in advance; thus necessitating a resource intensive "trial and error" approach. The provision of a tool to conduct theoretical prediction of protein overexpression level will facilitate the development of large-scale proteomics studies [4]. To the best of our knowledge, no previous study has addressed the problem of recombinant protein overexpression level prediction.…”
Section: Recombinant Prmentioning
confidence: 99%
“…Common strategies include using weak promoters and reducing E. coli culture temperature [15,16], coexpression with molecular chaperones [17,18], fusion with solubility enhancing tags [19], site-directed molecular evolution method [20] and structure-guided molecular mutagenesis if a 3D structure is available [21]. These efforts are actually very time consuming, costly and the success rate has also been relatively low [22]. In addition, they mainly concentrated on the host cells themselves at the levels of genetic manipulation or optimization of critical process parameters.…”
Section: Challenges In Process Development For Soluble Protein Expresmentioning
confidence: 99%
“…Several prediction models have been established (6,9), such as the Harrison prediction model (10), multiple linear regression (MLR) model (11), solubility index-based model (12), support vector machine-based model (13,14), PROSO model (15), SOLpro model (16), cc SOL model (17) and PROSO II model (18). These bioinformatics models can significantly reduce trial and error procedures involved in optimization of expression systems to increase the soluble expression level of heterologous proteins.…”
Section: Introductionmentioning
confidence: 99%
“…These bioinformatics models can significantly reduce trial and error procedures involved in optimization of expression systems to increase the soluble expression level of heterologous proteins. However, there has been limited application of these prediction models, partly because of the significant differences among the proteins chosen for building them and also because of the adoption of inconsistent culture conditions for expression of proteins (6,8,9). Domain antibodies (dAbs), which consist of only variable regions of heavy (V H ) or light (V L ) chains (19), have simple tertiary structures ( Fig.…”
Section: Introductionmentioning
confidence: 99%