Indian Journal of Animal Research

  • Chief EditorK.M.L. Pathak

  • Print ISSN 0367-6722

  • Online ISSN 0976-0555

  • NAAS Rating 6.40

  • SJR 0.263

  • Impact Factor 0.4 (2024)

Frequency :
Monthly (January, February, March, April, May, June, July, August, September, October, November and December)
Indexing Services :
Science Citation Index Expanded, BIOSIS Preview, ISI Citation Index, Biological Abstracts, Scopus, AGRICOLA, Google Scholar, CrossRef, CAB Abstracting Journals, Chemical Abstracts, Indian Science Abstracts, EBSCO Indexing Services, Index Copernicus
Indian Journal of Animal Research, volume 54 issue 6 (june 2020) : 653-660

Recombinant protein expression optimization in Escherichia coli: A review

N. Hemamalini1, S. Ezhilmathi1,*, A. Angela Mercy1
1Department of Aquaculture, Dr. M.G.R Fisheries College and Research Institute, Ponneri-601 204, Tamil Nadu, India.
2Department of Fisheries Biotechnology, Institute of Fisheries post graduate Studies, Chennai-603 103, Tamil Nadu, India.
Cite article:- Hemamalini N., Ezhilmathi S., Mercy Angela A. (2019). Recombinant protein expression optimization in Escherichia coli: A review . Indian Journal of Animal Research. 54(6): 653-660. doi: 10.18805/ijar.B-3808.
Escherichia coli is the most extensively used organism in recombinant protein production. It has several advantages including a very short life cycle, ease of genetic manipulation and the well-known cell biology etc. which makes E. coli as the perfect host for recombinant protein expression. Despite many advantages, E. coli also have few disadvantages such as coupled transcription and translation and lack of eukaryotic post-translational modifications. These challenges can be overcome by adopting several strategies such as, using different E. coli expression vectors, changing the gene sequence without altering the functional domain, modified E. coli strain usage, changing the culture parameters and co-expression with a molecular chaperone. In this review, we present the level of strategies used to enhance the recombinant protein expression and its stability in E. coli.
Recombinant protein plays a major role in all aspects of biological science. Selection of a suitable expression system is one of the most important strategies for the production of recombinant protein. Although many nonbacterial expression systems such as yeast, baculovirus, a mammalian cell and cell-free systems have been successfully applied for the production of protein. Escherichia coli still remains the most preferred organism of choice (Chen, 2012). The advantages of E. coli are its fast growth, relatively high protein yields, low cost, easy handling and versatile strains for the production of demanding target proteins (Yin et al., 2007). However, E. coli also has several disadvantages including lack of eukaryotic post-translational modifications, production of recombinant products in a nonfunctional state / insoluble expression of proteins, tightly coupled transcription and translation, lack of required cell machinery and overexpression of heterologous proteins in the cytoplasm can result in the formation unfolded / misfolded protein and inclusion bodies (Zhang et al., 2004). Some targets fail to express in E. coli or express insolubly as inclusion bodies. Heterologous protein production needs longer time and requires molecular chaperones to fold correctly. In recent years’ considerable efforts have been taken to enhance solubilization in E. coli. To successfully overcome these disadvantages five levels of strategies can be followed; (1) Optimization of target DNA, (2) Changing the vector, (3) Changing the host, (4) Changing the culture parameters of the recombinant host, (5) Co-expression with other genes which may help to increase expression and the proper folding of desired protein. This review mainly focuses on the strategies which are used for the successful production of recombinant proteins in E. coli.
 
Strategies for the production of recombinant protein
(I) Optimization of target dna
 
Properties of the gene that affects the production of soluble proteins are, the presence of rare codons in the target mRNA, size of the protein and the protein sequence. Rare or low usage codons have been found in many organisms including E. coli. The rarest codons such as AGG, AGA (Arginine), CUA (leucine), AUA (isoleucine) and CCC (proline) of E. coli are playing an important role in expression regulation of different proteins (Grunberg-Manago, 1999; Harrison, 2000). The relative positions of these rare codons in the target leads to the suppression of the protein expression (Lee et al., 1987b; Dumon-Seignovert et al., 2004; Hu et al., 2011). This suppression mostly occurs at the translational level, due to unavailability of the cognate tRNAs for those rare codons.
        
Moreover, if they expressed also, these rare codon rich targets incorrectly translated and high level of misincorporation will occur, mostly lysine will be incorporated instead of arginine (Calderone et al., 1996). The codon bias of E. coli can overcome this problem. Nowadays, multiple websites are available to identify the location of the rare codons in the target genome and also to quantify them, eg. The Rare Codon Calculator (RaCC). Two approaches can be taken to overcome the rare codon issue. The first method is codon optimization of the target gene. Codon optimization is a key step for the successful expression of protein in E. coli even from a distant host (Snajder et al., 2015). Indeed, over the last decade, the use of codon-optimized genes in industrial biotechnology has reduced the cost of protein production, through improved protein expression (Elena et al., 2014). Site-directed mutagenesis or gene synthesizing is the way for codon optimization. The later is done by multiple companies and this method is often faster and cheaper than site-directed mutagenesis. However, gene synthesis not only changes the rare codons but also changes the secondary structure of mRNA, which affects translation efficiency (Hatfield and Roth, 2007; Burgess-Brown et al., 2008; Welch et al., 2009; Menzella, 2011). In the second method, the expression host is altered such a way that they can express the target which contains rare codon in it (Francis and Page, 2010).
        
Signal peptides are usually involved in the export of protein from the site of synthesis. Matured endogenous protein does not have the signal peptide. The recombinant protein which contains signal peptides may alter its function and biochemical characteristics. So, removing the signal peptide coding sequence from the target protein sequence increases the stability and expression of the recombinant protein (Gopal and Kumar, 2013).
 
(II) Changing the properties of a vector
 
Once the target gene is ready, it should be subcloned into a vector which contains all the elements essential for transcription and translation of the target (Studier and Moffatt, 1986). E. coli expression vector should contain fusion tags and other DNA sequence elements include the origin of replication, promoters, regulatory elements, transcription terminators, antibiotic-resistant gene, etc.
 
Origin of replication
 
Origin of replication is a particular sequence at which replication is initiated. The origin of replication is considered important when conducting the co-expression experiment in which two different plasmids with different protein sequence are expressed in the same expression host (Johnston and Marmorstein, 2003). In such a situation, the origin of replication should be different for allowing the expression of both the proteins.
 
Selecting the suitable promoters
 
An effective promoter for protein expression should possesses some key characteristics, i.e. should be strong enough to allow the accumulation of recombinant protein to ≥ 10-30% of the total cellular proteins, should exhibit minimal basal transcriptional activity, should enable simple and inexpensive induction (Jia and Jeon, 2016 ). The stringent regulation of promoter is essential for the synthesis of proteins which are detrimental to the host cell. Selection of the appropriate promoter entirely depends on the nature of the target protein and its downstream use. If the target is a toxic protein, one should use the promoter system that has an extremely low basal expression (araBAD promoter). On the other hand, for higher yields, a strong promoter such as T7 or tac promoter should be selected (Lee et al., 1987b). Cold-shock promoters are used for aggregation-prone proteins so that expression can also occur at low temperature.
        
Nowadays, the promoters are genetically engineered for improving expression of recombinant protein in the host cell. These engineered promoters are showing three to four-fold increase in the activity than the natural promoters. A mutant promoter library was constructed with the randomization of E. coli consensus promoter sequences. The mutant promoter library exhibited 27.5 fold higher activities than the lac promoter (DeMey et al., 2007). Recent advances in the field of genetic engineering have paved the way for the system that can utilize two promoters which helps in the production of two different recombinant proteins simultaneously in the same expression system (Joseph et al., 2015). Some of the most commonly used promoters are T7 RNA promoter, araBAD promoter, cspA promoter and the hybrid promoters.
 
a. T7 promoter
 
In T7 promoter gene expression is driven by T7 RNA Polymerase of T7 bacteriophage. It is the most widely used promoter system for heterogeneous expression in E. coli (Gräslund et al., 2008). It can able to transcribe the DNA five times faster than bacterial RNA polymerase (Studier, 1991). This enzyme is absent in E. coli, so it has to be delivered from the external via an inducible promoter (Studier and Moffatt, 1986). IPTG addition drives the T7 RNA polymerase transcription and synthesize. T7 RNA polymerase initiates transcription of the target gene by binding to the T7 promoter. If the inducer is absent, the lacUV5 promoter controls the T7 RNA polymerase gene, so there will be no transcription occurs (Studier, 1991). Once the system is activated, it can accumulate up to 50% of total cell protein (Studier and Moffatt, 1986; Studier, 1991). Leakey expression is one of the major drawbacks of this promoter. However, even minimal production of RNA polymerase leads to leaky expression of target proteins which are toxic to the hosts. To overcome this problem one can use the bacterial strains which are specially developed for the expression of toxic proteins (Moffatt and Studier, 1987).
 
b. araBAD promoter
 
The araBAD promoter is also known as the arabinose promoter. araBAD is a strong, tightly regulated and titrable promoter system (Lee et al., 1987b). This promoter is mainly used for expressing highly toxic proteins. It exhibits the lowest basal transcriptional activity. L-arabinose acts as the inducer for this promoter (Lee et al., 1987b). The absence of L-arabinose or addition of glucose will suppress the expression of the protein.
 c. cspA promoter
 
cspA promoter is known as cold-shock protein promoter. It is efficiently expressed in low temperature and the expression optimal between 10°C to 25°C. It can reduce the formation of inclusion bodies and improves folding and it is suitable for expressing aggregation-prone proteins (Francis and Page, 2010). The major drawback of this promoter is that it does not completely repress at a higher temperature which leads to basal target protein expression (Qing et al., 2004).
 
d. Hybrid promoters
 
The trc and tac promoters are hybrids of naturally occurring trp and lacUV5 promoter. This promoters consisting of -10 regions of the lacUV5 promoter and the -35 regions of the trp promoter (Khlebnikov and Keasling, 2002). The spacing between -35 and -10 sequences are 16 bp in tac promoter whereas trc promoter has 17 bp spacing. This promoter can accumulate 15%-30% of the total cell protein. The only disadvantage of this promoter system is a very leaky expression of the target protein. Therefore, this promoter cannot be used for toxic protein expression (Brosius et al., 1985).
 
3. Selection of appropriate terminator
 
Termination of transcription plays an important role in the host cellular energy minimization. Transcription termination in prokaryotes is based on two different mechanisms; they are 1) Rho-dependent and 2) Rho-independent, in Rho-dependent mechanism the hexameric protein rho will help to release the RNA transcript from template, while in case of rho-independent mechanism, transcription termination entirely depends on the signals which are present in the template (Richardson and Roberts, 1993; Yang et al., 1995).  Transcription termination reduces the metabolic burden of the host and also forms secondary structure at 3' end of the mRNA which will increase the stability of mRNA (Newbury et al., 1987). Promoter occlusion is one of the criteria which inhibits its function. By inserting the transcription terminator in downstream or in upstream of the coding sequence will prevent continuous transcription with another promoter and also minimize background transcription (Tohru et al., 1994). Stop codon usage plays a vital role in the regulation of gene expression. Universal stop codons are TAA, TGA and TAG, sequence analysis for several genes in E. coli reveals that TAA is the major codon used. TAA can be read by both the release factors and as a stop codon it will not only secure termination but also ensures the termination at high speed with accuracy (Saida et al., 2006).
 
4. Fusion tags
 
Fusion tags are generally divided into purification and solubility tags. Affinity tags allow rapid and efficient purification of proteins. While the solubility tags enhance the proper folding and solubility of a protein. Hence, they are frequently used in tandem with an affinity tag to aid purification (Zhao et al., 2013). Some of the commonly used fusion tags are Glutathione S-transferase (GST), Maltose-binding protein (MBP), N-utilization substance (NusA) and Small ubiquitin modifier (Sumo). They have been widely reported to increase protein expression and solubility (Esposito and Chatterjee, 2006).
 
a. Glutathione S-transferase (GST)
 
The GST is 211 amino acids (roughly 28 kDa) in size. Glutathione S-transferase tags are generally used for expression and purification and which is not used as a solubility enhancing tag (Esposito and Chatterjee, 2006; Brown et al., 2008). GST tags are widely used due to their specific and robust binding towards glutathione agarose and allow single-step purification of expressed protein (Smith and Johnson, 1988).
 
b. Maltose-binding protein (MBP)
 
MBP is 396 amino acids, approximately 42 kDa in size. MBP is naturally present in E. coli which is encoded by the malE gene. In E. coli it is responsible for the uptake, transport, and breakdown of maltodextrin (Routzahn and Waugh, 2002; Nallamsetty et al., 2005). It significantly enhances the solubility of a recombinant protein, which can even solubilize misfolded / unfolded protein (Francis and Page, 2010; Hewitt et al., 2011). MBP functions as an affinity tag as it binds efficiently to other sugars and enables protein purification. MBP can be used at both the ends of a protein so it can enhance the solubility at both N- and C- terminal end (Dyson et al., 2004; Francis and Page, 2010).
 
c. N-utilization substance (NusA)
 
N-utilization substance (NusA) is a recently developed tag to enhance the solubility of a diverse set of proteins. NusA is 535 amino acids (59 kDa) in size. It is a transcription termination/ antitermination factor in E. coli. NusA is not an affinity tag, so it has to be coupled with His 6-tag to facilitate protein purification (DeMarco, 2006).
 
d. Small ubiquitin modifier (Sumo)
 
SUMO is a prokaryotic expression system. It was developed based on the observation that the addition of ubiquitin to the recombinant protein facilitates its solubility (Peroutka et al., 2011). Several studies have been demonstrated the expression and solubility of Small ubiquitin modifier tag. Size of the Sumo tag is 11.2 kDa; the size of the tag is tiny compared to other tags. One of the advantages of Sumo is it has its specific protease, Ulp which recognizes the tertiary structure of SUMO protein rather than a specific amino acid sequence and cleaves immediately after the C- terminal residue of the SUMO protein (Butt et al., 2005). The only disadvantage of SUMO protease is if proline is an N- terminal amino acid of the target protein it restricts the SUMO protease active site access.
 
(III) Changing the host
 
The efficiency of protein expression depends on the appropriate selection of expression host (Joseph et al., 2015). Although a number of expression hosts are available for protein production, the standard in the field still remains E. coli. Nowadays, so many companies provide different types of genetically altered E. coli strains as per suitability of expression of foreign genes. Many bacterial hosts were selected and tested for efficient protein expression and some of those strains were modified to improve the protein expression (Bass and Yansura, 2000). Commercially available E. coli strains are specifically designed to express proteins that contain rare codons, are susceptible to proteolysis or require disulfide bond. BL21 is one of the widely used strain to check the basic protein expression in E. coli.
 
Protease-deficient strains
1.  BL21(DE3)
 
This E. coli strain has T7 polymerase encoding gene introduced in its genome as well as deprived of Lon and OmpT protease. OmpT-is a bacterial endoprotease that readily cleaves T7 RNA polymerase. Lon protease is an ATP-dependent enzyme that rapidly degrades misfolded and recombinant proteins. Deletion of these two genes correlated with increased protein expression (Gottesman, 1990).
 
2.  BL21Star (DE3)
 
The protein yield also depends on the stability of the corresponding mRNA. rne encodes RNase E, an enzyme that functions as an essential part of the “degradosome” to actively degrade mRNA within the cell. BL21Star (DE3) (Invitrogen) is a derivative of the BL21 (DE3) strain. This strain contains an additional mutation in the rne 131 gene. The use of BL21Star (DE3) strain increases mRNA stability, which in turn increases protein expression (Carpousis, 2007).
 
Codon-supplemented strains
 
The codon frequency difference between the target gene and the expression host can lead to premature translation termination, translational stalling and amino acid misincorporation. Rare codons are codon for arginine (AGA, AGG), isoleucine (AUA), leucine (CUA) and proline (CCC). There are two approaches followed to overcome rare codon associated problems. They are 1. Changes are made to the gene and 2. Changes are made to the expression host; E. coli, expression strains supplemented with the rare tRNAs. These rare tRNAs are co-expressed with the wild-type (non-optimized) target gene.
 
1.  CodonPlus-RIL (BL21-RIL)
 
CodonPlus-RIL strains have tRNAs that restrict translation of heterologous proteins from organisms that have AT-rich genomes (Joseph et al., 2015). BL21 strains are engineered to contain extra copies of the gene that encodes rare tRNAs for Arg, Ile and Leu (Rosano and Ceccarelli, 2009).
 
2.  CodonPlus-RP (BL21-RP)
 
Strain is used to overcome GC rich genome bias. These bacterial strains contain extra copies of the ileY, argU and leuW tRNA genes. These genes encode tRNAs that recognize the codon of isoleucine, arginine and the leucine, (Joseph et al., 2015).
 
3. Rosetta
 
Host strains are derivatives of BL21(DE3) strain, designed to enhance the expression of proteins which contains rare codons used in E.coli (Joseph et al., 2015); these strains contain pRARE plasmid, which supplies tRNAs for all the above-mentioned codons plus GGA (Gly). Use of this strain will increase the heterologous protein expression, but it will decrease protein solubility (Milisavljević et al., 2009).
 
Strains to express disulfide-bonded proteins
 
Mutation in glutathione reductase (gor) and thioredoxin reductase (trxB) gene in the host strains will aid the formation of cytosolic disulfide bonds, and it will enhance the solubility of folded, disulfide-containing proteins (Prinz et al., 1997).
 
1.  E. coli Origam
 
Strain of ‘Novagene’. The Origami strain is trxB (thioredoxin reductase) mutants, so disulfide bond formation in the cytoplasm is enhanced and allows proper folding of the recombinant protein; the Origami strain also lacks the glutathione reductase gene.
 
2. ‘SHuffle’ E. coli strain
 
From ‘NEB’ are better than ‘Origami’ strain for the expression of putative disulfide bond forming protein. SHuffle strains express DsbC within the cytoplasm in addition to trxB and gor mutation. DsbC directs correct disulfide bond formation and also acts as a general chaperone for protein folding (Lobstein et al., 2012).
 
Strains to express toxic proteins
 
Leaky expression of T7 polymerase is observed in BL21(DE3). To minimize the leaky expression of a toxic gene, the BL21 host strain was improved.
 
1. BL21-AI
 
araBAD promoter controls the T7 RNA polymerase gene in BL21-AI strain. The araBAD promoter system is optimal for expressing toxic proteins. In BL21-AI cells, T7 RNA polymerase basal expression and the subsequent target gene expression is highly reduced in the presence of glucose and absence of arabinose (Chen and Leong, 2009; Yao et al., 2009).
 
2.  BL21(DE3) pLysS
 
BL21(DE3) pLysS strains, express T7 phage lysozyme, an enzyme that effectively inhibits T7 RNA polymerase activity. So, the basal expression of the target protein is decreased. Culturing of BL21(DE3) pLysS requires Chloramphenicol (Lefebvre et al., 2008).
 
Strains to express globular or membrane protein
 
Lemo21 (DE3) is a derivative of BL21(DE3) strain. This strain contains well-titrable rhamnose promoter (Prha) with T7 polymerase. Its inhibitor T7 lysozyme controls the activity of Prha promoter. Lemo21 (DE3) strain is compatible with ColE1 or pMB1 origin-containing plasmids. Chloram phenicol is required for the maintenance of this strain (Schlegel et al., 2012).
 
Methylation deficient strains
 
Bacteria possess restriction and modification systems that allow them to identify and destroy foreign DNA. This is one of the significant problems in cloning and resulting in the substantially reduced recovery of desired sequences. Most of the E. coli strains contain several methylation-dependent restriction systems, namely McrA, McrBC and Mrr. These problems can be avoided by using the strains in which restriction and modification systems are disabled.
 
1. McrA, McrBC
 
The methylcytosine restricting endonucleases (McrA, McrBC) cleave methylcytosines in the sequences CG and (A/C) G, respectively. Inactivation of the pathway that cleaves methylated cytosine DNA allows uptake of foreign DNA.
 
2. Mrr
 
Mrr (Methyl adenine recognition and restriction) will attack DNA with methyladenine in specific sequences. Inactivation of the pathway that cleaves methylated adenine DNA allows uptake of foreign DNA.
 
Strain for expression at low temperature
 
When E. coli is transformed to manufacture large amounts of recombinant protein, the protein sometimes forms dense aggregates of insoluble misfolded proteins, known as inclusion bodies. Reduction in cultivation temperature (15-25°C) avoids or decreases the inclusion body formation. San-Miguel et al., (2013) reported the successful protein expression at 4°C for 72 h. At low temperature, the expression of chaperone, which folds newly synthesized or misfolded protein, also reduces drastically.
 
Arctic Express
 
(Agilent technologies) strain is derived from the high-performance Stratagene BL21-Gold. This strain co-expresses the cold-adapted chaperonins such as Cpn10 and Cpn6 from a psychrophilic bacterium, Oleispira antarctica (Ferrer et al., 2004). Cpn10 and Cpn60 are effective folding-modulators at low temperatures (4°C to 12°C) and confer an enhanced ability for E. coli to grow at lower temperatures (Ferrer et al., 2003).
 
(IV) Changing the culture condition of the recombinant host
 
Culture condition of E. coli also changes the recombinant protein expression and its solubility. E. coli is the prokaryotic organism, so transcription and translation are coupled. Using strong promoters results in aggregation of protein before folding. This problem can be solved by reducing the rate of transcription or translation so that the protein folds properly (Francis and Page, 2010). The culture condition of the particular recombinant strain should be changed to enhance the solubility of the protein.
 
1. Temperature and inducer concentration
 
Prolonged induction of low temperature increases the solubility of recombinant protein (Kataeva et al., 2005; Volontè et al., 2008; Piserchio et al., 2009). At lower expression temperature, most of the proteases get inactivated, so the rate of protein denaturation also reduced. Lower temperature reduces the rate of metabolism of bacteria, so it leads to reduced protein aggregation (Sahdev et al., 2008). However, this comes along with prolonged cultivation times. In addition to lowering the expression temperature also reducing the inducer concentration and IPTG will result in reduced transcription rate and enhanced recombinant protein solubility (Turner et al., 2005; Francis and Page, 2010; Gopal and Kumar, 2013).
        
At lower temperatures, cell processes slow down, and thus lead to reduced rates of transcription, translation, cell division and reduced protein aggregation. Lowering the expression temperature also results in a reduction in the degradation of proteolytically sensitive proteins.
 
2. Media
 
Batch culture is the most common method used to cultivate the bacterial cells for recombinant protein expression. To optimize the level of expression, it is necessary to fine tune the culture medium because it is much cheaper and easier to manipulate the media compositions (Vincentelli et al., 2003). Although changing the media concentration has the limited impact on the recombinant protein expression, all the essential nutrients required for the growth must be provided from the beginning (Sahdev et al., 2008). Various media like LB, TB and 2YT can be used to optimize the protein concentration and addition of prosthetic groups or cofactors in the culture medium will prevent the formation of inclusion bodies (Joseph et al., 2015).
 
(V) Co-Expression with other genes
 
Simultaneous expression of more genes is required to stabilize the recombinant protein or the protein produced by the counterparts will interact with it. The gene coding those proteins should be co-expressed with the target protein. One of such protein is a molecular chaperone; when chaperones co-expressed with the gene of interest, it will increase the solubility and expression of the protein (Francis and Page, 2010; DeMarco et al., 2005; Gopal and Kumar, 2013). Co-expression of single chain variable fragment (scFv) antibody with a molecular chaperon effectively improved the correct folding and enhanced the solubility of scFv (Sonoda et al., 2011).
The ideal recombinant protein expression should contain the elements that are essential for transcription and translation of the gene to produce an authentic protein of interest. Selection of the appropriate host will result in increased yield of recombinant protein and enhance its solubility. Recombinant protein production in E. coli is less costly than using other hosts and the handling is also easier. It would be better to use an existing E. coli strain to save time and resources but, if the existing host strain does not fit to the research design, then a host can be constructed. Host strains with a combination of two or more features can be used. E.g.: ‘Rosetta-gami’ (disulfide bond formation and also prevent the effect of codon bias) and Arctic Express. If one E. coli strain possess all the characteristics required for heterologous gene expression such as disulfide bond formation, rare t-RNA, low temperature adapted chaperone, slow expression, etc. it would be ideal for recombinant protein production.

  1. Bass, S.H. and Yansura, D.G. (2000). Application of the E. coli trp promoter. Molecular biotechnology, 16(3):253-260.

  2. Brosius, J., Erfle, M. and Storella, J. (1985). Spacing of the-10 and-35 regions in the tac promoter. Effect on its in vivo activity. Journal of Biological Chemistry, 260(6): 3539-3541.

  3. Brown, B.L., Hadley, M. and Page, R. (2008). Heterologous high-level E. coli expression, purification and biophysical characterization of the spine-associated RapGAP (SPAR) PDZ domain. Protein Expression and Purification, 62(1): 9-14.

  4. Burgess-Brown, N.A., Sharma, S., Sobott, F., Loenarz, C., Oppermann, U. and Gileadi, O. (2008). Codon optimization can improve expression of human genes in Escherichia coli: A multi-gene study. Protein Expression and Purification, 59(1): 94-102.

  5. Butt, T.R., Edavettal, S.C., Hall, J.P. and Mattern, M.R. (2005). SUMO fusion technology for difficult-to-express proteins. Protein Expression and Purification, 43(1): 1-9.

  6. Calderone, T.L., Stevens, R.D. and Oas, T.G. (1996). High-level misincorporation of lysine for arginine at AGA Codons in a fusion protein expressed in Escherichia coli. Journal of molecular Biology, 262(4): 407-412.

  7. Carpousis, A.J. (2007). The RNA degradosome of Escherichia coli: an mRNA-degrading machine assembled on RNase E. Annu. Rev. Microbiol., 61: 71-87.

  8. Chen, R. (2012). Bacterial expression systems for recombinant protein production: E. coli and beyond. Biotechnology Advances, 30:1102.

  9. Chen, Y. and Leong, S.S.J. (2009). Adsorptive refolding of a highly disulfide-bonded inclusion body protein using anion-exchange chromatography. Journal of Chromatography A, 1216(24): 4877-4886.

  10. DeMarco, A. (2006). Two-step metal affinity purification of double-tagged (NusA–His 6) fusion proteins. Nature Protocols, 1(3):1538.

  11. DeMarco, A., Vigh, L., Diamant, S. and Goloubinoff, P. (2005). Native folding of aggregation-prone recombinant proteins in Escherichia coli by osmolytes, plasmid- or benzyl alcohol-overexpressed molecular chaperones. Cell Stress Chaperones, 10:329–339.

  12. DeMey, M., Maertens, J., Lequeux, G.J., Soetaert, W.K. and Vandamme, E.J. (2007). Construction and model-based analysis of a promoter library for E. coli: an indispensable tool for metabolic engineering. BMC Biotechnology, 7(1): 34.

  13. Dumon-Seignovert, L., Cariot, G. and Vuillard, L. (2004). The toxicity of recombinant proteins in Escherichia coli: a comparison of overexpression in BL21 (DE3), C41 (DE3), and C43 (DE3). Protein Expression and Purification, 37(1): 203-206.

  14. Dyson, M.R., Shadbolt, S.P., Vincent, K.J., Perera, R.L. and McCafferty, J. (2004). Production of soluble mammalian proteins in Escherichia coli: identification of protein features that correlate with successful expression. BMC Biotechnology, 4(1): 32.

  15. Elena, C., Ravasi, P., Castelli, M.E., Peiru, S. and Menzella, H.G. (2014). Expression of codon optimized genes in microbial systems: current industrial applications and perspectives. Frontiers in Microbiolology, 5.

  16. Esposito, D. and Chatterjee, D.K. (2006). Enhancement of soluble protein expression through the use of fusion tags. Current Opinion in Biotechnology, 17(4): 353-358.

  17. Ferrer, M., Chernikova, T.N., Yakimov, M., Golyshin, P.N. and Timmis, K.N. (2003). Chaperonins govern growth of Escherichia coli at low temperatures. Nature Biotechnology, 21(11):1266.

  18. Ferrer, M., Lunsdorf, H., Chernikova, T.N., Yakimov, M., Timmis, K.N. and Golyshin, P.N. (2004). Functional consequences of single double ring transitions in chaperonins: life in the cold. Molecular Microbiology, 53(1):167-182.

  19. Francis, D.M. and Page, R. (2010). Strategies to optimize protein expression in E. coli. Current protocols in Protein Science, 61(1): 5-24.

  20. Gopal, G.J. and Kumar, A. (2013). Strategies for the production of recombinant protein in Escherichia coli. The protein journal, 32(6): 419-425.

  21. Gottesman, S. (1990). Minimizing proteolysis in Escherichia coli: genetic solutions. In Methods in enzymology, 185: 119-129.

  22. Gräslund, S., Nordlund, P., Weigelt, J., Hallberg, B.M., Bray, J., Gileadi, O., Knapp, S., Oppermann, U., Arrowsmith, C., Hui, R. and Ming, J., (2008). Protein production and purification. Nature Methods, 5(2): 135.

  23. Grunberg-Manago, M. (1999). Messenger RNA stability and its role in control of gene expression in bacteria and phages. Annual Review of Genetics, 33(1): 193-227.

  24. Harrison, R.G. (2000). Expression of soluble heterologous proteins via fusion with NusA protein. Innovations, 11:4-7.

  25. Hatfield, G.W. and Roth, D.A. (2007). Optimizing scaleup yield for protein production: Computationally Optimized DNA Assembly (CODA) and Translation Engineering™. Biotechnology Annual Review, 13: 27-42.

  26. Hewitt, S.N., Choi, R., Kelley, A., Crowther, G.J., Napuli, A.J. and Voorhis, W.C.V. (2011). Expression of proteins in Escherichia coli as fusions with maltose-binding protein to rescue non-expressed targets in a high-throughput protein-expression and purification pipeline. Acta Crystallographica F, 67(9): 1006–1009.

  27. Hu, J., Qin, H., Gao, F.P. and Cross, T.A. (2011). A systematic assessment of mature MBP in membrane protein production: overexpression,    membrane targeting and purification. Protein Expression and Purification, 80(1): 34-40.

  28. Jia, B and Jeon, C.O. (2016). High-throughput recombinant protein expression in Escherichia coli: current status and future perspectives. Open Biology, 6(8): 160-196.

  29. Johnston, K. and Marmorstein, R. (2003). Co-expression of proteins in E. coli using dual expression vectors. In E. coliGene Expression Protocols, 205-213.

  30. Joseph, B.C., Pichaimuthu, S., Srimeenakshi, S., Murthy, M., Selvakumar, K., Ganesan, M. and Manjunath, S.R. (2015). An overview of the parameters for recombinant protein expression in Escherichia coli. Journal of Cell Science & Therapy, 6(5): 1.

  31. Kataeva, I., Chang, J., Xu, H., Luan, C.H., Zhou, J., Uversky, V.N., Lin, D., Horanyi, P., Liu, Z.J., Ljungdahl, L.G. and Rose, J. (2005). Improving solubility of shewanella o neidensis MR-1 and clostridium thermocellum JW-20 proteins expressed into Esherichia coli. Journal of Proteome Research, 4(6): 1942-1951.

  32. Khlebnikov, A. and Keasling, J.D. (2002). Effect of lacY Expression on homogeneity of induction from the Ptac and Ptrc Promoters by natural and synthetic inducers. Biotechnology Progress, 18(3): 672-674.

  33. Lee, N., Francklyn, C. and Hamilton, E.P. (1987b). Arabinose-induced binding of AraC protein to araI2 activates the araBAD operon promoter. Proceedings of the National Academy of Sciences, 84(24): 8814-8818.

  34. Lefebvre, J., Boileau, G. and Manjunath, P. (2008). Recombinant expression and affinity purification of a novel epididymal human sperm-binding protein, BSPH1. Molecular Human Reproduction, 15(2): 105-114.

  35. Lobstein, J., Emrich, C.A., Jeans, C., Faulkner, M., Riggs, P., Berkmen, M. (2012). SHuffle, a novel Escherichia coli protein expression strain capable of correctly folding disulfide bonded proteins in its cytoplasm. Microbial cell factories. 8(11):56.

  36. Menzella, H.G. (2011). Comparison of two codon optimization strategies to enhance recombinant protein production in Escherichia coli. Microbial cell factories, 10(1): 15.

  37. Milisavljeviæ, M.D., Papiæ, D.R., Timotijeviæ, G.S. and Maksimoviæ, V.R. (2009). Successful production of recombinant buckwheat cysteine-rich aspartic protease in Escherichia coli. Journal of the Serbian Chemical Society, 74(6): 607-618.

  38. Moffatt, B.A. and Studier, F.W. (1987). T7 lysozyme inhibits transcription by T7 RNA polymerase. Cell, 49(2): 221-227

  39. Nallamsetty, S., Austin, B.P., Penrose, K.J. and Waugh, D.S. (2005). Gateway vectors for the production of combinatorially tagged His6 MBP fusion proteins in the cytoplasm and periplasm of Escherichia coli. Protein Science, 14(12): 2964-2971.

  40. Newbury, S.F., Smith, N.H., Robinson, E.C., Hiles, I.D. and Higgins, C.F. (1987). Stabilization of translationally active mRNA by prokaryotic REP sequences. Cell, 48(2): 297-310.

  41. Peroutka, R.J., Orcutt, S.J., Strickler, J.E. and Butt, T.R. (2011). SUMO fusion technology for enhanced protein expression and purification in prokaryotes and eukaryotes: in Heterologous Gene Expression in E. coli, Methods in Molecular Biology, 705: 15–30.

  42. Piserchio, A., Ghose, R. and Cowburn, D. (2009). Optimized bacterial expression and purification of the c-Src catalytic domain for solution NMR studies. Journal of Biomolecular NMR, 44(2): 87-93.

  43. Prinz, W.A., Åslund, F., Holmgren, A. and Beckwith, J. (1997). The Role of the Thioredoxin and Glutaredoxin Pathways in Reducing Protein Disulfide Bonds in the Escherichia coliCytoplasm. Journal of Biological Chemistry, 272(25): 15661-15667.

  44. Qing, G., Ma, L.C., Khorchid, A., Swapna, G.V.T., Mal, T.K., Takayama, M.M., Xia, B., Phadtare, S., Ke, H., Acton, T. and Montelione, G.T. (2004). Cold-shock induced high-yield protein production in Escherichia coli. Nature Biotechnology, 22(7): 877.

  45. Richardson, J.P. and Roberts, J.W. (1993). Transcription termination. Critical Reviews in Biochemistry and Molecular Biology, 28(1): 1-30.

  46. Rosano, G.L. and Ceccarelli, E.A. (2009). Rare codon content affects the solubility of recombinant proteins in a codon bias-adjusted Escherichia coli strain. Microbial Cell Factories, 24(8): 41.

  47. Routzahn, K.M. and Waugh, D.S. (2002). Differential effects of supplementary affinity tags on the solubility of MBP fusion proteins. Journal of Structural and Functional Genomics, 2(2): 83-92.

  48. Sahdev, S., Khattar, S.K. and Saini, K.S. (2008). Production of active eukaryotic proteins through bacterial expression systems: a review of the existing biotechnology strategies. Molecular and cellular biochemistry, 307(1-2): 249-264.

  49. Saida, F., Uzan, M., Odaert, B. and Bontems, F. (2006). Expression of highly toxic genes in E. coli: special strategies and genetic tools. Current Protein and Peptide Science, 7(1): 47-56.

  50. San-Miguel, T., Pérez-Bermúdez, P. and Gavidia, I. (2013). Production of soluble eukaryotic recombinant proteins in E. coli is favoured in early log-phase cultures induced at low temperature. Springerplus, 2(1): 89.

  51. Schlegel, S., Löfblom, J., Lee, C., Hjelm, A., Klepsch, M., Strous, M., Drew, D., Slotboom, D.J. and de Gier, J.W. (2012). Optimizing membrane protein overexpression in the Escherichia coli strain Lemo21 (DE3). Journal of Molecular Biology, 423(4): 648-659.

  52. Smith, D.B. and Johnson, K.S. (1988). Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase. Gene, 67(1): 31-40.

  53. Snajder, M., Mihelic, M., Turk, D. and Ulrih, N.P. (2015). Codon optimisation is key for Pernisine expression in Escherichia coli. PLoS ONE, 10(4): e0123288.

  54. Sonoda, H., Kumada, Y., Katsuda, T. and Yamaji, H.J. (2011). Effects of cytoplasmic and periplasmic chaperones on secretory production of single-chain Fv antibody in Escherichia coli. Journal of Bioscience and Bioengineering. 111(4):465-470.

  55. Studier, F.W. (1991). Use of bacteriophage T7 lysozyme to improve an inducible T7 expression system. Journal of Molecular Biology, 219(1): 37-44.

  56. Studier, F.W. and Moffatt, B.A. (1986). Use of bacteriophage T7 RNA polymerase to direct selective high-level expression of cloned genes. Journal of Molecular Biology, 189(1): 113-130.

  57. Tohru, N., Takeshi, I. and Tsutomu, N., (1994). A T7 promoter vector with a transcriptional terminator for stringent expression of foreign genes. Gene, 145(1): 145-146.

  58. Turner, P., Holst, O. and Karlsson, E.N. (2005). Optimized expression of soluble cyclomaltodextrinase of thermophilic origin in Escherichia coli by using a soluble fusion-tag and by tuning of inducer concentration. Protein Expression and Purification, 39(1): 54-60.

  59. Vincentelli, R., Bignon, C., Gruez, A., Canaan, S., Sulzenbacher, G., Tegoni, M. and Cambillau, C. (2003). Medium-scale structural genomics: strategies for protein expression and crystallization. Accounts of Chemical Research, 36(3), 165–172.

  60. Volontè, F., Marinelli, F., Gastaldo, L., Sacchi, S., Pilone, M.S., Pollegioni, L. and Molla, G. (2008). Optimization of glutaryl-7-    aminocephalosporanic acid acylase expression in E. coli. Protein Expression and Purification, 61(2): 131-137.

  61. Welch, M., Govindarajan, S., Ness, J.E., Villalobos, A., Gurney, A., Minshull, J. and Gustafsson, C. (2009). Design parameters to control synthetic gene expression in Escherichia coli. PloS One, 4(9): 7002.

  62. Yang, M.T., Scott, H.B. and Gardner, J.F. (1995). Transcription termination at the thr attenuator Evidence that the adenine residues upstream of the stem and loop structure are not required for termination. Journal of Biological Chemistry, 270(40): 23330-    23336.

  63. Yao, J., Patrone, J.D. and Dotson, G.D. (2009). Characterization and kinetics of phosphopantothenoylcysteine synthetase from Enterococcus faecalis. Biochemistry, 48(12): 2799-2806.

  64. Yin, J., Li, G., Ren, X. and Herrler, G. (2007). Select what you need: a comparative evaluation of the advantages and limitations of frequently used expression systems for foreign genes. Journal of Biotechnology, 127(3): 335-347.

  65. Zhang, Y.B., Howitt, J., McCorkle, S., Lawrence, P., Springer, K. and Freimuth, P. (2004). Protein aggregation during overexpression limited by peptide extensions with large net negative charge. Protein Expression and Purification, 36(2): 207-216.

  66. Zhao, X., Li, G. and Liang, S. (2013). Several Affinity Tags Commonly Used in Chromatographic Purification. Journal of Analytical Methods in Chemistry, 2013. 

Editorial Board

View all (0)