Protein Expression
Neil Broadway (n dot broadway at mac dot com)
Berkhamsted, Hertfordshire, United Kingdom
DOI
//dx.doi.org/10.13070/mm.en.2.123
Date
last modified : 2021-01-11; original version : 2012-06-18
Cite as
MATER METHODS 2012;2:123
Abstract

This article presents a review of commonly used vector-host systems for protein expression, based on the PDB database with protein expression information from over 30,000 publications and a Labome survey of randomly selected publications. The expression of toxic proteins is discussed in detail and expression systems used in the production of pharmaceuticals is briefly summarized.

Typical Protein Expression Workflow

Figure 1 shows two typical protein expression workflows. One workflow leads to the generation of a purified protein. The other leads to the generation of a cell line expressing a recombinant protein. In real life, these two workflows may overlap if, for example, a stable mammalian cell line is to be used as the source material from which to purify a recombinant protein.

Protein Expression figure 1
Figure 1. Typical protein expression workflows.
Key Features of Different Expression Systems

The advantages, disadvantages and potential applications of a number of the commonly used recombinant expression systems are listed in Table 1. A number of publications provide detailed information on these systems: Escherichia coli [1-6] ; Saccharomyces cerevisiae [4, 7-10] ; Pichia pastoris [5, 8, 9, 11, 12] ; baculovirus / insect cells [1, 5, 13, 14] ; mammalian cell lines [5, 15] ; and cell-free / in vitro protein production systems [16]. Table 1 summarizes the fundamental properties of these expression systems. Researchers are actively working to improve these fundamental properties (see [17] for a review). Several E. coli strains are now commercially available that aim to overcome the problems of codon bias (Rossetta 2 [18], CodonPlus ril/rp), inefficient disulfide bond formation (SHuffle, Origami [19, 20] ) and poor expression of membrane proteins (C41 and C43). Similarly, E. coli expression vectors utilizing tags such as SUMO, maltose binding protein [21] and thioredoxin [21] designed to promote soluble expression are commercially available. Pre-expression of sulfhydryl oxidase may markedly promote disulfide bond formation [22]. These approaches work very well for some proteins, but for others, it is still not possible to obtain soluble, functional recombinant expression in E. coli [20, 23]. Proteins expressed in bacteria are contaminated with endotoxins, which can be removed by, for example, Toxineraser endotoxin removal kit from GeneScript [24] before being used in vivo. E. coli strains have even been engineered to perform protein N-glycosylation, though the efficiency is low [17, 25], or optimized for His-tagged protein expression (LOBSTR strain) [26, 27]. Likewise, approaches are being developed to express proteins with more mammalian-like glycosylation in the baculovirus/insect cell system and hence expand its utility [28]. Baculovirus variants that promote greater protein secretion are also being developed [17]. Staus DP et al minimized the number of cysteine residues in a truncated rat beta-arrestin 1 gene to increase its expression and stability in BL21/DE3 bacteria [29].

The critical problem facing the researcher is that it is still not possible to predict which expression system will work best for a particular protein and a specific end-use. A universally applicable expression system does not yet exist [30]. When selecting an expression system, the researcher should bear in mind the fundamental properties of each system, their pros and cons and how any particular limitations of that system can be overcome. Decisions should be informed by knowledge of the protein expression target/family members and the ultimate use of the recombinant protein. If resources permit, it may be prudent to explore two (or more) expression systems in parallel.

Expression system Advantages Disadvantages Applications Suppliers
Escherichia coli Potentially very high expression levels
Low cost
Simple culture conditions
Rapid growth
Scaleable
Simple transformation protocols
Many parameters can be altered to optimize expression
Inefficient disulfide bond formation
Poor folding of proteins in the cytoplasm (inc. bacterial proteins)
Inclusion body formation
In vitro refolding protocols inefficient – may negate advantages
Codon usage different to eukaryotes
Minimal post-translational modifications
Endotoxin
May not be able to express large proteins
* engineered strains can help alleviate the problems with disulfide bond formation (Shuffle and Origami), codon bias (Rosetta and CodonPlus ril/rp), or protein secretion [31]
Purified protein (structure, enzymology, drug discovery)
Protein therapeutics
Invitrogen / Life Technologies
EMD Millipore
New England Biolabs
Promega
Clontech
Avidis
Saccharomyces cerevisiae Good expression levels
Choice of secreted or cellular expression
Low cost
Simple culture conditions
Scaleable
Able to perform most eukaryotic post-translational modifications
Efficient protein folding
Endotoxin-free
Likely lower expression than with Pichia pastoris
Secretion likely lower than with Pichia pastoris
Glycosylation still different to mammalian cells
A tendency to hyperglycosylate proteins
N-glycan structures considered allergenic
Purified protein (structure, enzymology, drug discovery)
Protein therapeutics
Invitrogen/Life Technologies
Pichia pastoris High expression levels
Low cost
Simple culture conditions
Relatively rapid growth
Scaleable
Choice of secreted or intracellular expression
Protein secretion efficient and allows simple purification
Extensive post-translational modification of proteins
Efficient protein folding
N-glycosylation more like higher eukaryotes than with Saccharomyces cerevisiae
Endotoxin-free
Use of methanol as inducer is a safety (fire) hazard at scale
Glycosylation still different to mammalian cells
Purified protein (structure, enzymology, drug discovery) Invitrogen/Life Technologies
Baculovirus-infected insect cells Good expression levels (esp. for intracellular proteins)
Relatively rapid growth
Efficient protein folding
Moderately scaleable
Extensive post-translational modification of proteins
Glycosylation more like mammalian cells

Relatively easy deglycosylate proteins enzymatically (good for structure determination)
Endotoxin-free
Expensive culture media
Large volumes of virus needed on scale-up
Inefficient processing of pro-peptides in the secretory pathway
Glycosylation still different to mammalian cells
Viral infection leads to cell lysis and potential degradation of expressed proteins
Purified protein (structure, enzymology, drug discovery) Invitrogen/Life Technologies
Oxford Expression Technologies
BD Biosciences
Clontech
Mammalian cells Good expression levels
Moderately scaleable
Suspension-adapted cells facilitate scale-up
Efficient protein folding
Good for secreted proteins
All post-translational modifications
Endotoxin-free
Expensive culture media
Complex growth requirements
Purified protein (structure, enzymology, and drug discovery)
Protein therapeutics
Cell-based studies
Invitrogen/Life Technologies
EMD Millipore
Promega
Stratagene
Transient expression Rapid route to protein Transfection reagents can be expensive at scale
Large amounts of plasmid DNA for scale-up
Stable cell lines Can archive protein-producing cell line Slow (months) route to protein - especially if clonal selection used
Potential loss of expression with passages
BacMam-mediated transient transduction Rapid route to protein
Scaleable
No need to purify large quantities of plasmid DNA
Non-lytic (c.f. baculovirus/insect cells)
Efficient transduction of many primary human cell types
Substantial quantities of virus needed for scale-up
Cell-free protein Production Rapid route to protein
E. coli, wheat germ, insect and mammalian systems commercially available
Scaleable to quantities
Protein synthesis conditions can be manipulated
Can readily incorporate non-amino acids
Can use PCR products as a template - amenable to simple high throughput approaches
Limited post-translational modifications in the absence of canine pancreatic microsomes
Expensive at scale
Purified protein (structure, enzymology, drug discovery)
In vitro expression cloning [16, 32]
Isotopic labeling of proteins for NMR [33, 34]
Incorporation of non-natural amino acids [35]
Invitrogen/Life Technologies
Promega
New England Biolabs
Table 1. Key features of some commonly used expression systems.

Interestingly, one structural genomics effort, the RIKEN Structural Genomics Initiative in Japan, has concentrated on the use of cell-free (in vitro transcription/translation) to generate proteins for its structure determination efforts [3, 36], which demonstrates the synthetic capacity of modern cell-free protein expression systems. However, cell-free systems tend to have low efficiency for proper folding.

Key aspects of protein expression for structural biology have recently been reviewed in a special edition of Current Opinion in Structural Biology in 2013.

This article has focussed on the most commonly used expression vector-host systems. These are the systems that are likely to be the first port of call when planning to express a recombinant protein. However, it should be noted that many other, more esoteric, expression systems are available. These may be of interest to those researchers with experience of protein expression, or in those situations where the more ‘mainstream’ expression systems do not meet the needs of a particular study. By way of example, the following microbial/plant cell host systems have been described: yeast (Hansenula polymorpha, Arxula adeninivorans, Kluyveromyces lactis, Yarrowia lipolytica, Schizosaccharomyces pombe [8] ); bacteria (Bacillus brevis, Bacillus megaterium, Bacillus subtilis and Caulobacter crescentis [2], Corynebacterium [37], hyperthermophilic sulfolobus islandicus [38] ); and algae [39]. For a recent review of alternative/less common expression systems used for structural biology see [40]. The non-pathogenic Mycobacterium smegmatis was used for the soluble expression of proteins from pathogenic Mycobacterium tuberculosis. This is of note as it is reported that the expression of mycobacterial proteins in E. coli can be problematic [41].

The following viral expression vectors are available for recombinant protein expression in mammalian cells: semliki forest virus [42] ; lentivirus [43] ; adenovirus [44] ; adeno-associated virus. Semliki forest virus has proved popular for the expression of membrane proteins for drug discovery and structural genomics [42]. Lentiviral and adenoviral vectors are currently of great interest in the field of gene therapy [45, 46]. There is also considerable interest in the expression of therapeutic recombinant proteins in the milk of transgenic animals [47]. Transposase-based systems are also gaining popularity for hyperactive constitutive expression, such as the sleeping beauty vector [48].

There has been a substantial recent interest in developing systems for the recombinant expression of multi-protein complexes for both structural biology and drug discovery/development [49-52].

Most Popular Expression Hosts and Expression Vectors in the Worldwide Protein Data Bank
About Worldwide Protein Data Bank

The Worldwide Protein Data Bank (wwPDB: http://www.wwpdb.org/) is an international collaboration of four organisations: RCSB PDB (USA: http://www.rcsb.org/); MSD-EBI (Europe: http://www.ebi.ac.uk/pdbe/); PDBj (Japan: http://www.pdbj.org/); and BMRB (USA). The wwPDB is a repository of macromolecular structural data whose "mission is to maintain a single PDB archive of macromolecular structural data that is freely and publicly available to the global community" [53, 54]. The vast majority of the structures in the wwPDB are of proteins.

Expression host and expression vector

To understand our current practice on protein expression, we select the PDB entries with publications within the last 10 years (from 2009 onwards), resulting 27096 articles, which correspond to 64713 records.

The majority of the wwPDB entries cited Escherichia coli as the expression host, with 23041 out of 27096 publications (85%) reporting its use. The common strain is BL21 (11628 articles).

Table 2 lists the top 5 expression hosts and the top 2 or 3 expression vectors for each of these host organisms.

Expression hostMost commonly used expression vectorsPublications
Escherichia coli 23041
pET28 and derivatives (Novagen/EMD Millipore) 2679
pET15 (Novagen/EMD Millipore) 863
pET21 (Novagen/EMD Millipore) 735
Spodoptera frugiperda 1493
pFastBac (Invitrogen/Life Tech) 228
pVL1392/3 (BD Bioscience) 27
pFB-LIC-Bse 16
Homo sapiens 1181
pHL-sec 92
pVRC8400 42
pTT5 17
Trichoplusia ni 532
pFastBac 76
pAcGP67 15
Cricetulus griseus (CHO) 245
pEE series 12
PGS 8
Table 2. The top 5 most commonly used expression hosts by the number of publications, based on the wwPDB dataset since 2009 as of January 28, 2020.

It is noteworthy that Escherichia coli is the most common expression host in the wwPDB dataset by a very substantial margin. Escherichia coli is a Gram-negative, rod-shaped bacterium. It is one of the key model organisms in life science research and has been extensively exploited in both academic and industrial settings. It should be noted that the lipopolysaccharides in the outer membrane are the source of endotoxin, which may elicit severe inflammatory responses in cellular and in vivo experimental models. Spodoptera frugiperda cells used for protein expression are cell lines (Sf 9 and Sf 21) derived from the ovarian tissues of the Fall Armyworm. Novavax expressed modified SARS-CoV-2 spike protein in Sf 9 cells to produce a COVID 19 vaccine [55]. Trichoplusia ni cells (available commercially as High Five) are derived from Cabbage Looper ovary cells. Trichoplusia ni is reported to be better than Sf9 cells for the production of secreted proteins using the baculovirus/insect cell system [56]. Trichoplusia ni cells are also superior to Sf9 cells as a host for the production of virus-like particles for recombinant vaccine production [57]. Pichia pastoris is respiratory, methylotrophic yeast that can utilize methanol as its sole carbon and energy source. For example, Kitchen P et al expressed full-length AQP4 protein in Pichia pastoris [58] and D Wrapp et al expressed bivalent VHHs in the pKai61 vector in Pichia pastoris [59]. Cricetulus griseus cell lines are derived from Chinese hamster ovary cells (CHO). This extensively used cell line can be adapted to suspension growth. Most recombinant antibodies are produced in CHO cell lines. Maun HR et al, for example, expressed human alpha- and beta-tryptase genes in CHO DKO cell line [60].

Commonly used expression vectors in the wwPDB dataset

For each expression host, the top 2 or 3 most frequently cited expression vectors account for only a relatively small proportion of the total publications (Table 2). This reflects the wide range of different expression vectors that have been used for each expression host. Figures 1 and 2 show plasmid maps for pET28 and pcDNA3.3 (the latest version of pcDNA3), respectively. These plasmids are the archetypal expression vectors for E. coli and mammalian cells, respectively.

With pET vectors the T7 RNA polymerase promoter drives expression of the recombinant gene. The pET28 plasmid encodes an N-terminal His-tag/thrombin cleavage site/T7-tag sequence and an optional C-terminal His-Tag sequence. These vectors are used with lambda DE3 lysogen strains of E. coli. In these strains expression of a genomic copy of the T7 RNA polymerase is under control of the lac repressor. Expression of the recombinant protein is induced by the addition of isopropyl-b-D-thio-galactoside (IPTG) to the culture medium. Interestingly, vectors in which expression is induced by other molecules (e.g., arabinose) do not feature prominently in the wwPDB dataset.

Protein Expression figure 2
Figure 2. Plasmid map of pET28. pET28 images provided with permission of EMD Millipore Corporation.

With the pcDNA3 series of plasmids, the expression is driven by the immediate early promoter of human cytomegalovirus (CMV). This is a strong promoter, constitutively active in mammalian cells. pcDNA3 is the original version of the pcDNA3 series of vectors and is no longer commercially available. The latest development of this series of vectors is pcDNA3.3. It should be noted that pcDNA3 was also identified as one of the most commonly used mammalian expression vectors in a survey of formal publications (like pcDNA3.1 [61, 62] or pcDNA3.3 from Invitrogen (K8300-01) [63] ).

Protein Expression figure 3
Figure 3. Plasmid map of pcDNA3.3. pcDNA3.3 images provided with permission of Life Technologies Corporation.

The most commonly used baculovirus/insect cell vectors all utilize the strong polyhedrin promoter to drive constitutive expression of the recombinant protein. The plasmids (baculovirus transfer vectors) do not themselves directly drive protein expression. They are used to generate recombinant baculovirus containing the gene of interest under the control of the polyhedrin promoter. pFastBac is a newer generation of plasmids that utilize site-specific transposition to generate recombinant baculovirus. This reduces the time taken to generate recombinant baculovirus to around 2 weeks compared to the 4-6 weeks required with older generation plasmids such as pVL1392/3 and pAcGP67.

The Pichia pastoris vectors both utilize the strong AOX1 promoter. The expression is induced by methanol. Despite Pichia pastoris being an excellent system for the production of secreted proteins (see below), the two most commonly used Pichia pastoris expression vectors in the wwPDB dataset are both designed for cytoplasmic expression.

Potential bias in the wwPDB dataset

The data in Table 2 correspond to expression hosts and expression vectors that were specifically employed for producing proteins for structural studies. Consequently, this dataset is biased towards those expression vectors/hosts that are capable of generating large amounts of purified proteins that are required for 3-dimensional structure determination studies. The inability of Escherichia coli to glycosylate proteins and the relative ease with which insect cell N-glycosylation can be removed enzymatically (see Table 1) may also contribute to the frequent use of these systems in the wwPDB dataset. Unglycosylated proteins are generally preferred for structural studies, unless the sugar molecules are essential for function.

For other applications (such as enzyme assays, cellular assays, production of antigen for antibody generation, over-expression to study cellular function or localization) it may not be necessary to express and purify such substantial quantities of protein. Indeed, for cell-based studies purification of the recombinantly-expressed protein is unlikely to be a consideration. Nonetheless, the data on expression vectors/hosts obtained from the wwPDB is a precious resource. The data can help guide protein expression projects for many applications.

Popular Expression Vectors from Surveyed Publications

To avoid the potential bias of the wwPDB dataset, Labome surveyed a randomly selected set of formal publications that cited plasmids. The top 3 most commonly used groups of expression vectors are shown in Table 3. As observed for the wwPDB dataset, the most commonly cited expression vectors account for only a small proportion of the total publications. Again, this reflects the diversity of expression vectors that researchers use for any given expression host. Vector pcDNA3.1 drives expression in mammalian cells via the constitutive CMV promoter (see above). Plasmid pGL3 is a luciferase reporter vector designed for the quantitative study of the regulation of mammalian gene expression. The increasing popularity of this vector presumably reflects a growing desire of researchers to study the function of the human genome at both the transcriptional and proteomic levels.

Similarly, pEGFP is a mammalian expression vector in which expression is driven constitutively by the CMV promoter. Enhanced green fluorescent protein (EGFP) is expressed as either an N-terminal (pEGFP-C1) or a C-terminal (pEGFP-N1) fusion with the protein of interest. These pEGFP vectors may be used to study the subcellular localization or trafficking of proteins by monitoring the EGFP fluorescence [64, 65]. Other vectors such as pRK5 expression vector [66] are also utilized.

VectorPMIDHostCommon variant Reference
pcDNA3.1 (Invitrogen) 83 Mammalian cell lines pcDNA 3.1 His
pcDNA 3.1 V5
[67, 68]
pGL3 (Promega) 44 Mammalian cell lines [68]
pEGFP (Clontech) 40 Mammalian cell lines pEGFP-C1
pEGFP-N1
[69]
Table 3. The most commonly used expression vectors from a survey of formal publications.

It is notable that the most commonly used mammalian expression vectors, both in the publication survey and in the wwPDB dataset, drive constitutive expression. This is despite the commercial availability of inducible mammalian expression vectors such as the T-Rex system, like Thermo Fisher Flp-In T-REx 293 cells used in the production of ApoE3 proteins [70] or other proteins [67], pGEX vectors from GE Healthcare [71], or the pF12 RM Flexi system. Although not featuring highly in academic publications, the BacMam system has proven popular in the pharmaceutical industry for expressing proteins both for cellular studies and for purification [72-74] and is now commercially available. BacMam utilizes a modified baculovirus in which the usual promoter is replaced with the mammalian cell-active CMV promoter. The BacMam virus drives non-replicative, non-lytic expression in a wide range of mammalian cell types.

Expression Systems in the Production of Protein Pharmaceuticals

Unlike in research settings, the production of newly approved protein pharmaceuticals is often in mammalian expression systems. Gary Walsh summarized the expression systems used in the production of protein pharmaceuticals approved by US/EU regulatory authorities from Jan 2014 to Dec 2018 [79]. Fifty-two out of 62 novel recombinant protein pharmaceuticals are expressed in mammalian cell lines, one (Sebelipase alfa) in a mammalian transgenic system, 5 in E. coli, and 4 in S. cerevisiae. CHO cell-based systems are the most common mammalian expression host. Among the 68 monoclonal antibody drugs (novel or biosimilar) approved during the same period, 57 are produced in CHO cell lines, 9 in NS0 cells and 2 in Sp2/0 cells.

Expression of Toxic Proteins

A frequently encountered problem is the expression of recombinant proteins that are toxic to the host cells in which they are expressed. Several strategies are available to overcome this issue. The researcher generally has to empirically determine which potential solution works best for their particular protein. Literature precedent, target class knowledge and ‘in-house’ experience of the target protein can all be used to guide the choice of strategy.

Escherichia coli

The critical issue with E. coli expression is the leaky expression of the toxic protein before induction. This leads to plasmid loss/rearrangement, poor cell growth and reduced protein expression [80]. Several approaches have been developed to address the problem of toxic protein expression in E. coli.

  • Tightly regulated (i.e., non-leaky) expression systems such as the pBAD system utilizing the araBAD promoter (Invitrogen/Life Sciences) can be used to minimize basal expression [80].
  • With T7 promoter-based plasmids, pre-induction expression can be reduced by the use of pLysS/pLysE/pLysY host cells expressing T7 lysozyme that inhibits T7 RNA polymerase and thus reduces promoter activity prior to IPTG induction. Similarly, glucose can be used to repress promoter activity prior to induction with IPTG [80].
  • The pETcocoTM system (EMD Millipore) allows plasmid copy number to be maintained at a very low level during cell growth thus minimizing basal expression and maximizing plasmid stability prior to induction. Plasmid copy number is markedly upregulated and target gene expression induced by IPTG [80].
  • Another approach is to use host strains, such as C41(DE3) and C43(DE3), empirically selected for their ability to express toxic proteins more effectively than the parental BL21(DE3) strain (Avidis, Lucigen) [80].
  • Empirically screening fusion tags such as maltose binding protein, GST, thioredoxin or SUMO may identify a fusion partner that overcomes the toxicity of the target protein (Invitrogen/Life Sciences, New England Biolabs, LifeSensors) [80].
  • Directing expression to the periplasm can potentially overcome toxicity associated with cytosolic accumulation [80].
  • Batch-fed culture may also be a useful approach to the expression of toxic proteins [81].
Mammalian expression

Many mammalian expression systems use a constitutively active CMV promoter. This is problematic for the expression of proteins that are toxic to the host cells. However, researchers frequently wish to study the cellular function of such proteins or wish to express and purify such proteins bearing full mammalian cell post-translational modifications. Multiple inducible mammalian expression systems, utilizing the strong CMV promoter, are now commercially available in which expression is induced by tetracycline (T-RexTM Invitrogen/Life Sciences, Tet-On 3G Clontech), ecdysone (Agilent Technologies/Stratagene) and IPTG (pTUNE Origene). These systems facilitate the growth of sufficient cell numbers prior to the induction of the target protein. Nakagawa T expressed a DualTetONGluA2-FLAG/CNIH3-1D4 plasmid in HEKTetON cell (CLONTECH) to alleviate the toxicity by the activation of the ion channel GluA2 [82].

Use a different expression system

If one particular expression system fails, it may be advantageous to switch to a different system (e.g., yeast, insect, bacterial, mammalian) if other considerations allow (e.g., end use, post-translational modifications). A protein that is toxic in one system may not be toxic in another [80].

Cell-free protein expression

If relatively small quantities of (purified) protein are required then cell-free protein production is an attractive option to circumvent issues of cellular toxicity [80].

References
  1. Hunt I. From gene to protein: a review of new and enabling technologies for multi-parallel protein expression. Protein Expr Purif. 2005;40:1-22 pubmed
  2. Terpe K. Overview of bacterial expression systems for heterologous protein production: from molecular and biochemical fundamentals to commercial systems. Appl Microbiol Biotechnol. 2006;72:211-22 pubmed
  3. Gräslund S, Nordlund P, Weigelt J, Hallberg B, Bray J, Gileadi O, et al. Protein production and purification. Nat Methods. 2008;5:135-46 pubmed publisher
  4. Ferrer Miralles N, Domingo Espín J, Corchero J, Vazquez E, Villaverde A. Microbial factories for recombinant pharmaceuticals. Microb Cell Fact. 2009;8:17 pubmed publisher
  5. Brondyk W. Selecting an appropriate method for expressing a recombinant protein. Methods Enzymol. 2009;463:131-47 pubmed publisher
  6. Gopal G, Kumar A. Strategies for the production of recombinant protein in Escherichia coli. Protein J. 2013;32:419-25 pubmed publisher
  7. Holz C, Hesse O, Bolotina N, Stahl U, Lang C. A micro-scale process for high-throughput expression of cDNAs in the yeast Saccharomyces cerevisiae. Protein Expr Purif. 2002;25:372-8 pubmed
  8. Celik E, Calik P. Production of recombinant proteins by yeast cells. Biotechnol Adv. 2012;30:1108-18 pubmed publisher
  9. Mattanovich D, Branduardi P, Dato L, Gasser B, Sauer M, Porro D. Recombinant protein production in yeasts. Methods Mol Biol. 2012;824:329-58 pubmed publisher
  10. Bill R. Playing catch-up with Escherichia coli: using yeast to increase success rates in recombinant protein production experiments. Front Microbiol. 2014;5:85 pubmed publisher
  11. Macauley Patrick S, Fazenda M, McNeil B, Harvey L. Heterologous protein production using the Pichia pastoris expression system. Yeast. 2005;22:249-70 pubmed
  12. Gurkan C, Ellar D. Recombinant production of bacterial toxins and their derivatives in the methylotrophic yeast Pichia pastoris. Microb Cell Fact. 2005;4:33 pubmed
  13. Hu Y. Baculovirus as a highly efficient expression vector in insect and mammalian cells. Acta Pharmacol Sin. 2005;26:405-16 pubmed
  14. Jarvis D. Baculovirus-insect cell expression systems. Methods Enzymol. 2009;463:191-222 pubmed publisher
  15. Almo S, Love J. Better and faster: improvements and optimization for mammalian recombinant protein production. Curr Opin Struct Biol. 2014;26:39-43 pubmed publisher
  16. Jackson A, Boutell J, Cooley N, He M. Cell-free protein synthesis for proteomics. Brief Funct Genomic Proteomic. 2004;2:308-19 pubmed
  17. Assenberg R, Wan P, Geisse S, Mayr L. Advances in recombinant protein expression for use in pharmaceutical research. Curr Opin Struct Biol. 2013;23:393-402 pubmed publisher
  18. King M, Petry S. Phase separation of TPX2 enhances and spatially coordinates microtubule nucleation. Nat Commun. 2020;11:270 pubmed publisher
  19. Bhagawati M, Terhorst T, Füsser F, Hoffmann S, Pasch T, Pietrokovski S, et al. A mesophilic cysteine-less split intein for protein trans-splicing applications under oxidizing conditions. Proc Natl Acad Sci U S A. 2019;116:22164-22172 pubmed publisher
  20. de Marco A. Recent contributions in the field of the recombinant expression of disulfide bonded proteins in bacteria. Microb Cell Fact. 2012;11:129 pubmed publisher
  21. Lee Y, Warne T, Nehmé R, Pandey S, Dwivedi Agnihotri H, Chaturvedi M, et al. Molecular basis of β-arrestin coupling to formoterol-bound β1-adrenoceptor. Nature. 2020;583:862-866 pubmed publisher
  22. Nguyen V, Hatahet F, Salo K, Enlund E, Zhang C, Ruddock L. Pre-expression of a sulfhydryl oxidase significantly increases the yields of eukaryotic disulfide bond containing proteins expressed in the cytoplasm of E.coli. Microb Cell Fact. 2011;10:1 pubmed publisher
  23. de Marco A. Strategies for successful recombinant expression of disulfide bond-dependent proteins in Escherichia coli. Microb Cell Fact. 2009;8:26 pubmed publisher
  24. Kam T, Mao X, Park H, Chou S, Karuppagounder S, Umanah G, et al. Poly(ADP-ribose) drives pathologic α-synuclein neurodegeneration in Parkinson's disease. Science. 2018;362: pubmed publisher
  25. Valderrama Rincon J, Fisher A, Merritt J, Fan Y, Reading C, Chhiba K, et al. An engineered eukaryotic protein glycosylation pathway in Escherichia coli. Nat Chem Biol. 2012;8:434-6 pubmed publisher
  26. Klein I, Boija A, Afeyan L, Hawken S, Fan M, Dall Agnese A, et al. Partitioning of cancer therapeutics in nuclear condensates. Science. 2020;368:1386-1392 pubmed publisher
  27. Andersen K, Leksa N, Schwartz T. Optimized E. coli expression strain LOBSTR eliminates common contaminants from His-tag purification. Proteins. 2013;81:1857-61 pubmed publisher
  28. Palmberger D, Wilson I, Berger I, Grabherr R, Rendic D. SweetBac: a new approach for the production of mammalianised glycoproteins in insect cells. PLoS ONE. 2012;7:e34226 pubmed publisher
  29. Staus D, Hu H, Robertson M, Kleinhenz A, Wingler L, Capel W, et al. Structure of the M2 muscarinic receptor-β-arrestin complex in a lipid nanodisc. Nature. 2020;579:297-302 pubmed publisher
  30. Sørensen H. Towards universal systems for recombinant gene expression. Microb Cell Fact. 2010;9:27 pubmed publisher
  31. Chen Z, Cao J, Xie L, Li X, Yu Z, Tong W. Construction of leaky strains and extracellular production of exogenous proteins in recombinant Escherichia coli. Microb Biotechnol. 2014;7:360-70 pubmed publisher
  32. King R, Lustig K, Stukenberg P, McGarry T, Kirschner M. Expression cloning in the test tube. Science. 1997;277:973-4 pubmed
  33. Keppetipola S, Kudlicki W, Nguyen B, Meng X, Donovan K, Shaka A. From gene to HSQC in under five hours: high-throughput NMR proteomics. J Am Chem Soc. 2006;128:4508-9 pubmed
  34. Kohno T, Endo Y. Production of protein for nuclear magnetic resonance study using the wheat germ cell-free system. Methods Mol Biol. 2007;375:257-72 pubmed
  35. Noren C, Anthony Cahill S, Griffith M, Schultz P. A general method for site-specific incorporation of unnatural amino acids into proteins. Science. 1989;244:182-8 pubmed
  36. Yokoyama S, Hirota H, Kigawa T, Yabuki T, Shirouzu M, Terada T, et al. Structural genomics projects in Japan. Nat Struct Biol. 2000;7 Suppl:943-5 pubmed
  37. Sundaram R, Hurwitz I, Matthews S, Hoy E, Kurapati S, Crawford C, et al. Expression of a functional single-chain antibody via Corynebacterium pseudodiphtheriticum. Eur J Clin Microbiol Infect Dis. 2008;27:617-22 pubmed publisher
  38. Peng N, Deng L, Mei Y, Jiang D, Hu Y, Awayez M, et al. A synthetic arabinose-inducible promoter confers high levels of recombinant protein expression in hyperthermophilic archaeon Sulfolobus islandicus. Appl Environ Microbiol. 2012;78:5630-7 pubmed publisher
  39. Specht E, Mayfield S. Algae-based oral recombinant vaccines. Front Microbiol. 2014;5:60 pubmed publisher
  40. Fernandez F, Vega M. Technologies to keep an eye on: alternative hosts for protein production in structural biology. Curr Opin Struct Biol. 2013;23:365-73 pubmed publisher
  41. Bashiri G, Baker E. Production of recombinant proteins in Mycobacterium smegmatis for structural and functional studies. Protein Sci. 2015;24:1-10 pubmed publisher
  42. Lundstrom K. Expression of mammalian membrane proteins in mammalian cells using Semliki Forest virus vectors. Methods Mol Biol. 2010;601:149-63 pubmed publisher
  43. Dull T, Zufferey R, Kelly M, Mandel R, Nguyen M, Trono D, et al. A third-generation lentivirus vector with a conditional packaging system. J Virol. 1998;72:8463-71 pubmed
  44. Bett A, Haddara W, Prevec L, Graham F. An efficient and flexible system for construction of adenovirus vectors with insertions or deletions in early regions 1 and 3. Proc Natl Acad Sci U S A. 1994;91:8802-6 pubmed
  45. Sakuma T, Barry M, Ikeda Y. Lentiviral vectors: basic to translational. Biochem J. 2012;443:603-18 pubmed publisher
  46. Yao X, Nakagawa S, Gao J. Current targeting strategies for adenovirus vectors in cancer gene therapy. Curr Cancer Drug Targets. 2011;11:810-25 pubmed
  47. Bosze Z, Baranyi M, Whitelaw C. Producing recombinant human milk proteins in the milk of livestock species. Adv Exp Med Biol. 2008;606:357-93 pubmed
  48. Moya I, Castaldo S, Van den Mooter L, Soheily S, Sansores Garcia L, Jacobs J, et al. Peritumoral activation of the Hippo pathway effectors YAP and TAZ suppresses liver cancer in mice. Science. 2019;366:1029-1034 pubmed publisher
  49. Gomez S, López Estepa M, Fernandez F, Vega M. Protein Complex Production in Alternative Prokaryotic Hosts. Adv Exp Med Biol. 2016;896:115-33 pubmed publisher
  50. Gomez S, López Estepa M, Fernandez F, Suarez T, Vega M. Alternative Eukaryotic Expression Systems for the Production of Proteins and Protein Complexes. Adv Exp Med Biol. 2016;896:167-84 pubmed publisher
  51. Sari D, Gupta K, Thimiri Govinda Raj D, Aubert A, Drncová P, Garzoni F, et al. The MultiBac Baculovirus/Insect Cell Expression Vector System for Producing Complex Protein Biologics. Adv Exp Med Biol. 2016;896:199-215 pubmed publisher
  52. Baser B, van den Heuvel J. Assembling Multi-subunit Complexes Using Mammalian Expression. Adv Exp Med Biol. 2016;896:225-38 pubmed publisher
  53. Berman H, Henrick K, Nakamura H. Announcing the worldwide Protein Data Bank. Nat Struct Biol. 2003;10:980 pubmed
  54. Berman H, Henrick K, Nakamura H, Markley J. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 2007;35:D301-3 pubmed
  55. Bangaru S, Ozorowski G, Turner H, Antanasijevic A, Huang D, Wang X, et al. Structural analysis of full-length SARS-CoV-2 spike protein from an advanced vaccine candidate. Science. 2020;: pubmed publisher
  56. Davis T, Wickham T, McKenna K, Granados R, Shuler M, Wood H. Comparative recombinant protein production of eight insect cell lines. In Vitro Cell Dev Biol Anim. 1993;29A:388-90 pubmed
  57. Krammer F, Schinko T, Palmberger D, Tauer C, Messner P, Grabherr R. Trichoplusia ni cells (High Five) are highly efficient for the production of influenza A virus-like particles: a comparison of two insect cell lines as production platforms for influenza vaccines. Mol Biotechnol. 2010;45:226-34 pubmed publisher
  58. Kitchen P, Salman M, Halsey A, Clarke Bland C, MacDonald J, Ishida H, et al. Targeting Aquaporin-4 Subcellular Localization to Treat Central Nervous System Edema. Cell. 2020;181:784-799.e19 pubmed publisher
  59. Wrapp D, De Vlieger D, Corbett K, Torres G, Wang N, Van Breedam W, et al. Structural Basis for Potent Neutralization of Betacoronaviruses by Single-Domain Camelid Antibodies. Cell. 2020;181:1004-1015.e15 pubmed publisher
  60. Maun H, Jackman J, Choy D, Loyet K, Staton T, Jia G, et al. An Allosteric Anti-tryptase Antibody for the Treatment of Mast Cell-Mediated Severe Asthma. Cell. 2019;179:417-431.e19 pubmed publisher
  61. Hettmann T, Gillies S, Kleinschmidt M, Piechotta A, Makioka K, Lemere C, et al. Development of the clinical candidate PBD-C06, a humanized pGlu3-Aβ-specific antibody against Alzheimer's disease with reduced complement activation. Sci Rep. 2020;10:3294 pubmed publisher
  62. Guo A, Wang Y, Chen B, Wang Y, Yuan J, Zhang L, et al. E-C coupling structural protein junctophilin-2 encodes a stress-adaptive transcription regulator. Science. 2018;362: pubmed publisher
  63. McGeary S, Lin K, Shi C, Pham T, Bisaria N, Kelley G, et al. The biochemical basis of microRNA targeting efficacy. Science. 2019;: pubmed publisher
  64. Broadway N, Pease R, Birdsey G, Shayeghi M, Turner N, David Saggerson E. The liver isoform of carnitine palmitoyltransferase 1 is not targeted to the endoplasmic reticulum. Biochem J. 2003;370:223-31 pubmed
  65. Tamura K, Ohbayashi N, Ishibashi K, Fukuda M. Structure-function analysis of VPS9-ankyrin-repeat protein (Varp) in the trafficking of tyrosinase-related protein 1 in melanocytes. J Biol Chem. 2011;286:7507-21 pubmed publisher
  66. Verschueren E, Husain B, Yuen K, Sun Y, Paduchuri S, Senbabaoglu Y, et al. The Immunoglobulin Superfamily Receptome Defines Cancer-Relevant Networks Associated with Clinical Outcome. Cell. 2020;182:329-344.e19 pubmed publisher
  67. Lin Z, Gasic I, Chandrasekaran V, Peters N, Shao S, Mitchison T, et al. TTC5 mediates autoregulation of tubulin via mRNA degradation. Science. 2020;367:100-104 pubmed publisher
  68. Wang L, Wen M, Cao X. Nuclear hnRNPA2B1 initiates and amplifies the innate immune response to DNA viruses. Science. 2019;365: pubmed publisher
  69. Dong J, Lee Y, Kirmiz M, Palacio S, Dumitras C, Moreno C, et al. A toolbox of nanobodies developed and validated for use as intrabodies and nanoscale immunolabels in brain neurons. elife. 2019;8: pubmed publisher
  70. Arboleda Velasquez J, Lopera F, O Hare M, Delgado Tirado S, Marino C, Chmielewska N, et al. Resistance to autosomal dominant Alzheimer's disease in an APOE3 Christchurch homozygote: a case report. Nat Med. 2019;: pubmed publisher
  71. Li Z, Wang C, Wang Z, Zhu C, Li J, Sha T, et al. Allele-selective lowering of mutant HTT protein by HTT-LC3 linker compounds. Nature. 2019;: pubmed publisher
  72. Scott M, Modha S, Rhodes A, Broadway N, Hardwicke P, Zhao H, et al. Efficient expression of secreted proteases via recombinant BacMam virus. Protein Expr Purif. 2007;52:104-16 pubmed
  73. Kost T, Condreay J, Ames R, Rees S, Romanos M. Implementation of BacMam virus gene delivery technology in a drug discovery setting. Drug Discov Today. 2007;12:396-403 pubmed
  74. Davenport E, Nuthulaganti P, Ames R. BacMam: versatile gene delivery technology for GPCR assays. Methods Mol Biol. 2009;552:199-211 pubmed publisher
  75. Magupalli V, Negro R, Tian Y, Hauenstein A, Di Caprio G, Skillern W, et al. HDAC6 mediates an aggresome-like mechanism for NLRP3 and pyrin inflammasome activation. Science. 2020;369: pubmed publisher
  76. Zhao N, Kamijo K, Fox P, Oda H, Morisaki T, Sato Y, et al. A genetically encoded probe for imaging nascent and mature HA-tagged proteins in vivo. Nat Commun. 2019;10:2947 pubmed publisher
  77. Horn V, Uckelmann M, Zhang H, Eerland J, Aarsman I, le Paige U, et al. Structural basis of specific H2A K13/K15 ubiquitination by RNF168. Nat Commun. 2019;10:1751 pubmed publisher
  78. Li Y, Orlando B, Liao M. Structural basis of lipopolysaccharide extraction by the LptB2FGC complex. Nature. 2019;567:486-490 pubmed publisher
  79. Walsh G. Biopharmaceutical benchmarks 2018. Nat Biotechnol. 2018;36:1136-1145 pubmed publisher
  80. Saida F, Uzan M, Odaert B, Bontems F. Expression of highly toxic genes in E. coli: special strategies and genetic tools. Curr Protein Pept Sci. 2006;7:47-56 pubmed
  81. Khasa Y, Khushoo A, Mukherjee K. Enhancing toxic protein expression in Escherichia coli fed-batch culture using kinetic parameters: Human granulocyte-macrophage colony-stimulating factor as a model system. J Biosci Bioeng. 2013;115:291-7 pubmed publisher
  82. Nakagawa T. Structures of the AMPA receptor in complex with its auxiliary subunit cornichon. Science. 2019;366:1259-1263 pubmed publisher
ISSN : 2329-5139