Incorporating Unnatural Amino Acids into Recombinant Proteins in Living Cells
Nivedita Mitra (nivedita dot mitra at gmail dot com)
Yale University, United States
DOI
//dx.doi.org/10.13070/mm.en.3.204
Date
last modified : 2023-10-17; original version : 2013-08-27
Cite as
MATER METHODS 2013;3:204
Abstract

An overview of unnatural amino acids in recombinant proteins.

All proteins are composed of 20 canonical amino acids. Of course, as in all things biology, there are a few exceptions when other amino acids are found to occur naturally. Since most proteins have more than one of each amino acid, it is a problem when individual locations in the proteins have to be studied by covalent modification of an amino acid side-chain. For example, chemically labeling a cysteine in a location of interest in the protein will also result in the modification of other exposed cysteines. Such a scenario would call for the removal of all the exposed cysteines. This kind of mutation throughout the protein is likely to produce changes that might not be desirable in many cases. This is where unnatural amino acids (UAAs) come in. Incorporating a UAA with a chemically unique side chain at the desired location in the protein will render large-scale mutations unnecessary and may help preserve its structure.

UAAs add to the chemical diversity of protein, and thus can be used to create proteins with new functions. For instance, peptide inhibitors with a dithiol amino acid in the place of two consecutive cysteine residues can avoid the formation of disulfide bond isomers and render a much stronger inhibitory effect [1]. To date more than ~ 50 UAA have been incorporated into proteins produced in bacterial, yeast and mammalian systems. For reviews please refer to [2-4]. Cell-free systems have also been devised to introduce UAAs into proteins, including those from genomically recoded bacteria [5]. Biotech company Sutro Biopharma (Nasdaq:STRO) uses modified Escherichia coli extract to incorporate UAAs in defined numbers and positions within potential pharmaceutical proteins [6].

Except for the stop codons amber, ochre, and opal, all other triplet codons have been taken up by cellular systems to code for one of the 20 canonical amino acids. To encode a new amino acid in the translational machinery of an organism, one of these three degenerate stop codons is generally used. These codons are not recognized by any of the endogenous host tRNAs. A tRNA charged with the UAA of interest is engineered to recognize a stop codon, which then adds the UAA in the growing polypeptide chain by a mechanism commonly referred to as nonsense codon suppression. The most commonly used nonsense codon is the amber or TAG codon. The incorporation can happen in a codon-specific and mRNA-selective manner with the clever application of phase-transition and spatial separation [7]. The genetic codons might also be expanded to accommodate UAAs in a Hachimoji RNA (and DNA) system with eight genetic codes (A, C, G, T, B, P, S, Z) [8].

Incorporating Unnatural Amino Acids into Recombinant Proteins in Living Cells figure 1
Figure 1. A schematic of UAA incorporation method: Plasmids encoding the suppressor tRNACUA, the evolved amino acyl-tRNA synthetase (aaRS) and the target protein with the amber codon are transfected into cells. The media covering the cells is supplemented with the UAA (orange cross). The aaRS catalyses the acylation of the suppressor tRNACUA with the UAA. When the mRNA containing the amber codon is being translated in the ribosome (light green oval), the amber codon is recognized by the tRNACUA charged with the UAA and this amino acid is added to the growing polypeptide chain.

Early experiments to incorporate UAAs into recombinant proteins started with tRNAs that were already charged with the UAA by chemical methods, followed by either cell-free translation or injecting this tRNA into Xenopus oocytes along with the mRNA encoding the protein of interest containing the TAG codon [9, 10]. It is not hard to imagine that the yields of UAA incorporated recombinant proteins that were produced in this manner were very small. Since then, methods have been developed that allow the tRNA to be charged with the appropriate UAA inside a living cell, after which the UAA is directly incorporated into the protein of interest. These methods involve using a tRNA and an aminoacyl-tRNA synthetase (or aaRS; it is the enzyme that acylates the tRNA with an amino acid) pair from a species that is widely different from the host in which the recombinant protein is being produced, i.e., using a bacterial or archaeal tRNA/aaRS pair in a mammalian host or using an archaeal tRNA/ aaRS pair in a bacterial host.

As a first step, both the tRNA and the aaRS are engineered to recognize the UAA of interest [11-14]. Moreover, the engineered tRNA cannot be non-specifically acylated by any other aaRS endogenous to the host. In the same way, the engineered aaRS should not acylate endogenous tRNAs, i.e., the tRNA/aaRS pair should be orthogonal to the host system. Using an orthogonal tRNA/aaRS pair from a distant species helps satisfy this criterion. The anticodon on the tRNA is mutated in such a way that it specifically recognizes the stop codon that codes for the UAA (e.g., if the amber codon TAG is used, the tRNA anticodon is mutated to CUA).

Next, the aaRS gene is modified such that it recognizes the UAA specifically. This modification starts with a large library of aaRS active site mutants. After multiple rounds of positive and negative selection in bacteria or yeast the final aaRS is obtained. The rounds of positive selection involve a gene such as the chloramphenicol resistance gene, with inframe TAG codons. Only those bacteria that produce a full-length chloramphenicol resistance gene survive in the selection media containing chloramphenicol. In the negative selection round, the ability of an aaRS to non-specifically suppress amber codons in a gene that is toxic to the bacteria, in the absence of the UAA, results in the expression in the toxic gene and subsequent bacterial cell death and elimination of that aaRS clone. In the end, what are left are clones that suppress amber codons exclusively with the UAA.

Ideally, such selection protocols have to be carried out for each new amino acid and host. If the UAA incorporation system is being developed for yeast, the selection process is carried out in yeast as well. The long replication time of mammalian cells prevents such a selection process from being carried out in mammalian systems. Thus, amino aaRS sequences evolved in yeast or bacteria are used in mammalian systems [11].

Once the UAA incorporation is engineered into the translational machinery of the host cell, the UAA is added into the cell growth media for subsequent tRNA acylation and its incorporation into the target protein. Thus, the UAA has to be cell permeable, non-toxic and stable inside cells, to be effectively utilized by the engineered tRNA and aaRS.

Incorporation of UAA into recombinant proteins produced in live cells using an ‘evolved’ aaRS was pioneered in the lab of Peter Schultz at The Scripps Research Institute, La Jolla. O-Methyl L-tyrosine was the first UAA that was successfully incorporated into a recombinant protein in live cells using an engineered tRNA-aaRS pair. The protein was produced in E. coli using the archaeal tRNATyr-tyrosyl-tRNA synthetase (TyrRS) pair from Methanococcus jannaschii [15].

UAAs have also been incorporated into proteins produced in yeast using an E. coli tRNATyr-TyrRS pair [14]. The amino acids p-acetyl-L-phenylalanine, p-benzoyl-L-phenylalanine, p-azido-L-phenylalanine, O-methyl-L-tyrosine and p-iodo-L-tyrosine, all closely related in structure to tyrosine, have been added to the translational machinery of yeast using the tRNATyr-TyrRS pair.

Many different laboratories have succeeded in suppressing amber/nonsense mutations within mammalian cells. Suppression has been achieved using both natural and UAAs [16]. The Schultz lab for the first time used the process of ‘evolution’ in yeast to generate many versions of the E. coli TyrRS that are specific for a variety of different amino acids, for use in mammalian cells [11]. Since the E. coli tRNATyr does not have the right promoter elements for transcription in mammalian cells, tRNATyr from Bacillus stearothermophilus was used instead. Earlier work by Sakamoto et al had shown that B. stearothermophilus tRNATyr could be transcribed and acylated by a specific variant of the E. coli TyrRS in mammalian cells [17].

Various sets of orthogonal tRNA and aaRS pairs have been developed to add new UAAs into the protein synthesis machinery of bacteria (Mycobacterium [18] and E. coli), yeast and mammals, including photocaged-citrulline [19]. Some of these are E. coli TyrRS and B. stearothermophilus tRNATyr in mammalian cells [11, 17, 20], M. jannaschii (Mj) TyrRS and tRNATyr in bacterial cells [15], M. barkeri PylRS and tRNAPylCUA in yeast [21].E. coli TyrRS and tRNATyr pair in yeast [14] E. coli LeuRS and tRNALeu pair in yeast [22], M. mazei PylRS and tRNAPylCUA in mammalian cells [23].

As a continuation of the multiple efforts to use UAAs to generate new enzymes for chemical synthesis (reviewed in [24] ), a recent report by Drienovska et al has demonstrated the successful incorporation of p-aminophenylalanine into the transcriptional regulator from Lactococcus lactis [25]. The efficiency of the artificial enzyme was achieved by insertion of p-aminophenylalanine residue into the hydrophobic pocket of the enzyme molecule, which boosted the activity of the aniline side chain.

With regard to the in vivo research, transgenic Caernorhabditis elegans has actively been used as a model for UAA studies. For example, UAAs can be inserted into C. elegans using a pyrrolysyl tRNA-synthetase and a pyrrolysyl tRNA construct. The detection of the incorporated UAAs can be performed by either Western blotting or immunofluorescence [26].

It is now possible to incorporate more than one UAA into a single protein in vivo. The Schultz lab has reported on the genetic incorporation of pAcF and azidolysine into the trastuzumab polypeptide sequence using aminoacyl tRNA synthetase / tRNA pairs specific for different UAAs in a mammalian cell system [27].

Quadruplet Codons

With rare exceptions, naturally occurring DNA sequences code for amino acids with codons made up of three base pairs, i.e., codons occur as triplets. Although the mechanistic details of how translations by quadruplet codons work are not known, there have been a number of efforts to use quadruplet codons to encode UAAs. Many of these initial efforts relied on using the tRNA with its 3 base anti-codon sequence switched out by a 4 base anticodon, which was then chemically charged with the UAA of interest. Work in the last eight to ten years by Jason Chin and colleagues at the MRC lab in Cambridge has resulted in the development of a quadruplet codon recognition system where the entire translational machinery including acylation of the tRNA happens inside a live cell. The strategy was to create an alternate translational machinery, orthogonal to the host cell (i.e., it does not translate native mRNA) and involved creating an alternate ribosome that would specifically recognize the mRNA with the quadruplet codon.

Using an alternate ribosome, which is also orthogonal is desirable, because it can be evolved to specifically recognize both the modified mRNA with the quadruplet codon and the tRNA with the quadruplet anticodon while leaving the host translational machinery undisturbed. It has been shown that the evolved ribosome, Ribo-Q1 can more efficiently translate both triplet and quadruplet codons than the native ribosomes [28].

Recently Niu et al have used an alternate approach for incorporating UAAs in response to a quadruplet codon [29]. Based on the idea that the tRNAPyl anticodon loop is not an important determining factor for the recognition of its cognate PylRS, the authors first changed the anticodon of UCCU to recognize the codon AGGA. F to UCCU to recognize the codon AGGA. Following this, a tRNAPylCUA library was generated where four other base positions of the anticodon loop were randomized. This library was evolved in vitro in the presence of the UAA, Nε-(tert-butyloxy-carbonyl)-L-lysine or Boc-Lys and the BocLysRS (a modified PylRS, originally described in [30] ) to produce a tRNA clone that recognized the quadruplet codon in the chloramphenicol acetyltransferase gene. This evolved tRNA was used then in E. coli and in mammalian cells for incorporation of Boc-Lys in response to the AGGA quadruplet codon.

Unnatural Amino Acid Incorporation Method

The gene of interest containing the amber codon, the suppressor tRNA and the evolved aaRS are introduced into cells (Figure 1). This is followed by addition of media containing the UAA (can be obtained from Sigma) to the cells. After allowing the cells to grow for about two days, the cells are harvested and the protein is purified. It has been observed that increasing the copy number of suppressor tRNA leads to an increase in yields of the protein with the UAA [11]. Usually multiple copies of the tRNA are placed in tandem in the tRNA plasmid. It has also been observed that having low concentrations of the aaRS is useful to prevent cross-acylation of native tRNA molecules with the UAA and also to prevent read through (or suppression) of naturally existing amber stop codons marking the end of gene sequences. This problem is more acute in mammalian cells where the percentage of stop codons with the amber sequence is 23% than that in bacterial cells where only 5% of all stop codons are amber. Generally, during transfection in mammalian cells, the DNA coding for the AARS gene (gene coding for aaRS) is used at about 1/10th the amount of DNA coding for the target gene.

Protocols describing the incorporation of UAAs can be found in [11, 31], and [32], among others. Dickey TH et al, for example, incorporated, by amber stop codon suppression with the help of pEVOL-pAzF from Addgene ( 31186), the unnatural amino acid azido-phenylalanine to enable specific click chemistry labeling with alkyne-containing fluorophores [33]. Romei MG et al incorporated site-specific substitutions of 10 non-canonical amino acids into photoswitchable green fluorescent protein Dronpa2 using five different aminoacyl-tRNA synthetases (aaRSs) and their corresponding tRNACUA to study photoisomerization in proteins [34]. D Cervettini et al developed a scalable approach (tRNA Extension) to discover and evolve aminoacyl-tRNA synthetase–tRNA pairs [35]. Incorporation of UAAs can also be achieved through inteins [36-38].

Recently, a method has been described, based on the use of Methanosarcina mazei pyrrolysyl-tRNA synthetase and the corresponding tRNA, for the genetic incorporation of UAAs in stable mammalian cell lines. The aminoacyl-tRNA synthetase/tRNA pair was stably integrated into the mammalian genome. The system was used to explore the effects of substituting histone lysine residues with the constitutively active UAA Nε-acetyl-lysine [39].

Using Photocrosslinking UAAs to Study Interactions Sites in Proteins

Substituting a native amino acid in a protein with an amino acid that can cross-link upon exposure to light can give information about interacting partners in the vicinity of that amino acid. One such amino acid is p-benzoyl L-phenylalanine (pBpa), which cross-links with nearby C-H bonds when exposed to light between 350-360 nm. An orthogonal tRNA/aaRS pair was evolved for incorporation of pBpa in proteins in E. coli [13] and yeast [14]. The system evolved in yeast was subsequently adapted in the mammalian system to incorporate pBpa into the adaptor protein human Grb2 [20] and probe its interaction with the EGF receptor. Since then, other studies have used the pBpa system to study interactions between proteins and their interacting partners inside living cells [40-42].

Another photocrosslinking amino acid 3’-azibutyl-N-carbamoyl-lysine (AbK) has been adapted in E. coli and mammalian cells using the M. barkeri tRNAPylCUA/PylRS pair [43].

p-azido-L-phenylalanine (AzPhe) has also been used as a photocrosslinking UAA to determine the binding sites of a small molecule (maraviroc) and peptide (T140) on the G-protein coupled receptors (GPCRs) CC chemokine receptor 5 (CCR5) and CXC chemokine receptor 4 (CXCR4), respectively [41, 44]. Details of methods used for incorporating photocrosslinking amino acid into GPCRs can be found in [45].

A recent publication describes the use of a genetically encoded photo-affinity-labelled UAA ((Se-(N-(3-(3-methyl-3H-diazirin-3-yl)propyl)propanamide)-3-yl-homoselenocysteine), known as DiZHSeC) for protein interaction studies that incorporated a label identifiable by mass spectroscopy (MS). The UAA was incorporated into the bait protein. Following in vivo photo-affinity-labelling of the prey protein, DiZHSeC can be oxidatively cleaved to leave an N-(4,4-bis-substituted-pentyl)acrylamide group attached to the prey protein which enables subsequent MS identification. This approach has been shown to identify in vivo protein-protein interactions in both E.coli and HEK 294T cells [46]. Very recently, the use of genetically encoded UAA to study the protein-protein interactions between SUMO and proteins containing the SUMO-interacting motif (SIM) has been described. Photoactivated UAAs were genetically incorporated into the SIM interaction groove of the SUMO1 protein [47].

Incorporating Photocaged Amino Acids for Temporal Regulation of Protein Activity

Proteins can be made inactive by incorporating a light-removable protecting group within their sequence that blocks sites important for their biological activity. This blocking or ‘caging’ activity can be brought about by photocaged amino acids. When the unnatural photocaged amino acid is intact, the protein is inactive, but upon exposure to light, the functional blocking group in the amino acid is removed, leaving behind the native amino acid and a biologically active protein. Thus, photocaged amino acids can be used to tune protein activity finely. Photocaged versions of tyrosine [48-50], cysteine [22], lysine [23] and serine [51] have been genetically incorporated into proteins.

The role of active site mutations of isocitrate dehydrogenase 2 in epigenetic and metabolic changes and the development of certain cancers has recently been investigated using the genetic incorporation of a photoactivated caged lysine derivative UAA. The protein containing the caged UAA was inactive. Upon photoactivation, the lysine residue generated recapitulated the disease-causing mutation and enabled the time course of disease-relevant epigenetic and metabolic changes to be followed in HEK 293 cells [52].

Incorporating Fluorescent Amino Acids

Site-specific introduction of an amino acid that can fluoresce differently from native amino acids can be a very powerful tool to study conformational changes, localization and molecular interactions in a protein. L-(7-hydroxycoumarin-4-yl) ethylglycine, which has a high quantum yield, large Stokes’ shift and responds to changes in solvent polarity has been introduced in the E. coli translational machinery using an evolved MjtRNATyr/TyrRS pair. This UAA was introduced into myoglobin and its unfolding was monitored using the fluorescence signal from hydroxycoumarin [53].

Another fluorescent amino acid dansylalanine, with the environmentally sensitive dansyl fluorophore has been added into the translational machinery of Saccharomyces cerevisiae [54]. This was achieved starting from a tRNAPylCUA and the LeuRS from E. coli. The authors showed that it was possible to monitor the unfolding of superoxide dismutase by measuring the fluorescence signals from the dansyl group.

Unnatural fluorescent amino acids in proteins have also been used for visualization of protein localization in cells. Another coumarin-based fluorescent amino acid (S)-1-carboxy-3-(7-hydroxy-2-oxo-2H-chromen-4-yl)propan-1-aminium (CouAA) has been incorporated into proteins in E. coli using an evolved Mj tRNATyr/TyrRS pair. CouAA has been incorporated into the E. coli FtsZ and GRoEL for visualization of the respective proteins inside living bacterial cells [55, 56]. The polarity-sensitive fluorescent amino acid 3-(6-acetylnaphthalen-2-ylamino)-2-aminopropanoic acid (Anap) has been incorporated in EGFP (enhanced green fluorescent protein) produced in mammalian cells. After some modification, an E. coli tRNALeu/LeuRS pair previously evolved in S. cerevisiae was used to achieve this incorporation [57].

In addition, the recently reported generation of UAA β-(1-azulenyl)-l alanine, a synthetic deep-blue tryptophan analogue with specific fluorescence characteristics, would be beneficial for the studies visualizing intracellular targeted proteins. The authors have inserted β-(1-azulenyl)-l alanine into arginine C, one of the non-ribosomal peptides [58].

Incorporating IR Active Probes

Another kind spectroscopic probe that can be used to study conformational changes in proteins is the IR-active probe, which can be monitored, among other methods, by FTIR (Fourier transform infrared spectroscopy). By incorporating AzPhe, a polarity and electric field sensing probe at specific positions in the GPCR, rhodopsin, the authors were able to detect electrostatic changes around the transmembrane helices of the protein that occurred during light-induced activation of the protein [59]. Using this probe, the authors were also able to detect smaller conformational changes in the rhodopsin helices that occurred before larger rigid body helix movements [60].

Increasing the Diversity of Proteins by Chemically Adding functional Groups to Site-specific UAAs

p-acetylphenylalanine can be incorporated at specific positions in a protein in response to a TAG stop codon. This UAA can then be further chemically reacted with a desired molecule containing a hydroxylamine functional group. This technique has been used to label proteins with the hydroxylamine derivative of Alexa Fluor® 488 functioning as a fluorescent dye [61], or used in developing pharmaceuticals, such as ARX788 or ARX517 by biotech company Ambrx Biopharma (Nasdaq: AMAM).

A similar approach has been used for site-specific labeling of proteins with a spin label, which then becomes a marker for conformational change and can be studied by EPR spectroscopy [62].

Fluorescent dyes with catalogue numbers
Alexa Fluor 488 azide (A10266) and alkyne (A10267)
Alexa Fluor 555 azide (A20012) and alkyne (A20013)
Alexa Fluor 594 azide (A10270) and alkyne (A10275)
Alexa Fluor 647 azide (A10277) and alkyne (A10278)
Oregon Green 488 azide (O10180) and alkyne (O10181)
Tetramethylrhodamine (TAMRA) azide (T10182) and alkyne (T10183)
Biotin azide (B10184) and alkyne (B10185)
Click-iT Alexa Fluor 488 DIBO alkyne (C10405)
Click-iT Alexa Fluor 555 DIBO alkyne (C10406)
Click-iT Alexa Fluor 594 DIBO alkyne (C10407)
Click-iT Alexa Fluor 647 DIBO alkyne (C10408)
Click-iT TAMRA DIBO alkyne (C10410)
Click-iT biotin DIBO alkyne (C10412)
Table 1.Commercially available fluorescent dyes for labeling reaction.

Another UAA that has been chemically modified after site-specific incorporation in a protein is AzPhe. In Neumann et al [28], the authors site-specifically introduced AzPhe as the first and N6-[(2-propynyloxy)carbonyl]-l-lysine (CAK) as the last amino acid in calmodulin in response to quadruplet and amber codons respectively. The resultant folded protein had an azide (AzPhe) and an alkyne (CAK) group in close proximity to one another. Then using a copper-catalyzed alkyne-azide [2+3] cycloaddition reaction or click reaction, the two moieties were linked together covalently to yield a cyclic protein. This reaction can be used to incorporate a variety of chemical probes (Table 1) into proteins that have a site-specific UAA with an azide or an alkyne group.

AzPhe incorporated into the GPCR, CCR5 was labeled with the FLAG peptide using the Staudinger ligation (between azide and phosphine groups) [63]. The FLAG peptide in this study was modified with a triarylphosphine to facilitate the Staudinger ligation.

A recent study has shown that AzPhe incorporated in a GPCR can also be bio-orthogonally labeled with fluorescent probes within mammalian cells using the Staudinger–Bertozzi ligation [64]. In another study, a protein with the UAA Propargyllysine that has an alkyne group was labeled with a commercially available fluorescent dye with an azide moiety using the click reaction [65]. Both papers give details of the labeling protocols. A variety of fluorescent probes with both azide and alkyne functional are now available from Invitrogen for use in click reactions.

Another group of dyes that can react with the azide moiety in a UAA in a copper-free reaction, and are thus more suitable for reactions on the surface of live cells or where copper adversely affects the activity of the protein being labeled are available as well.

Ribosomal synthesis of peptide macrocycles

The genetic incorporation of UAA has been used as part of a system for the in vivo generation of macrocyclic peptides in bacterial cells. The system used an engineered aminoacyl tRNA synthetase from Methanocaldococcus jannaschii to genetically incorporate the phenylalanine derivative UAA 3-(2-mercapto-ethyl)amino phenylalanine into target proteins. The system has the potential for the screening of genetic libraries of polypeptide sequences for the generation of functional macrocyclic peptides [66].

Immuno-conjugates

The use of UAA to site-specifically incorporate toxins and/or reporter groups into therapeutic antibodies has received significant attention. Traditional methods of tag incorporation based on the use of bi-functional amino-reactive tags generate heterogeneous labelling of antibodies leading to conjugate preparations with, potentially, a broad spectrum of biochemical and pharmacological properties. The use of monoclonal antibodies genetically engineered to incorporate UAA offers a potential way to circumvent this issue. A monoclonal antibody against Her2/neu has been engineered to incorporate the UAA N6-((2-azidoethoxy)carbonyl)-l-lysine at 4 specific sites using a mammalian expression system. The incorporation of this UAA enabled the generation of homogeneous antibody-toxin conjugates containing 4 toxin molecules per antibody via click cycloaddition chemistry [67]. A similar approach has been successfully applied to generating immune-conjugates for imaging studies; in this latter case, two different UAAs were genetically incorporated into the antibody [27].

Potential Clinical Applications of UAAs

UAAs and their derivatives may have clinical applications. Several studies have demonstrated the development of UAA-containing synthetic molecules with antibacterial and antitumor functions. In particular, naphthalene-tripeptides containing α-aminoisobutyric acid downregulated the growth of melanoma cells in vitro. Furthermore, a D-enantiomer of alanine bearing naphthalene-tripeptides suppressed the growth of Staphylococcus epidermidis [68]. Other research has generated cationic antimicrobial peptides containing UAAs, which have shown antimicrobial effects against both Gram-positive and Gram-negative bacteria [69].

Another application of UAAs is the generation of protein inhibitors, which would be effective in drug design. Several selective inhibitors of Keap1‐Nrf2 protein‐protein interaction with potential application for the treatment of neurodegenerative disorders were synthesized by inserting UAAs, such as thiazolidine-4-carboxylic acid and piperidine-2-carboxylic acid [70].

Unnatural Amino AcidHostUnique property of the amino acidReference
p-iodo-L-tyrosineE. coli, yeast mammalianThe heavy atom iodine facilitates structural studies of proteins. This UAA can also be used for determination of tyrosine phosphorylation sites in a protein [11, 71, 72]
p-acetyl-L-phenylalanineE. coli, yeast,The acetyl group can be modified bioorthogonally with any other probe with a hydroxylamine group. [11, 62, 73]
p-benzoyl-L-phenylalanine, (Bpa)E. coli, yeast, mammalianCross-links with nearby C-H bonds when exposed to light between 350-360 nm. [11, 13, 14, 41, 42]
p-azido-L-phenylalanine (AzPhe)E. coli, yeast, mammalianAzPhe has been used as a photocrosslinking UAA to determine the interactions sites on proteins. This UAA has also been used as an IR-active probe for detection of any conformational change in proteins. AzPhe also cross-links with triarylphosphines, alkyne or DIBO alkynes in a bioorthogonal manner. [11, 41, 44, 60, 63, 64]
O-Methyl L-tyrosineE. coli, yeast, mammalianIncreases the bulk of tyrosine [15, 74]
L-(7-hydroxycoumarin-4-yl) EthylglycineE. coliFluorescent amino acid [53]
dansylalanineYeastFluorescent amino acid [54]
(S)-1-carboxy-3-(7-hydroxy-2-oxo-2H-chromen-4-yl)propan-1-aminium (CouAA)E. coliFluorescent amino acid [55, 56]
3-(6-acetylnaphthalen-2-ylamino)-2-aminopropanoic acid (Anap)Yeast, mammalsFluorescent amino acid [57, 75]
Photocaged tyrosine derivativesE.coli, mammalianPhotocaged amino acid [49]
Photocaged cysteine derivativesYeastPhotocaged amino acid [22]
Photocaged lysine derivativesE. coli, mammalianPhotocaged amino acid [23]
Photocaged serine derivativesYeastPhotocaged amino acid [51]
(2,2'-bipyridin-5-yl)alanine (Bpy-Ala) (A Fe2+/3+, Cu2+, Co2+/3+, and Ru2+/3+ chelating UAA)E. coliThis UAA was site-specifically introduced in a DNA binding protein. The UAA modified protein site-specifically cleaved double-stranded DNA. [76, 77]
2-nitrophenyl alanine (2-NPA)E. coliWhen irradiated at 365 nm, the 2-NPA residue photocleaves the protein specifically at the site of incorporation. [78]
p-carboxymethyl-phenylalanineE. coliA stable phosphotyrosine analogue, resistant to hydrolysis by protein tyrosine phosphatase resulting in constitutively active proteins. [79]
Table 2. A short list of UAAs incorporated into the genetic machinery of bacterial, yeast and mammalian cells along with their unique features.
References
  1. Chen S, Gopalakrishnan R, Schaer T, Marger F, Hovius R, Bertrand D, et al. Dithiol amino acids can structurally shape and enhance the ligand-binding properties of polypeptides. Nat Chem. 2014;6:1009-16 pubmed publisher
  2. Liu C, Schultz P. Adding new chemistries to the genetic code. Annu Rev Biochem. 2010;79:413-44 pubmed publisher
  3. Neumann H. Rewiring translation - Genetic code expansion and its applications. FEBS Lett. 2012;586:2057-64 pubmed publisher
  4. Lang K, Chin J. Cellular incorporation of unnatural amino acids and bioorthogonal labeling of proteins. Chem Rev. 2014;114:4764-806 pubmed publisher
  5. Martin R, Des Soye B, Kwon Y, Kay J, Davis R, Thomas P, et al. Cell-free protein synthesis from genomically recoded bacteria enables multisite incorporation of noncanonical amino acids. Nat Commun. 2018;9:1203 pubmed publisher
  6. Cai Q, Hanson J, Steiner A, Tran C, Masikat M, Chen R, et al. A simplified and robust protocol for immunoglobulin expression in Escherichia coli cell-free protein synthesis systems. Biotechnol Prog. 2015;31:823-31 pubmed publisher
  7. Reinkemeier C, Girona G, Lemke E. Designer membraneless organelles enable codon reassignment of selected mRNAs in eukaryotes. Science. 2019;363: pubmed publisher
  8. Hoshika S, Leal N, Kim M, Kim M, Karalkar N, Kim H, et al. Hachimoji DNA and RNA: A genetic system with eight building blocks. Science. 2019;363:884-887 pubmed publisher
  9. Gallivan J, Lester H, Dougherty D. Site-specific incorporation of biotinylated amino acids to identify surface-exposed residues in integral membrane proteins. Chem Biol. 1997;4:739-49 pubmed
  10. Turcatti G, Nemeth K, Edgerton M, Meseth U, Talabot F, Peitsch M, et al. Probing the structure and function of the tachykinin neurokinin-2 receptor through biosynthetic incorporation of fluorescent amino acids at specific sites. J Biol Chem. 1996;271:19991-8 pubmed
  11. Liu W, Brock A, Chen S, Chen S, Schultz P. Genetic incorporation of unnatural amino acids into proteins in mammalian cells. Nat Methods. 2007;4:239-44 pubmed
  12. Chin J, Santoro S, Martin A, King D, Wang L, Schultz P. Addition of p-azido-L-phenylalanine to the genetic code of Escherichia coli. J Am Chem Soc. 2002;124:9026-7 pubmed
  13. Chin J, Martin A, King D, Wang L, Schultz P. Addition of a photocrosslinking amino acid to the genetic code of Escherichiacoli. Proc Natl Acad Sci U S A. 2002;99:11020-4 pubmed
  14. Chin J, Cropp T, Anderson J, Mukherji M, Zhang Z, Schultz P. An expanded eukaryotic genetic code. Science. 2003;301:964-7 pubmed
  15. Wang L, Brock A, Herberich B, Schultz P. Expanding the genetic code of Escherichia coli. Science. 2001;292:498-500 pubmed
  16. Drabkin H, Park H, RajBhandary U. Amber suppression in mammalian cells dependent upon expression of an Escherichia coli aminoacyl-tRNA synthetase gene. Mol Cell Biol. 1996;16:907-13 pubmed
  17. Sakamoto K, Hayashi A, Sakamoto A, Kiga D, Nakayama H, Soma A, et al. Site-specific incorporation of an unnatural amino acid into proteins in mammalian cells. Nucleic Acids Res. 2002;30:4692-9 pubmed
  18. Wang F, Robbins S, Guo J, Shen W, Schultz P. Genetic incorporation of unnatural amino acids into proteins in Mycobacterium tuberculosis. PLoS ONE. 2010;5:e9354 pubmed publisher
  19. Mondal S, Wang S, Zheng Y, Sen S, Chatterjee A, Thompson P. Site-specific incorporation of citrulline into proteins in mammalian cells. Nat Commun. 2021;12:45 pubmed publisher
  20. Hino N, Okazaki Y, Kobayashi T, Hayashi A, Sakamoto K, Yokoyama S. Protein photo-cross-linking in mammalian cells by site-specific incorporation of a photoreactive amino acid. Nat Methods. 2005;2:201-6 pubmed
  21. Hancock S, Uprety R, Deiters A, Chin J. Expanding the genetic code of yeast for incorporation of diverse unnatural amino acids via a pyrrolysyl-tRNA synthetase/tRNA pair. J Am Chem Soc. 2010;132:14819-24 pubmed publisher
  22. Wu N, Deiters A, Cropp T, King D, Schultz P. A genetically encoded photocaged amino acid. J Am Chem Soc. 2004;126:14306-7 pubmed
  23. Chen P, Groff D, Guo J, Ou W, Cellitti S, Geierstanger B, et al. A facile system for encoding unnatural amino acids in mammalian cells. Angew Chem Int Ed Engl. 2009;48:4052-5 pubmed publisher
  24. Agostini F, Völler J, Koksch B, Acevedo Rocha C, Kubyshkin V, Budisa N. Biocatalysis with Unnatural Amino Acids: Enzymology Meets Xenobiology. Angew Chem Int Ed Engl. 2017;56:9680-9703 pubmed publisher
  25. Drienovska I, Mayer C, Dulson C, Roelfes G. A designer enzyme for hydrazone and oxime formation featuring an unnatural catalytic aniline residue. Nat Chem. 2018;10:946-952 pubmed publisher
  26. Davis L, Greiss S. Genetic Encoding of Unnatural Amino Acids in C. elegans. Methods Mol Biol. 2018;1728:389-408 pubmed publisher
  27. Xiao H, Chatterjee A, Choi S, Bajjuri K, Sinha S, Schultz P. Genetic incorporation of multiple unnatural amino acids into proteins in mammalian cells. Angew Chem Int Ed Engl. 2013;52:14080-3 pubmed publisher
  28. Neumann H, Wang K, Davis L, Garcia Alai M, Chin J. Encoding multiple unnatural amino acids via evolution of a quadruplet-decoding ribosome. Nature. 2010;464:441-4 pubmed publisher
  29. Niu W, Schultz P, Guo J. An expanded genetic code in mammalian cells with a functional quadruplet codon. ACS Chem Biol. 2013;8:1640-5 pubmed publisher
  30. Yanagisawa T, Ishii R, Fukunaga R, Kobayashi T, Sakamoto K, Yokoyama S. Multistep engineering of pyrrolysyl-tRNA synthetase to genetically encode N(epsilon)-(o-azidobenzyloxycarbonyl) lysine for site-specific protein modification. Chem Biol. 2008;15:1187-97 pubmed publisher
  31. Xie J, Schultz P. An expanding genetic code. Methods. 2005;36:227-38 pubmed
  32. Ryu Y, Schultz P. Efficient incorporation of unnatural amino acids into proteins in Escherichia coli. Nat Methods. 2006;3:263-5 pubmed
  33. Dickey T, Song B, Pyle A. RNA binding activates RIG-I by releasing an autorepressed signaling domain. Sci Adv. 2019;5:eaax3641 pubmed publisher
  34. Romei M, Lin C, Mathews I, Boxer S. Electrostatic control of photoisomerization pathways in proteins. Science. 2020;367:76-79 pubmed publisher
  35. Cervettini D, Tang S, Fried S, Willis J, Funke L, Colwell L, et al. Rapid discovery and evolution of orthogonal aminoacyl-tRNA synthetase-tRNA pairs. Nat Biotechnol. 2020;: pubmed publisher
  36. Valiyaveetil F, MacKinnon R, Muir T. Semisynthesis and folding of the potassium channel KcsA. J Am Chem Soc. 2002;124:9113-20 pubmed
  37. Valiyaveetil F, Leonetti M, Muir T, MacKinnon R. Ion selectivity in a semisynthetic K+ channel locked in the conductive conformation. Science. 2006;314:1004-7 pubmed
  38. Jagadish K, Borra R, Lacey V, Majumder S, Shekhtman A, Wang L, et al. Expression of fluorescent cyclotides using protein trans-splicing for easy monitoring of cyclotide-protein interactions. Angew Chem Int Ed Engl. 2013;52:3126-31 pubmed publisher
  39. Elsässer S, Ernst R, Walker O, Chin J. Genetic code expansion in stable cell lines enables encoded chromatin modification. Nat Methods. 2016;13:158-64 pubmed publisher
  40. Lin Z, Gasic I, Chandrasekaran V, Peters N, Shao S, Mitchison T, et al. TTC5 mediates autoregulation of tubulin via mRNA degradation. Science. 2020;367:100-104 pubmed publisher
  41. Grunbeck A, Huber T, Abrol R, Trzaskowski B, Goddard W, Sakmar T. Genetically encoded photo-cross-linkers map the binding site of an allosteric drug on a G protein-coupled receptor. ACS Chem Biol. 2012;7:967-72 pubmed publisher
  42. Das S, Oliver D. Mapping of the SecA·SecY and SecA·SecG interfaces by site-directed in vivo photocross-linking. J Biol Chem. 2011;286:12371-80 pubmed publisher
  43. Chou C, Uprety R, Davis L, Chin JW, Deiters A. Genetically encoding an aliphatic diazirine for protein photocrosslinking. Chem. Sci. 2011;2:480-483. Available from: dx.doi.org/10.1039/C0SC00373E
  44. Grunbeck A, Huber T, Sachdev P, Sakmar T. Mapping the ligand-binding site on a G protein-coupled receptor (GPCR) using genetically encoded photocrosslinkers. Biochemistry. 2011;50:3411-3 pubmed publisher
  45. Grunbeck A, Huber T, Sakmar T. Mapping a ligand binding site using genetically encoded photoactivatable crosslinkers. Methods Enzymol. 2013;520:307-22 pubmed publisher
  46. Yang Y, Song H, He D, Zhang S, Dai S, Lin S, et al. Genetically encoded protein photocrosslinker with a transferable mass spectrometry-identifiable label. Nat Commun. 2016;7:12299 pubmed publisher
  47. Taupitz K, Dörner W, Mootz H. Covalent Capturing of Transient SUMO-SIM Interactions Using Unnatural Amino Acid Mutagenesis and Photocrosslinking. Chemistry. 2017;23:5978-5982 pubmed publisher
  48. Deiters A, Groff D, Ryu Y, Xie J, Schultz P. A genetically encoded photocaged tyrosine. Angew Chem Int Ed Engl. 2006;45:2728-31 pubmed
  49. Chou C, Young D, Deiters A. A light-activated DNA polymerase. Angew Chem Int Ed Engl. 2009;48:5950-3 pubmed publisher
  50. Chou C, Young D, Deiters A. Photocaged t7 RNA polymerase for the light activation of transcription and gene function in pro- and eukaryotic cells. Chembiochem. 2010;11:972-7 pubmed publisher
  51. Lemke E, Summerer D, Geierstanger B, Brittain S, Schultz P. Control of protein phosphorylation with a genetically encoded photocaged amino acid. Nat Chem Biol. 2007;3:769-72 pubmed
  52. Walker O, Elsässer S, Mahesh M, Bachman M, Balasubramanian S, Chin J. Photoactivation of Mutant Isocitrate Dehydrogenase 2 Reveals Rapid Cancer-Associated Metabolic and Epigenetic Changes. J Am Chem Soc. 2016;138:718-21 pubmed publisher
  53. Wang J, Xie J, Schultz P. A genetically encoded fluorescent amino acid. J Am Chem Soc. 2006;128:8738-9 pubmed
  54. Summerer D, Chen S, Wu N, Deiters A, Chin J, Schultz P. A genetically encoded fluorescent amino acid. Proc Natl Acad Sci U S A. 2006;103:9785-9 pubmed
  55. Charbon G, Brustad E, Scott K, Wang J, Løbner Olesen A, Schultz P, et al. Subcellular protein localization by using a genetically encoded fluorescent amino acid. Chembiochem. 2011;12:1818-21 pubmed publisher
  56. Charbon G, Wang J, Brustad E, Schultz P, Horwich A, Jacobs Wagner C, et al. Localization of GroEL determined by in vivo incorporation of a fluorescent amino acid. Bioorg Med Chem Lett. 2011;21:6067-70 pubmed publisher
  57. Chatterjee A, Guo J, Lee H, Schultz P. A genetically encoded fluorescent probe in mammalian cells. J Am Chem Soc. 2013;135:12540-3 pubmed publisher
  58. Stempel E, Kaml R, Budisa N, Kalesse M. Painting argyrins blue: Negishi cross-coupling for synthesis of deep-blue tryptophan analogue β-(1-azulenyl)-l alanine and its incorporation into argyrin C. Bioorg Med Chem. 2018;26:5259-5269 pubmed publisher
  59. Ye S, Huber T, Vogel R, Sakmar T. FTIR analysis of GPCR activation using azido probes. Nat Chem Biol. 2009;5:397-9 pubmed publisher
  60. Ye S, Zaitseva E, Caltabiano G, Schertler G, Sakmar T, Deupi X, et al. Tracking G-protein-coupled receptor activation using genetically encoded infrared probes. Nature. 2010;464:1386-9 pubmed publisher
  61. Lemke E. Site-specific labeling of proteins for single-molecule FRET measurements using genetically encoded ketone functionalities. Methods Mol Biol. 2011;751:3-15 pubmed publisher
  62. Fleissner M, Brustad E, Kalai T, Altenbach C, Cascio D, Peters F, et al. Site-directed spin labeling of a genetically encoded unnatural amino acid. Proc Natl Acad Sci U S A. 2009;106:21637-42 pubmed publisher
  63. Naganathan S, Ye S, Sakmar T, Huber T. Site-specific epitope tagging of G protein-coupled receptors by bioorthogonal modification of a genetically encoded unnatural amino acid. Biochemistry. 2013;52:1028-36 pubmed publisher
  64. Huber T, Naganathan S, Tian H, Ye S, Sakmar T. Unnatural amino acid mutagenesis of GPCRs using amber codon suppression and bioorthogonal labeling. Methods Enzymol. 2013;520:281-305 pubmed publisher
  65. Tyagi S, Lemke E. Genetically encoded click chemistry for single-molecule FRET of proteins. Methods Cell Biol. 2013;113:169-87 pubmed publisher
  66. Frost J, Jacob N, Papa L, Owens A, Fasan R. Ribosomal Synthesis of Macrocyclic Peptides in Vitro and in Vivo Mediated by Genetically Encoded Aminothiol Unnatural Amino Acids. ACS Chem Biol. 2015;10:1805-16 pubmed publisher
  67. VanBrunt M, Shanebeck K, Caldwell Z, Johnson J, Thompson P, Martin T, et al. Genetically Encoded Azide Containing Amino Acid in Mammalian Cells Enables Site-Specific Antibody-Drug Conjugates Using Click Cycloaddition Chemistry. Bioconjug Chem. 2015;26:2249-60 pubmed publisher
  68. Nahhas A, Chang R, Webster T. Introducing Unnatural Amino Acids-Containing Tripeptides as Antimicrobial and Anticancer Agents. J Biomed Nanotechnol. 2018;14:987-993 pubmed publisher
  69. Oliva R, Chino M, Pane K, Pistorio V, De Santis A, Pizzo E, et al. Exploring the role of unnatural amino acids in antimicrobial peptides. Sci Rep. 2018;8:8888 pubmed publisher
  70. Georgakopoulos N, Talapatra S, Gatliff J, Kozielski F, Wells G. Modified peptide inhibitors of the Keap1-Nrf2 protein-protein interaction incorporating unnatural amino acids. Chembiochem. 2018;: pubmed publisher
  71. Xie J, Wang L, Wu N, Brock A, Spraggon G, Schultz P. The site-specific incorporation of p-iodo-L-phenylalanine into proteins for structure determination. Nat Biotechnol. 2004;22:1297-301 pubmed
  72. Hayashi A, Hino N, Kobayashi T, Arai R, Shirouzu M, Yokoyama S, et al. Dissecting cell signaling pathways with genetically encoded 3-iodo-L-tyrosine. Chembiochem. 2011;12:387-9 pubmed publisher
  73. Zeng H, Xie J, Schultz P. Genetic introduction of a diketone-containing amino acid into proteins. Bioorg Med Chem Lett. 2006;16:5356-9 pubmed
  74. Wang W, Takimoto J, Louie G, Baiga T, Noel J, Lee K, et al. Genetically encoding unnatural amino acids for cellular and neuronal studies. Nat Neurosci. 2007;10:1063-72 pubmed
  75. Lee H, Guo J, Lemke E, Dimla R, Schultz P. Genetic incorporation of a small, environmentally sensitive, fluorescent probe into proteins in Saccharomyces cerevisiae. J Am Chem Soc. 2009;131:12921-3 pubmed publisher
  76. Lee H, Schultz P. Biosynthesis of a site-specific DNA cleaving protein. J Am Chem Soc. 2008;130:13194-5 pubmed publisher
  77. Xie J, Liu W, Schultz P. A genetically encoded bidentate, metal-binding amino acid. Angew Chem Int Ed Engl. 2007;46:9239-42 pubmed publisher
  78. Peters F, Brock A, Wang J, Schultz P. Photocleavage of the polypeptide backbone by 2-nitrophenylalanine. Chem Biol. 2009;16:148-52 pubmed publisher
  79. Xie J, Supekova L, Schultz P. A genetically encoded metabolically stable analogue of phosphotyrosine in Escherichia coli. ACS Chem Biol. 2007;2:474-8 pubmed
ISSN : 2329-5139