Journal of Bacteriology, December 2008, p. 7699-7708, Vol. 190, No. 23
0021-9193/08/$08.00+0 doi:10.1128/JB.00997-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.


Department of Biology, American University of Beirut, Beirut, Lebanon
Received 19 July 2008/ Accepted 12 September 2008
λ, and φ21 N proteins produced mutants that displayed bias. P22 N– plaque size plotted against boxBleft and boxBright reporter activities suggests that lytic viral fitness depends on balanced antitermination. A few N proteins were able to complement both λ N- and P22 N-deficient viruses, but no proteins were found to complement both P22 N- and φ21 N-deficient viruses. A single tryptophan substitution allowed P22 N to complement both P22 and λ N–. The existence of relaxed-specificity mutants suggests that conformational plasticity provides evolutionary transitions between distinct modes of RNA-protein recognition.
λ, P22, φ21, and other phages regulate the expression of delayed early genes by allowing transcription past terminators in the Pleft and Pright operons. These phages share regulatory mechanisms with λ, but they have uncertain evolutionary relationships that are obscured by recombination among tailed phages (Caudovirales) (7, 8). In phage λ, antitermination allows expression of genes regulating the development of lysis or lysogeny (12). The assembly of transcription antitermination complexes in P22, λ, and φ21 is initiated by the binding of viral N proteins to small hairpin boxB RNAs in the nut sites (N utilization) of regulated transcripts (40). These complexes contain N and host factors, including NusA and the transcribing polymerase, allowing transcription to proceed through downstream transcription termination signals. P22, λ, and φ21 exhibit type specificity, where the N protein of one virus cannot complement its absence in a different virus (13, 24).
N proteins recognize their cognate boxB RNAs via arginine-rich domains near their amino termini (21). Their boxBs are hairpin RNAs that have little sequence similarity, yet similar secondary structures (Fig. 1A). Alignment of P22, λ, and φ21 reveals that N protein RNA-binding domains contain four conserved amino acids (Fig. 1B). Protein and RNA sequence differences create specificity; noncognate interactions function poorly. boxBs bind noncognate N peptides poorly in vitro (1, 10, 39). Likewise, noncognate N-nut interactions do not function in vivo (19, 29), and noncognate N proteins do not rescue N-deficient viruses (13, 24). Compared to the extensive genetic and biochemical work on λ N-boxB, there have been far fewer studies of the corresponding φ21 and P22 interactions. Insight into the φ21 N-boxB complex is limited to biochemical characterization (1) and a nuclear magnetic resonance (NMR) model (9). Previous P22 studies included extensive mutagenesis of P22 boxBs (11), examination of a limited number of P22-λ hybrid N proteins (18), and an NMR model of the P22 N peptide bound to P22 boxBleft (5). Despite the insight provided by the NMR model, the roles of most P22 N residues in recognizing P22 boxBs and the basis of discrimination against λ and φ21 boxBs remain unclear.
FIG. 1. Comparison of boxB RNAs and N RNA-binding domains of lambdoid phages. (A) The secondary structures of boxBs found in bacteriophages P22, λ, and φ21 from the beginning of the boxB stem. PL, P22 boxBleft; PR, P22 boxBright; LL, λ boxBleft; LR, λ boxBright; FL, φ21 boxBleft; FR, φ21 boxBright. Noncanonical base pairs are indicated by a connecting dash. (B) Alignment of wild-type N proteins and libraries used in this study, with the RNA-binding domains flanked by spaces. Residues indicated by bold type are conserved in the three phage proteins and were not mutated in libraries. Single libraries were made using randomized codons at each nonconserved position in the RNA-binding domain of P22 N. The hybrid library is described by numbers indicating the number of possible amino acids at each position, and in most cases the sets of amino acids include only the corresponding residues from P22, λ, and φ21 at each position; the exception is the residue indicated by the number 3 in bold type, at which an arginine substitutes for the glutamine of λ N. X indicates many amino acids (see Materials and Methods for details).
λ, and φ21 N proteins bind as α helices in the major grooves of the cognate boxBs, making extensive contact with the boxB loop and 5' backbone (9). Interestingly, few base-specific contacts are made between the N proteins and boxB RNAs. P22, λ, and φ21 boxBs adopt similar, yet distinct loop conformations. P22 adopts a 3-out GNRA-like pentaloop (5), λ adopts a 4-out GNRA-like pentaloop (30, 37), and the apical four nucleotides of the φ21 loop adopt a non-GNRA U-turn (9). All of them are structurally similar in having all but one loop base stack on the 3' half. The results of mutagenesis and biochemical studies and NMR data support a model in which the N peptides recognize specific conformations of boxB, with little direct recognition of sequence. The antitermination levels reflect N-boxB affinity, but there are important differences between the host factor dependence of P22, λ, and φ21 N-nut on antitermination (20). λ N antitermination appears to require that λ N Trp18 stacks on the loop of boxB in order to properly bind Escherichia coli NusA to achieve full antitermination (41).
The NMR-derived structural model of the P22 N-boxBleft complex (5) provides limited guidance for understanding the sequence requirements of P22 N protein and the origin of type specificity (Fig. 2A, B, and C). Many contacts are made between the peptide and RNA, but there are few base-specific contacts between amino acid side chains and the RNA (Fig. 2A). Of the residues conserved in P22, λ, and φ21, Ala15 makes hydrophobic contact with boxB bases, and arginines contact backbone phosphates (Fig. 2D). P22 NMR data support a certain role for only one nonconserved P22 N residue, Arg19, and the results of substitution of λ N residues in P22 N (18) suggest that Asn14, Lys16, Arg24, Ala27, Ile28, and Arg30 are important to P22 N function or specificity, yet the structural roles of these residues are uncertain (Fig. 2E). The nonconserved residues likely have important roles in providing both affinity and specificity, including discrimination against noncognate boxBs.
FIG. 2. Observed contacts and NMR model of P22 N peptide bound to P22 boxBleft. (A) Schematic diagram of the secondary structure of P22 boxBleft when it is bound to P22 N peptide and the sequence of the P22 N peptide. The lines indicate observed close contacts between amino acid side chains and RNA bases (5). boxB numbering starts from the 5' nucleotide at the base of the stem, and N amino acid numbering starts from the amino-terminal methionine of P22 N. Conserved amino acids are indicated by bold type. Cytosine 9 is extruded from the loop, whereas all other bases are stacked. (B) View of the NMR model from the minor groove, in an orientation similar to that of the schematic diagram in A, with the peptide atoms indicated by light gray spheres and the RNA bases indicated by dark gray sticks. The RNA backbone is not shown for clarity. (C) View of the NMR model from the major groove, where the peptide backbone is indicated by dark gray sticks and the RNA bases are indicated by light gray sticks. Peptide side chains and the RNA backbone are not shown. (D) View from the major groove, with RNA indicated by light gray spheres and the peptide indicated by dark gray sticks. Only the peptide backbone and conserved side chains are shown. (E) Same as panel D, but only nonconserved peptide side chains previously implicated in P22 N function are shown (1, 18).
λ, and φ21 N-boxBs are related, and understanding what evolutionary paths connect them may provide general insight into how new RNA-protein recognition strategies evolve. Neutral theorists contend that there are enough mutants with neutral fitness to create incremental paths to new phenotypes without any intermediate loss of function (28, 34). Intriguingly, workers have found both RNAs and peptides that adopt different conformations in different contexts (11, 33, 38). These chameleon sequences could allow smooth evolutionary transitions between distinct recognition strategies.
In order to determine functional roles of nonconserved amino acids of the P22 N RNA-binding domain, we screened 13 single-codon, randomized libraries of P22 N for activity using a plasmid-based β-galactosidase reporter system that reconstitutes antitermination in E. coli (19). A library of hybrid N RNA-binding domains of P22, λ, and φ21 was also screened to find sequences that distinguish between boxBleft and boxBright. The relationship between boxBleft and boxBright antitermination and plaque size resulting from the complementation of clear N– virus was also examined. We found that a single R30W substitution and two hybrids are able to complement both P22 and λ N– viruses, suggesting that there are multiple evolutionary paths between these similar, yet distinct protein-RNA recognition strategies.
Bacterial strains, plasmids, and bacteriophages.
Escherichia coli supporting antitermination, N567 (15), pBR-ptac-N*λ (19), and lytic N– λ phages with immunity regions of λ, P22, and φ21 (phage λ imm2224amclr, phage λ Clear Nam7am53 [18], and phage λ imm21Nam Clear) were obtained from Naomi Franklin (University of Utah). DH5α cells and control plasmids for the human immunodeficiency virus type 1 (HIV-1) Rev-RRE interaction, pBRN-HIVRev (23) and pAC-HIV-RRE (23), were obtained from Kazuo Harada (Tokyo Gakugei University). P22 boxB reporter plasmids are replacements of λ boxB in the λ nutleft site by P22 boxBleft or boxBright (11).
Construction of N fusions and libraries.
All mutant and hybrid RNA-binding domains were expressed such that they replaced the RNA-binding domain of λ N (residues 1 to 19), which is in an NcoI-BsmI cassette. Standard molecular biology procedures were used, and all clones were tested for function and were sequenced across the synthetic insert to confirm identity.
To construct the P22 N supplier plasmid pBRNP22N12-30, referred to below as wt P22 N, a double-stranded DNA with NcoI and BsmI sites was designed to contain an initiation codon followed by P22 N residues 12 to 30. It was made by annealing and extending two mutually priming synthetic oligonucleotides, P22N12-30F (5'-GCG CCC ATG GCA GGC AAT GCT AAA ACT CGT CGC CAC GAA CGT CGC-3') and P22N12-30R (5'-GGG ATT TGC ATT CCG CTC AAT CGC CAG TTT ACG GCG ACG TTC GTG GCG ACG-3'). Following primer extension with Thermus aquaticus DNA polymerase I, the product was digested with NcoI and BsmI and cloned into the pBR-ptac-N*λ backbone so that the expressed N protein was a fusion between P22 N residues 12 to 30 (MAGNAKTRRHERRRKLAIER) and λ N residues 19 to 107.
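The mutual-priming design can be checked in silico. The following minimal Python sketch is an illustration only and is not part of the original methods; the function names are hypothetical. It finds the 3' overlap between P22N12-30F and the reverse complement of P22N12-30R, assembles the coding strand of the extension product, and translates it from the initiation codon using the standard genetic code.

```python
"""Sketch: in silico extension of two mutually priming oligonucleotides."""
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def revcomp(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def extend(forward, reverse):
    """Join two mutually priming oligos; return (coding strand, overlap length)."""
    rc = revcomp(reverse)
    for n in range(min(len(forward), len(rc)), 0, -1):
        if forward.endswith(rc[:n]):  # longest 3' overlap wins
            return forward + rc[n:], n
    raise ValueError("oligonucleotides do not overlap")


def translate(seq):
    """Translate from the first ATG to the end of the sequence."""
    start = seq.index("ATG")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(start, len(seq) - 2, 3))


# Oligonucleotide sequences as given in the text (spaces removed).
P22N12_30F = "GCG CCC ATG GCA GGC AAT GCT AAA ACT CGT CGC CAC GAA CGT CGC".replace(" ", "")
P22N12_30R = ("GGG ATT TGC ATT CCG CTC AAT CGC CAG TTT ACG GCG ACG "
              "TTC GTG GCG ACG").replace(" ", "")

duplex, overlap = extend(P22N12_30F, P22N12_30R)
print("overlap (nt):", overlap)
print("translation:", translate(duplex))
```

Under these assumptions the two oligonucleotides anneal over their 3'-terminal 18 nucleotides, and the translated product begins with the MAGNAKTRRHERRRKLAIER sequence quoted above. The same check applies to any other pair of mutually priming oligonucleotides.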
pBRNphi21N1-30, referred to below as wt φ21 N, was constructed by using a similar strategy and synthetic oligonucleotides Phi21N1-30F (5'-GAGCCCATGGTAACCATTGTCTGGAAAGAATCCAAAGGTACGGCAAAAAGCCGCTACAAAGCTCGC-3') and Phi21N1-30R (5'-GACTGCTGCATTCGAGCGTCGCTCGGCAATAAGTTCTGCTCTGCGAGCTTTGTAGCGGCTTTTTGC-3'). The expressed N protein was a fusion between φ21 N residues 1 to 30 (MVTIVWKESKGTAKSRYKARRAELIAERRS) and λ N residues 19 to 107.
P22N12-30 libraries were cloned from synthetic double-stranded NcoI-BsmI fragments, like wt P22 N (Fig. 1B). The inserts were created using four similar strategies, all of which created the same sequence as wt P22 N, with one codon completely randomized. For libraries Asn14X, Lys16X, and Thr17X, single-codon randomized oligonucleotides based on CF1 (5'-GCG CCC ATG GCA GGC AAT GCT AAA ACT CGT CGC CAC GAA CGT CGC-3') were used with unmutated oligonucleotide CR1 (5'-GGG ATT TGC ATT CCG CTC AAT CGC CAG TTT ACG GCG ACG TTC GTG GCG ACG-3') and mutual priming. For libraries Arg19X, His20X, and Glu21X, single-codon randomized oligonucleotides based on CF2 (5'-GCG CCC ATG GCA GGC AAT GCT AAA ACT CGT CGC CAC GAA CGT CGC CGT AAA CTG GCG-3') were used with unmutated oligonucleotide CR2 (5'-GGG ATT TGC ATT CCG CTC AAT CGC CAG TTT ACG GCG ACG-3'). For libraries Arg24X, Lys25X, Leu26X, Ala27X, Ile28X, and Glu29X, unmutated oligonucleotide CF1 and single-codon randomized oligonucleotides based on CR1 were used. Library Arg30X was made by extending primer R30XR (5'-C ATT NNN CTC AAT CGC CAG TTT-3') on single-codon randomized oligonucleotide R30XF (5'-GCG CCC ATG GCA GGC AAT GCT AAA ACT CGT CGC CAC GAA CGT CGC CGT AAA CTG GCG ATT GAG NNN AAT GCA-3'). Most of the resulting double-stranded DNAs were digested with NcoI and BsmI; the only exception was the R30X DNA, in which the BsmI overhang was preformed.
The hybrid library was constructed using a combinatorial strategy involving codon-based mutagenesis, degenerate codons, and mixing of solid support resins during oligonucleotide synthesis (Fig. 1B). The library was designed to include every possible hybrid of P22, λ, and φ21 RNA-binding domains fused to the λ N activation domain. Because no two degenerate codons could encode only Gln, Ala, and Ile and because λ NQ15R (which aligns with P22 N Ala27) has been reported to function (41), an arginine codon was substituted for λ N Gln15. The hybrid region was flanked by degenerate codons. The sequence of the resulting insert, represented by the coding strand, was 5'-GGC CCC ATG GNN NNN GGT (RAT/AMT) GCT MAA WCT CGC (CGT/TAT) (CRT/ARA) GMA CGT CGA (GCC/CGC) RAA (AAA/CTG) (AKA/GCC) (GCA/ATT) SAA (TGG/CGT) NNG AAT GCA GCA AAT CCC-3', and the resulting expressed N proteins were a 22-residue library fused to λ N19-107 with the sequence M(A/D/E/G/V)(X)G(D/N/T)A(K/Q)(S/T)R(R/Y)(H/K/R)(A/E)RR(A/R)(E/K)(K/L)(A/I/R)(A/I)(E/Q)(R/W)(A/E/G/K/L/M/P/Q/R/S/T/V/W), where degenerate positions are indicated by parentheses and X indicates all possible amino acids and stop codons. The DNA insert encoding this library was created by NcoI and BsmI digestion of the extension product of two mutually priming, degenerate oligonucleotides. Each degenerate oligonucleotide (HybridF and HybridR) was synthesized using a combinatorial strategy with two concurrent synthesis programs and mixing solid support before and after each divergent codon. Standard synthesizer codes were used for degenerate nucleotides (N = A + C + G + T, R = A + G, Y = C + T, K = G + T, M = A + C, S = G + C, and W = A + T). HybridF was made using HybridF1 and HybridF2 programs (HybridF1, 5'-GGC CCC ATG GNN NNN GGT RAT GCT MAA WCT CGC CGT CRT GMA CGT CGA-3'; HybridF2, 5'-GGC CCC ATG GNN NNN GGT AMT GCT MAA WCT CGC TAT ARA GMA CGT CGA-3'), and HybridR was made using HybridR1 and HybridR2 programs (HybridR1, 5'-GGG ATT TGC TGC ATT CNN CCA TTS TGC TMT TTT TTY GGC TCG ACG TKC AYG ACG GCG A-3'; HybridR2, 5'-GGG ATT TGC TGC ATT CNN ACG TTS AAT GGC CAG TTY GCG TCG ACG TKC TYT ATA GCG A-3'). P22 N R19K and P22 N H20Y inserts were directly constructed by annealing coding and noncoding oligonucleotides creating preformed NcoI and BsmI overhangs.
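The amino acid set encoded at each degenerate position follows directly from the synthesizer codes and the standard genetic code. The short Python sketch below is illustrative only; the helper name `encoded_residues` is hypothetical and not from the original work. It expands a degenerate codon, or a pair of codons combined by resin mixing, into the residues it can encode.

```python
"""Sketch: expand degenerate codons into the amino acids they encode."""
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Degenerate nucleotide codes as defined in the text.
DEGENERATE = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT", "R": "AG", "Y": "CT", "K": "GT",
    "M": "AC", "S": "GC", "W": "AT",
}


def encoded_residues(*codons):
    """Return the sorted residues ('*' = stop) encoded by one or more codons."""
    residues = set()
    for codon in codons:
        for bases in product(*(DEGENERATE[b] for b in codon)):
            residues.add(CODON_TABLE["".join(bases)])
    return "/".join(sorted(residues))


print(encoded_residues("RAA"))         # E/K
print(encoded_residues("CRT", "ARA"))  # H/K/R
print(encoded_residues("AKA", "GCC"))  # A/I/R
print(encoded_residues("NNG"))         # 13 amino acids plus the amber stop
```

Applying the same function to every position of the insert is a quick way to confirm that a written protein library description agrees with the DNA design.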
DNA preparation and sequencing.
All N-expressing constructs were sequenced with PBRNR2 (5'-GGCTTGCTGTACCATGTG-3') using a BigDye Terminator v1.1 cycle sequencing kit (Applied Biosystems, United States) under standard conditions. The labeled products were subjected to electrophoresis with an ABI 310 genetic analyzer sequencing system (Applied Biosystems, United States). The entire insert was confirmed using Chromas Lite software from Technelysium (Australia).
Single-position library screening and X-Gal plate assays.
Competent N567 host cells carrying boxB reporter plasmids (P22 boxBleft, P22 boxBright, or HIV RRE) were transformed with the library or clones of interest and control plasmids, including the wild-type λ N supplier (pBR-ptac-N*λ), the P22 N fusion supplier (wt P22 N), the φ21 N supplier (wt φ21 N), and the HIV-Rev N fusion supplier (pBRN-HIVRev). Approximately 10 to 100 ng of plasmid per 100 µl of competent cells was transformed by heat shock and plated on tryptone medium plates containing 50 µg/ml ampicillin, 15 µg/ml chloramphenicol, and 80 µg/ml X-Gal (5-bromo-4-chloro-3-indolyl β-D-galactoside), which is a chromogenic substrate of the β-galactosidase reporter protein. All X-Gal plates included 0.05 mM IPTG (isopropyl β-D-thiogalactoside) to induce the tac promoters expressing N protein and the reporter transcript. The plates were viewed and scored after 1 day of incubation at 34°C and after a second day of incubation at 24°C.
Virus complementation assays.
Plaque assays were performed by standard procedures (36), using clear strains of N-deficient λ phage, in which the immunity region was either from phage λ or a replacement from phage P22 or φ21. Overnight cultures of N567 hosting N supplier plasmids were grown in LB with ampicillin at 37°C. Cultures were centrifuged and resuspended in 10 mM MgSO4 to obtain an optical density at 600 nm of 2.0. Cells were incubated for approximately 30 min with approximately 100 PFU of virus by mixing 0.05 ml cells and 0.05 ml virus in SM (100 mM NaCl, 8 mM MgSO4, 50 mM Tris-HCl [pH 7.5], 0.1 g/liter gelatin) along with 1.2 ml tryptone top agar (48°C) and immediately plated on 5-cm tryptone plates. No antibiotic was used in the bottom or top agar, but where indicated below, N expression was induced with 0.3 mM IPTG in the bottom agar. The plates were allowed to set before overnight incubation at 37°C. Plaque diameters were assessed the next morning by superimposition on positive controls, and the results were expressed as percentages of the cognate diameters.
ONPG solution antitermination assay.
For each N-boxB interaction, representative colonies were picked from X-Gal plates for use in solution assays (32). At least three independent colonies were used for each interaction. For measurement of N-mediated antitermination, cultures were grown overnight at 30°C with aeration in tryptone with 50 µg/ml ampicillin, 15 µg/ml chloramphenicol, and 0.05 mM IPTG. The cells were then permeabilized, the β-galactosidase activity was assayed using o-nitrophenyl-β-D-galactoside (ONPG), and the β-galactosidase activity was calculated by using the method of Miller (32). The activities were normalized using P22N12-30 for boxBleft and boxBright and RevN for RRE.
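The calculation and normalization can be sketched as follows. This is a minimal illustration that assumes the commonly used form of the Miller equation, Miller units = 1,000 × (A420 − 1.75 × A550)/(t × v × A600), with t in minutes and v in milliliters; the text cites only the method of Miller (32), so the exact variant used is an assumption, and the function names and absorbance readings are hypothetical.

```python
"""Sketch: Miller units and normalization to a control interaction."""
from statistics import mean


def miller_units(a420, a550, a600, minutes, volume_ml):
    """Commonly used Miller equation (assumed here, not quoted from the paper)."""
    return 1000.0 * (a420 - 1.75 * a550) / (minutes * volume_ml * a600)


def percent_of_control(sample_units, control_units):
    """Mean activity of replicate colonies as a percentage of the control mean."""
    return 100.0 * mean(sample_units) / mean(control_units)


# Hypothetical readings for three independent colonies of a mutant and of the control.
mutant = [miller_units(a420, 0.02, 0.50, 10, 0.1) for a420 in (0.40, 0.44, 0.42)]
control = [miller_units(a420, 0.02, 0.50, 10, 0.1) for a420 in (0.80, 0.84, 0.82)]
print(round(percent_of_control(mutant, control), 1))
```

Normalizing each reporter to its own control makes the boxBleft, boxBright, and RRE measurements comparable on a common percentage scale.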
Visualization of the structure.
Protein Explorer (31) was used to view the solution state NMR models of the structure of P22 N peptide-P22 boxBleft (Protein Data Bank accession number 1A4T) (5), λ N peptide-λ boxBright (Protein Data Bank accession number 1QFQ) (37), and φ21 N peptide-φ21 boxBright (Protein Data Bank accession number 1NYB) (9).
λ, and φ21 N proteins recognize boxB conformation, with little direct recognition of RNA sequence. The roles of only a few P22 N residues in boxB recognition and specificity are well understood based on the available P22 N-boxB NMR data (5) and mutagenesis data (18). We first constructed plasmids expressing P22 N and φ21 N RNA-binding domains fused to the λ N activation domain (wt P22 N and wt φ21 N, respectively). These plasmids and the λ N plasmid specifically complement viruses lacking the cognate N protein. The roles of the conserved residues in the three viral N-boxB structures have been found to be similar (9), and these residues cannot play a role in type specificity. To assess the mutability of P22 N residues and to reveal their structural roles, we constructed 13 single-substitution libraries, one for each nonconserved residue in the RNA-binding domain of P22 N (Fig. 1B). Each library contained an unbiased collection of 64 codons at the targeted position.
The mutability of P22 N residues ranges from invariant to unrestricted.
To estimate the mutability of each residue, plasmid libraries were transformed into P22 boxBleft and boxBright reporter cells, and their abilities to support antitermination were visualized by inspection of colonies grown on solid media containing the chromogenic substrate X-Gal, which was converted into a blue pigment by the β-galactosidase reporter enzyme (Table 1). The proportion and intensity of blue colonies provided an indication of the tolerance of each residue to mutation. We estimated from screening wt P22 N and wt φ21 N clones (constructed with the same strategy used for the libraries) that oligonucleotide errors created nonfunctional clones in 20 to 40% of the transformants; therefore, completely mutable positions should produce 60 to 80% blue colonies. These data suggest that the mutability of P22 N RNA-binding domain residues ranges from invariant to unrestricted. Although large differences in the proportion of active clones in the libraries were apparent, no bias within any library was obvious for boxBleft and boxBright reporters, with the exception of His20X. Notably, we observed that when inactive colonies were disregarded, the distribution of activity varied between libraries; for the Asn14X library few colonies were as active as wild-type colonies, and for the His20X library about 5% of the total colonies had activity higher than the wild-type activity. For two basic residue libraries, Lys16X and Arg19X, there were very few active clones. Arg19X colonies appeared to be either white or as blue as wild-type P22 N colonies, whereas Lys16X colonies exhibited a range of activity. Several positions appeared to be very mutable; in particular, for Thr17, Glu21, Lys25, and Arg30, most library members appeared to be active. The mixed-library results suggest that most of the 247 possible single-substitution mutants of P22 N expressed by these libraries of the P22 N RNA-binding domain were functional.
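The expected proportion of blue colonies can be estimated from the codon composition of a fully randomized (NNN) position. The sketch below is a rough illustration under stated assumptions: all 64 codons are equally represented, the standard genetic code applies, and the 20 to 40% background of nonfunctional clones is independent of the codon. The function names are hypothetical.

```python
"""Sketch: expected fraction of active (blue) clones in an NNN library."""
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
ALL_AMINO_ACIDS = set(AMINO) - {"*"}


def codon_fraction(tolerated):
    """Fraction of the 64 codons that encode a tolerated residue."""
    hits = sum(1 for residue in CODON_TABLE.values() if residue in tolerated)
    return hits / len(CODON_TABLE)


def expected_blue(tolerated, error_rates=(0.4, 0.2)):
    """(low, high) expected blue fraction given the background error range."""
    fraction = codon_fraction(tolerated)
    return tuple(fraction * (1.0 - e) for e in error_rates)


# 13 randomized positions x 19 non-wild-type residues each.
print(13 * (len(ALL_AMINO_ACIDS) - 1))   # 247
# A position that tolerates only arginine (6 of 64 codons).
print(expected_blue({"R"}))
# A completely mutable position (61 sense codons of 64).
print(expected_blue(ALL_AMINO_ACIDS))
```

Because 3 of the 64 codons are stops, this estimate for a completely mutable position falls slightly below the 60 to 80% range quoted in the text, while a position restricted to arginine would be expected to yield well under 10% blue colonies.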
TABLE 1. Library antitermination frequencies
strain dependent on P22 N-nut antitermination. Clones able to complement P22 N– virus were sequenced, and plaque diameters were compared to those of wt P22 N (Fig. 3). Most libraries yielded active mutants; the only exception was Arg19X, for which only Arg codons were found. Wild-type residues were not isolated at Lys16, Thr17, and Glu29, presumably because wild-type codons are less active than mutant codons or are relatively rare among active clones.
FIG. 3. Summary of active single amino acid substitutions in P22 N. The wild-type sequence of the RNA-binding domain of P22 N is indicated on the left by using position numbers and single-letter codes. The numbers of wild-type residues observed among the total number sequenced for each library (wild type/total) are indicated in the middle. Conserved positions (Ala15, Arg18, Arg22, and Arg23) were not examined using mutagenesis. Single mutations supporting viral replication are indicated on the right, and the heights represent plaque diameters of 100%, 75%, 50%, and 25% and less. When wild-type sequences were recovered, they were grouped with the library of origin.
Only arginine codons were found in active clones from library Arg19X. The very low proportion and wild-type activity of positive colonies seen with P22 boxB reporters suggested that this position is immutable without a complete loss of activity. The immutability of Arg19 was expected in light of NMR data (5), which showed that this residue has an unambiguous and certainly critical role, binding to Gua7 of the noncanonical G:A pair and the phosphate backbone. Because we sequenced only five clones and could not rule out the possibility that lysine might function at this position, we constructed and tested the R19K mutant and found that it was inactive.
Surprisingly, we recovered only arginine codons from library Lys16X, presumably by inadvertently selecting only the most active colonies. The active arginine mutant and the low proportion of positive clones in this library suggest that a basic residue is required at this position. Without exhaustive screening, we cannot rule out the possibility that constructs with nonbasic substitutions function, although we predict that they have low activity. Previous mutagenesis data indicated that constructs with the equivalent λ glutamine residue at this position can function weakly (18).
The aliphatic chain of Arg24 appears in the NMR model as packing against His20 and the ribose of the extruded loop nucleotide Cyt9, possibly stabilizing the essential 3-out GNRA-like conformation of boxB (5). Franklin (18) has reported that a P22-λ N hybrid with alanine at this position has weak activity. The guanidinium group of Arg24 is poorly defined and has no clear role in the NMR model. Our finding that the R24L mutant functions, although weakly, supports the hypothesis that the hydrophobic interaction between the aliphatic chain of Arg24 and His20 and the ribose of Cyt9 is important, but it does not clarify the role of the guanidinium group. No phosphate appears to be within reach, and there are no data supporting an interaction with two nearby planar groups, Cyt9 and His20. We imagine that the guanidinium positive charge may increase nonspecific affinity, and we expect that a construct with a lysine substitution would be functional.
Asn14 mutants exhibit reduced activity. We found only small hydrophilic substitutions, suggesting that Asn14 has a specific role. Ile28 is defined well by NMR data, packing against the face of Cyt9, and mutagenesis data suggest that it is functionally important (18). We found that this residue can be mutated to similar residues capable of packing against Cyt9. P22 boxB can tolerate mutation of the extruded base at boxB position 9 with little loss of activity (11), and the role of Ile28 is most likely to recognize the 3-out GNRA-like loop conformation. Presumably, any residue with a hydrophobic face can play the same role.
In the NMR model His20 is described as part of a hydrophobic surface facing Cyt9 and possibly interacting with a backbone phosphate via a weak hydrogen bond. A previous mutagenesis study revealed no functional role for His20 (18). In contrast, we found only the aromatic residues His, Phe, and Trp in our screening analysis. Close examination of the NMR model reveals that the imidazole ring of His20 makes only partial contact with the aliphatic chain of Arg24 and the ribose of Cyt9 and that the majority of its aromatic ring faces an internal void (5). The NMR data provide no support for stacking against either of the two closest planar partners, Cyt9 and Arg24. When the colonies with the highest activity were chosen, only H20F and H20W mutants were found. The H20F substitution suggests that contact with the phosphate backbone by His20 is unimportant. The presence of only aromatic residues suggests strongly that His20 stacks against a planar partner. We made the H20Y construct and found that it produced half-size P22 N– plaques.
Few single mutants distinguish between boxBleft and boxBright. The unresolved roles of Asn14, His20, and Arg24 prompted us to consider whether these residues are more important in binding boxBright than in binding boxBleft. boxBleft and boxBright have identical loops but different base pairs at each end of their stems (Fig. 1A). Residues contacting these bases or subtly stabilizing a specific bound boxB conformation could create bias. We noted that the largest bias between boxBleft and boxBright was observed when the His20X library was screened, although sequencing colonies with high activity revealed the same collection of aromatic mutants for both screens. We selected mutants from each library for quantitative assays of the function on boxBleft and boxBright (Table 2).
TABLE 2. Antitermination activities of single substitutions
A library of basic domain hybrids reveals biased N proteins.
Previously, we observed that λ N activates P22 boxBleft reporters strongly and boxBright reporters weakly, demonstrating that bias is possible (11). Reasoning that a library of hybrid RNA-binding domains might contain more constructs with bias, we designed a library of N proteins in which almost all possible hybrids of P22, λ, and φ21 RNA-binding domains were fused to the λ N activation domain (Fig. 1B). So that the hybrid library could be synthesized with the available two-column oligonucleotide synthesizer, an arginine codon was substituted for λ N Gln15 (which aligns with P22 N Ala27). λ NQ15R has been reported to function in antitermination at wild-type levels (41).
Fewer than 600 of approximately 160,000 hybrid library transformants plated exhibited activity on boxBleft or boxBright reporters. Colonies active on boxBleft or boxBright were separately pooled and rescreened to examine differential activity on boxB reporters. Approximately one-half of the pool of transformants originally selected on boxBleft showed bias. In sharp contrast, less than 2% of the pool selected on boxBright showed bias. Repeated attempts to find clones with bias toward boxBright identified a few weakly biased sequences. Selected clones were sequenced and characterized to determine their antitermination activities and abilities to complement P22 N– virus (Table 3). Some clones were able to complement P22 N– virus only when N expression was induced. Considering the 14 unique sequences, bias for boxBleft was more common than bias for boxBright, for which only two sequences had significant bias. The sequences of hybrids able to support P22 replication provided additional support for some findings for the single-substitution libraries. Residues Asn14, Lys16, Arg19, and Ala27 were found only as P22 variants. Interestingly, hybrids with H20R and H20K mutations were active, although none of them were as active as the wild type. Arginine is not an aromatic amino acid, but its conjugated planar surface can engage in similar stacking interactions, and the aliphatic portion of lysine can pack against planar surfaces. R24A mutants are active, although not fully. Unexpectedly, an R24P mutant was found, although it was only weakly active. Analysis of these sequences did not yield much insight into the origin of bias between boxBleft binding and boxBright binding, possibly because of compensatory effects of multiple substitutions.
TABLE 3. Antitermination activity and P22 N– complementation of hybrids
λ N (18, 19) suggested that there is at least one residue that is compatible with both viruses at each position, although sometimes the function is reduced. Indeed, Franklin (18) described a few P22-λ N hybrids able to complement both λ and P22 viruses lacking N, albeit weakly. Unfortunately, no mutagenesis of the φ21 N-boxB interaction has been reported. Using complementation of P22 N–, λ N–, and φ21 N– viruses in plaque assays, we tested N proteins shown in Fig. 3 and Table 3, as well as NF3862 (18), a previously described, relaxed-specificity hybrid (Table 4). Only the results obtained with control proteins and with the sequences producing plaques with P22 N– and λ N– are described in Table 4. When induced, all N fusions, including RevN, complemented φ21 N–; thus, we do not believe that complementation of φ21 N– by induced N protein is related to N-boxB specificity. Increased N expression from plasmids by induction of the pTac promoter with IPTG is known to strongly reduce type specificity (22).
TABLE 4. Relaxed-specificity sequences
λ N– viruses without induction. Arg30 has no clear function in the P22 N-boxB NMR model and is relatively mutable (Fig. 3). The λ Trp18 equivalent plays a critical role in the λ N-boxB interaction, stacking onto the boxB loop, and this interaction has been implicated in recruiting host factor NusA in a specific binding mode to allow full λ antitermination. Concordantly, two of our hybrids with tryptophan at this position exhibited relaxed specificity, but this occurred only under induction conditions. The weak ability of NF3862 to complement P22 N– virus is not surprising, because it does not have two residues that have been found to be important for P22 N function, the basic residue Lys16 and the aromatic residue His20. NF3862 had only background antitermination on P22 boxBright and boxBleft reporters (data not shown). Likewise, the poor ability of NF3862 to complement λ N is explained by the absence of a residue equivalent to the critical λ Trp18 residue.
-φ21 hybrids are able to complement both P22 N– and λ N– viruses, suggesting that evolutionary transitions between these two distinct RNA-protein interactions are facile. No mutants were found to bridge the specificity between P22 and φ21 N.
P22 N structural roles informed by mutagenesis.
The mutability of Thr17, Glu21, Lys25, Leu26, Glu29, and Arg30, all of which face away from RNA, supports the conclusion that these residues have no specific structural role. Some mutations increase antitermination activity (Table 2), suggesting that they can modulate N-boxB affinity, if only indirectly. Despite the typically interchangeable properties of threonine and serine, Thr17 is considered a nonconserved residue in the N RNA-binding domains. This view is bolstered by the lack of any definable role for this position in any N-boxB complex, in contrast to the findings for the conserved Ala15, Arg18, Arg22, and Arg23 residues (9). Its high mutability indicates that conservation merely implies conservation of function.
The structural role of Arg19 is clearly defined by NMR data (Fig. 2A). A similar interaction has been reported for the HIV-1 Tat-TAR complex (6), where this residue is involved in the critical protein-RNA recognition interaction. We found that K16R has activity higher than that of the wild-type sequence with boxB reporters, supporting the specific contact suspected based on NMR results but not observed by NMR. Interestingly, examination of the NMR model suggested that the arginine mutant could contact Gua13 and the phosphate similar to Arg19 at Gua7, and this could explain the higher activity of K16R.
Although the role of Asn14 is not defined by NMR data (5), previous genetic (18) and biochemical (3) data indicated that this residue is preferred to aspartate by both P22 and λ boxBs. Interestingly, N14C exhibits a strong bias toward boxBleft (Table 2). The NMR model indicates that Asn14 is within reach of Gua2, which is replaced by Cyt2 in boxBright (Fig. 1A). This suggests that Asn14 has a discrete role, perhaps interacting with Gua2, and that the two boxBs may impose different constraints on N genetic drift. This study, which examined only the single-substitution mutants able to form plaques, would have missed strongly biased mutants that cannot form plaques. All hybrids, even the boxBright-biased mutant, contain Asn14, implying that if this position can cause bias, the available residues from λ and φ21, Asp2 and Thr12, do not cause bias (Table 3).
The role of His20 remains uncertain and intriguing. We found only planar residues at this position, but in one case a hybrid had a lysine residue (Fig. 3 and Table 3). The P22 N H20W mutant had activity that was two- and fourfold higher than the wild-type activity with boxB reporters (Table 2), strongly suggesting that there is a planar partner surface, yet no planar partner surface was apparent. We speculate that biophysical studies of the P22 N-boxBright complex would illuminate the role of His20. Indeed, the intrinsic fluorescence of tryptophan may allow the H20W mutant to be useful for biophysical studies of N-boxB interactions, such as the studies done for λ N-boxB (41).
Bias between boxBleft and boxBright.
The relative lack of bias between boxBleft and boxBright reporter activities observed suggests that P22 N protein binds the two boxBs very similarly. In contrast, λ N had very high activity with our P22 boxBleft reporter and minimal activity with our boxBright reporter (Table 2), most likely because λ N forces a λ boxB conformation on P22 boxBleft (1). Although there was no sequence pattern evident in hybrids that correlated with either boxBleft or boxBright reporter activities (Table 3), strong activity with boxBleft could arise from a hybrid N forcing a λ-like, 4-out GNRA-like loop conformation on P22 boxBleft. Bias would then result from P22 boxBright being unable to adopt a λ conformation, as a result of indirect thermodynamic effects of the C:C pair or stem differences, both of which have been implicated in λ N discrimination between P22 boxBleft and boxBright (11). Our recovery of very few hybrids biased toward P22 boxBright supports this hypothesis (Table 3).
Viral replication correlates to unbiased antitermination.
We also examined the relationship between boxBleft and boxBright antitermination activity and plaque size using a clear λ strain regulated by P22 N-nut antitermination. We plotted the plaque diameter of P22 N– virus complemented by the mutant N clones described in Tables 2 and 3 as a function of each clone's antitermination activity with boxBleft and boxBright reporters (Fig. 4). Although weakly antiterminating N mutants correlated with small or absent plaques, the reporter activity did not accurately predict plaque size. Importantly, all full-size plaques had low bias, suggesting that balance in leftward and rightward antitermination is important for viral fitness. The lack of a strong correlation between antitermination activity and plaque size most likely reflects the very different conditions used in the two assays. The reporter system measured reporter gene accumulation in saturated cultures grown with continuous induction at 30°C. In contrast, plaques were assessed using lawns grown without induction at 37°C. Nonetheless, full-size plaques were found only when boxBleft antitermination and boxBright antitermination were nearly balanced, which could have provided incremental selective pressure for boxBleft and boxBright to maintain similar binding affinities.
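The notion of bias used above can be made concrete with a small calculation. The following is a minimal sketch, not the analysis used in this study: it assumes that bias is summarized as the log2 ratio of boxBleft to boxBright activity, each expressed as a percentage of wild-type P22 N activity, and it uses an arbitrary twofold cutoff for "balanced." The function names, the cutoff, and the example values are all hypothetical.

```python
import math


def percent_of_wild_type(activity, wild_type_activity):
    """Express a reporter activity as a percentage of the wild-type value."""
    return 100.0 * activity / wild_type_activity


def bias_log2(left_pct, right_pct):
    """Return log2(left/right): 0 is balanced, >0 favors boxBleft, <0 favors boxBright."""
    return math.log2(left_pct / right_pct)


def is_balanced(left_pct, right_pct, max_fold=2.0):
    """True when the two activities differ by no more than max_fold (an assumed cutoff)."""
    return abs(bias_log2(left_pct, right_pct)) <= math.log2(max_fold)


# Hypothetical values for illustration only; these are not data from Tables 2 or 3.
examples = {
    "balanced_clone": (90.0, 80.0),
    "left_biased_clone": (120.0, 15.0),
}

for name, (left, right) in examples.items():
    print(name, round(bias_log2(left, right), 2), is_balanced(left, right))
```

A log ratio treats a fourfold excess toward either reporter symmetrically, which is convenient when asking whether full-size plaques cluster near zero bias.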
FIG. 4. P22 N– plaque size as a function of antitermination activity on P22 boxBleft and boxBright. Plaque sizes and antitermination activities of sequences shown in Tables 2 and 3 are plotted. Single substitutions are indicated by open circles, and hybrids are indicated by filled circles. The activities in boxBleft and boxBright reporter cells are expressed as percentages of the wild-type P22 N activity. The circle diameters are proportional and indicate plaque sizes of 100% (largest), 75%, 50%, 25%, and barely visible. Hybrid sequences unable to produce plaques (from Table 3 only) are indicated by plus signs.
, and φ21 N specificity is the result of complex effects of multiple mutations or the result of specific intolerance to nontype residues at critical positions. Our finding that there is a moderately active, single-substitution P22 N RNA-binding domain mutant and our finding that there are additional weakly active, relaxed-specificity hybrids suggest that there may be many hybrids with relaxed specificity (Table 4). Indeed, the P22 N R30W mutant shows that type specificity can be relaxed without a severe loss of function for either partner by a single substitution.
We found no mutant capable of bridging P22-φ21 N type specificity, although such mutants may exist. No mutagenesis study examining φ21 N has been described. φ21 N Tyr17 appears to contact φ21 boxB in an important interaction similar to the critical interaction of Arg19 of P22 N. Our data and the NMR model indicate that tyrosine should be unable to play the role of Arg19 in P22 N. Nonetheless, it is possible that P22 N Arg19 could play the role of Tyr17 in φ21 N or that compensatory effects of other mutations could create sequences that bridge type specificity.
Transiting type specificity.
The distinct, yet structurally similar P22, λ, and φ21 N-boxB interactions are doubtless related evolutionarily. What mutational paths through sequence space might connect these three distinct solutions to the problem of binding a protein to an RNA? Relaxed-specificity N or boxB mutants represent possible transitions between discrete recognition strategies. Our data suggest that there may be many relaxed-specificity hybrids between P22 and λ N. From a biophysical perspective, the induced-fit nature of RNA-protein interactions (35) allows plasticity, where the adopted conformation depends on context. Such conformational plasticity may be particularly common in arginine-rich peptide-RNA interactions (4). Induced fit suggests that there is binding of discrete conformations rather than a smooth continuum of specific recognition strategies. There is evidence that λ boxB samples discrete conformations, including the bound conformation, when it is unbound (27). Indeed, the results of a recent NMR study of HIV TAR RNA support the idea that proteins sometimes merely capture conformations that are being sampled by motional modes of RNAs (42).
Relaxed-specificity mutants are able to participate in multiple strategies and thereby may provide evolutionary routes between discrete modes of interaction without severe loss of fitness. The ability of N proteins to recognize boxBs with few base-specific contacts suggests that the type specificity of N is primarily the result of being unable to bind more than one boxB loop conformation. In contrast, type specificity in boxBs appears to be limited by the thermodynamics of assuming different bound conformations, with only indirect effects of sequence (1, 11). The ability of λ N to force P22 boxBleft to adopt a λ conformation illustrates one mechanism by which relaxed specificity can be achieved. The reciprocal mechanism of adopting both P22 and λ boxB loop conformations appears to be employed by relaxed-specificity boxBs (11). Relaxed-specificity sequences may be able to acquire specificity simply by mutations that discriminate against one target without affecting binding to another, as has been seen in HIV RRE (26).
Neutral and nearly neutral theories of evolution (28, 34) assert that incremental mutation paths connect distinct phenotypes without loss-of-fitness intermediates. The results of computational studies support the existence of such paths between discrete RNA secondary structures (17, 25), but our understanding of neutral paths between protein phenotypes is less advanced (14). In the case of N-boxB interactions, the coevolution of protein and RNA would expand the number of neutral paths. Recombination occurs between lambdoid phages (7, 8), directly sampling a more diverse sequence space than incremental mutation would sample and likely allowing access to otherwise inaccessible or genetically distant relaxed specificity sequences. Additionally, the enormous population of viruses may allow transient reduced-fitness intermediates to occur. Finally, the reduced type specificity that occurs due to increased expression of N (22) may allow many weakly active, relaxed-specificity mutants to be viable.
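The idea of an incremental path without loss-of-fitness intermediates can be illustrated with a toy search. The sketch below is purely illustrative and is not an analysis performed in this study: it assumes a hypothetical set of "viable" sequences, written as arbitrary three-letter strings rather than real N protein sequences, and uses breadth-first search to look for a chain of single substitutions that links two of them while never leaving the viable set.

```python
from collections import deque


def hamming(a, b):
    """Count the positions at which two equal-length sequences differ."""
    return sum(x != y for x, y in zip(a, b))


def neutral_path(start, goal, viable):
    """Breadth-first search for a shortest chain of single substitutions
    from start to goal in which every sequence belongs to `viable`.
    Returns the path as a list, or None if no such path exists."""
    if start not in viable or goal not in viable:
        return None
    previous = {start: None}  # maps each visited sequence to its predecessor
    queue = deque([start])
    while queue:
        current = queue.popleft()
        if current == goal:
            path = []
            while current is not None:
                path.append(current)
                current = previous[current]
            return path[::-1]
        for candidate in viable:
            if (candidate not in previous
                    and len(candidate) == len(current)
                    and hamming(current, candidate) == 1):
                previous[candidate] = current
                queue.append(candidate)
    return None


# Hypothetical toy sequences; the letters do not represent real residues.
viable = {"AAA", "AAB", "ABB", "BBB"}
print(neutral_path("AAA", "BBB", viable))
print(neutral_path("AAA", "BBB", viable - {"ABB"}))
```

In this toy setting, removing a single viable intermediate disconnects the two end points, whereas adding relaxed-specificity intermediates to the viable set would restore a path.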
We express our heartfelt thanks to Naomi Franklin for providing irreplaceable strains, to Kazuo Harada for hosting C.A.S. in his laboratory, and to the group of André Megarbané for excellent sequencing services. This work benefited from access to the Central Research Science Laboratory at the American University of Beirut.
Published ahead of print on 26 September 2008.
21 N peptide-boxB RNA complex. RNA 9:663-676.
. J. Bacteriol. 190:4263-4271.
genetic networks. J. Bacteriol. 189:298-304.
's N protein. Nucleic Acids Res. 17:5565-5577.
and P22. Mol. Microbiol. 52:815-822.
N protein are essential to antitermination of transcription, but their locale cannot compensate for boxB loop defects. J. Mol. Biol. 231:343-360.
, φ21 and P22. J. Mol. Biol. 181:85-91.
, φ21 and P22. J. Mol. Biol. 181:75-84.
, 21, and P22: loss of N protein specificity. J. Bacteriol. 171:2513-2522.
N antitermination activity. J. Biol. Chem. 280:32177-32183.
N peptide/boxB RNA complex: recognition of a GNRA fold by an arginine-rich motif. Cell 93:289-299.
. The structure of the N36 peptide-boxB RNA complex. Eur. J. Biochem. 267:2397-2408.
Copyright © 2009 by the American Society for Microbiology.