Patricia Tille,3 Keith E. Weaver,3 and Don B. Clewell1,4*
Department of Biologic and Materials Sciences, School of Dentistry,1 Department of Microbiology and Immunology, School of Medicine, The University of Michigan, Ann Arbor, Michigan 48109,4 Servicio de Microbiología, Hospital Universitario Marqués de Valdecilla, 39008 Santander, Cantabria, Spain,2 Department of Microbiology, University of South Dakota School of Medicine, Vermillion, South Dakota 570693
Received 14 April 2004/ Accepted 23 April 2004
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Nucleotide sequence data relating to plasmids from different incompatibility groups (e.g., pAD1, pAM373, pCF10, and pPD1) and responding to four different pheromones have shown that the regions associated with replication and maintenance are organized similarly; in all cases, this region is located adjacent to that involved in regulation of the pheromone response (9, 10). In the case of pAD1 the key determinants associated with plasmid maintenance are repA, repB, and repC (Fig. 1A). On the basis of sequence homology, repA is believed to encode the initiator of vegetative replication, whereas repB and repC most likely represent a partition system (50, 54). When a segment carrying these three determinants was cloned on an E. coli plasmid vector, it enabled the chimera to replicate in E. faecalis (52). Transposon insertion mutations within repA were unable to replicate in E. faecalis, whereas insertions within repB and repC affected stability and copy number. Two series of octanucleotide iterons are located between the divergently oriented repA and repB and, like iteron sequences associated with other plasmid replicons, are believed to play a role in replication and/or maintenance (17). The iterons are a series of 12 and 13 repeats separated by 78 nucleotides (52).
|
Genes encoding the RepA family of proteins noted here bear interesting, centrally located, directly repeated nucleotide sequences. For example, in the case of repA of pAD1 a 33-bp sequence (DR1) is repeated twice (only one mismatch) and separated by about 75 bp (see Fig. 1B). Similar repeats within the repN of pLS32, but involving different nucleotides, have been suggested to contain the oriV site of that plasmid (48).
pAD1 has two transfer (conjugation) origins, oriT1 and oriT2, that are located about 180° apart on the circular map (3, 11, 23). The oriT2 site is located adjacent to a relaxase (TraX) determinant and is believed to be the preferred site for conjugative transfer (22), whereas oriT1 operates several orders of magnitude less efficiently and is located within repA (3). In addition to the location of oriT1 with respect to repA, another interesting feature that seems to "associate" genes for vegetative replication and transfer functions is a phase variation phenomenon involving transfer functions. This involves the reversible switching on and off of conjugation genes by way of changes in the number of iterons (generally an increase in four iterons [32 nucleotides]) between repA and repB (32, 46). The mechanism by which this phase variation affects conjugation functions remains unknown.
The close physical association of sequences involved in both vegetative replication and regulation of conjugative transfer prompted us to further characterize the pAD1 region associated with plasmid replication. Here we present data showing that RepA is the only pAD1-encoded protein required to initiate replication and that oriV is located within a small segment of the repA determinant. RepA is shown to bind to small inverted repeat structures (i.e., IR-1) within oriV, and these structures are shown genetically to play an important role in replication. In addition, we identify an intriguing relationship between the internal structure of RepA protein and the recognition of oriV.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
. Introduction of plasmid DNA into bacterial cells was done by transformation as previously described (16, 31). Electrotransformation of E. faecalis was done as described by Flannagan and Clewell (20). Plasmid DNA was purified from E. coli by using established techniques described elsewhere (47). Isolation of plasmid DNA from E. faecalis was also as previously described (51). When necessary, DNA fragments were purified with silica gel as described by Boyle and Lew (6). Recombinant DNA methodology, as well as analyses of plasmid DNA by using restriction enzymes, agarose gel electrophoresis, and Southern hybridization, involved procedures described by Sambrook et al. (47). Restriction enzymes were purchased from Invitrogen, and reactions were carried out under the conditions recommended. PCR was performed with a Perkin-Elmer Cetus apparatus under conditions recommended by the manufacturer. Specific primers were purchased from Invitrogen, and Taq DNA polymerase was from Roche. PCR-generated fragments were purified by using QIAquick-spin columns (Qiagen). Ligations made use of T4 DNA ligase from New England Biolabs. Nucleotide sequence analyses were carried at the University of Michigan sequencing core facility or using the "fmol DNA Cycle Sequencing System" as specified by the manufacturer (Promega). Plasmid constructions. The vector pSU18bac represents a pSU18 (4) derivative in which the bacteriocin (bac) promoter (27) has been cloned as an EcoRI fragment. pSU18bac* contains a point mutation in the promoter 10 box, which results in the sequence CATAAT. From here the SalI/KpnI fragment that contains the bac promoter was subcloned into pAM434 (21), yielding pAM434b*.
Different segments of pAD1 included in the replication-maintenance region were amplified by PCR from template pAM714 (35) or pAM3314, pAM3316, and pAM3318 (3) by using the oligonucleotides indicated in Table 1 and cloned into pTAd via TA cloning. The corresponding clone containing repA was partially MfeI digested and filled with Klenow to obtain a frameshift mutation. From here, XbaI/HindIII fragments were subcloned into the E. faecalis suicide plasmid pAM88 (22), generating the plasmids pAM88A*, pAM88It, pAM88-3314, pAM88-3316, pAM88-3318, and pAM88oriV. The clones containing repA or repA
9 coding sequences in pAM434 were cloned in several steps. The fragments repA or repA
9 contained in pTAd were obtained by digestion with the restriction enzymes EagI/NruI, purified, and cloned into the BsaI and Eco47III sites of pSU18b*. The SalI/KpnI fragments, which contained genes behind the bac promoter, were subcloned into pAM434, generating the plasmids pAM434brepA and pAM434brepA
9. The repA derivatives with point mutations were picked up as "unexpected" variants noticed upon sequencing PCR products, as was the deletion relating to pAM330
9and pAM434brepA
9.
Fragments of DNA containing sequences from the repA and repB genes were amplified by PCR from pAM714 with the primer pairs ETrepA.1 and ETrepA.2, ETrepA.1 and ETrepA.3, ETrepA.4 and ETrepA.2, ASK60repA.1 and ASK60repA.2, or ASK60repB.1 and ASK60repB.2, respectively (Table 1); digested with BglII and HindIII (repA) or EagI and NruI (repA and repB); and cloned into the same sites of pET30a or pASK60, as indicated, to construct the plasmids pET30aRepA, pET30aRepA5', pET30RepA3', pASK60RepA, and pASK60RepB, respectively. In the expression vector pET30, RepA is under the control of the T7 promoter. In pASK60, RepA and RepB are under the control of the lac promoter.
Fragments of DNA containing the iteron sequences were amplified by PCR with the primers It1 and It2 or It3 and RepB with the plasmid pAM714 as a template and cloned into pTAd plasmid vector to construct the plasmids pTAdIt5' and pTAdIt3'. The complete It region was amplified by PCR with the primers RepA and RepB by using the plasmid pAM714 as a template and then cloned into pTAd plasmid vector to construct the plasmid pTAdIt. The putative oriV site was obtained on a digestion product of MfeI and RsaI and cloned into pBluescript to produce the plasmid pBlueScriptoriV. Plasmids pTAdIR1, pTAdIR1*, and pTAdItC were obtained by annealing the corresponding sense and antisense oligonucleotides (Table 1) and direct ligation into the EcoRI site of the pTAd cloning vector. All of the constructions were confirmed by DNA sequencing.
Stability and incompatibility assays.
Stability assays were performed as previously described by Wirth et al. (55). Incompatibility experiments were performed from single colonies of OG1X strains containing either plasmids pDAK246
E and pAM3314 or plasmids pDAK246
E and pAM401. Loss of pDAK246
E was monitored in the presence of selection for the other plasmid as previously described (52).
Protein purification. The His-tagged fusion proteins (RepA, RepA5', and RepA3') were purified from recombinant E. coli BL21(DE3) induced with 1 mM isopropyl-ß-D-thiogalactoside by using an Ni-agarose column as described in the manufacturer's instructions (Qiagen GmbH). The Strep-tagged fusion proteins (RepA and RepB) were purified from recombinant E. coli JM83 induced with 1 mM isopropyl-ß-D-thiogalactoside by using a streptavidin-immobilized column as described in the manufacturer's instructions (Boehringer Mannheim). All protein preparations used in DNA binding studies were at least 90% pure based on polyacrylamide gel electrophoresis estimates.
Protein analysis. Proteins were boiled in sample buffer containing sodium dodecyl sulfate and ß-mercaptoethanol and then subjected to sodium dodecyl sulfate-polyacrylamide gel electrophoresis on a 12% gel according to the method of Laemmli (42). Gels were stained with Coomassie brilliant blue R-250. Western blotting was performed as described elsewhere (47), and the His-tagged fusion protein (RepA) was detected by using polyclonal anti-His antibody (Santa Cruz Biotechnology), whereas the Strep-tagged fusion proteins (RepA and RepB) were detected by using the polyclonal anti-Strep antibody (Pierce) and the ECL Western blotting analysis system (Amersham Pharmacia Biotech).
DPAC assays. DNA-protein tag affinity chromatography was carried out under the conditions described by Fujimoto and Clewell (26). Restriction enzyme-digested DNA plasmid pAM2603 was extracted with phenol-chloroform and precipitated with ethanol. Then, 4 µl corresponding to ca. 2 µg of cleaved DNA in 10 mM Tris (pH 8.0) was used.
Preparation of DNA substrates.
Double-stranded DNA (dsDNA) containing iteron repeat fragments for binding assays were generated by PCR by using the plasmids pTAdIt5' and pTAdIt3' as templates. The primers used are indicated in Table 1. dsDNA containing oriV fragment was obtained by digestion from pBlueScriptoriV plasmid. The fragments were labeled with [
-32P]dATP (Amersham) included in the PCR or by filling by Klenow in the oriV-containing fragment. PCR products or digestion bands were separated in an agarose gel, and excised bands were eluted with a QiaQuick gel extraction kit (Qiagen). The DNA samples were loaded on an agarose gel for quantification. S1 digestion and boiling and/or denaturation assays were performed in order to demonstrate the nature of the DNA bands tested (24).
Gel mobility shift assays. Labeled DNA fragments (1 pmol) were incubated with either RepA (or its putative N- and C-terminal domains), RepB, or control (vector derived) protein fractions (0.1 and 0.5 µg) for 15 min at 30°C in a 20-µl volume containing 50 mM Tris (pH 7.5), 100 mM NaCl, 0.2 mM EDTA, 5% glycerol, 1 mM dithiothreitol, 1.5 µg of poly(dI-dC) DNA, and 0.7 µg of bovine serum albumin. After this incubation period, the binding reaction mixtures were placed on ice, loaded onto a 5% prerun polyacrylamide gel, and electrophoresed at room temperature in 0.5x Tris-borate-EDTA buffer. After electrophoresis, the gel was dried on Whatman 3MM paper and exposed to X-ray film at 70°C with an intensifying screen.
Construction of clones representing the mutated IR-1 sites. The 525-bp PCR fragment containing the pAD1 oriV site that included modifications in IR3 and IR4 was generated with primers M2 and ETrepA.2 and pAM88A* as a template DNA. The 528-bp PCR fragment also containing the mutated repeats IR3 and IR4 was generated by using M3 and ETrepA.1 as primers and pAM88A* as a template DNA. Both fragments were purified by using QIAquick spin columns (Qiagen), diluted 1:1,000, mixed, and used as a template for a new PCR with ETrepA.1 and ETrepA.2 primers. The resulting 1-kb band contained repA* with the repeats IR3 and IR4 mutated (IR3+4). The 588-bp PCR fragment containing the pAD1 oriV site, including the mutated repeats IR1 and IR2, was generated by using primers M4 and ETrepA.2 and pAM88A* as template DNA. The 465-bp PCR fragment also containing the mutated repeats IR1 and IR2 was generated by using M5 and ETrepA.1 as primers and pAM88A* as template DNA. Again, both fragments were purified (as described above), diluted 1:1,000, mixed, and used as a template for a new PCR by using ETrepA.1 and ETrepA.2 as primers. The resulting 1-kb band contained repA* with the repeats IR1 and IR2 mutated (IR1+2). The 588-bp PCR fragment containing the pAD1 oriV site that included the mutated repeats IR1 and IR2 was generated by using primers M4 and ETrepA.2 and band IR3+4 as template DNA. The 465-bp PCR fragment containing the mutated repeats IR1 and IR2 was generated by using M5 and EtrepA.1 as primers and band IR1+2 as template DNA. Both fragments were purified and diluted (as described above), mixed, and used as template for a new PCR with ETrepA.1 and ETrepA.2 primers. The resulting 1-kb band contained repA* with the repeats IR1, IR2, IR3, and IR4 mutated (IR1+2+3+4). The 501-bp PCR fragment containing the pAD1 oriV site that included the mutated repeat IR5 was generated by using primers M1s and ETrepA.2 and band IR1+2+3+4 as template DNA. The 563-bp PCR fragment containing the mutated repeats IR1, IR2, IR3, IR4, and IR5 was generated by using M1as and ETrepA.1 as primers and band IR1+2+3+4 as template DNA. Both fragments were purified, diluted, mixed (as described above), and used as a template for a new PCR with ETrepA.1 and ETrepA.2 as primers. The resulting 1-kb band contained repA* with all IR-1 repeats mutated (5xIR). The resulting DNA products were purified and ligated to pTAd, and the 1.1-kb XbaI-HindIII fragments were cloned into pAM88 obtaining pAM88A*-IR3+4, pAM88A*-IR1+2, pAM88A*-IR1+2+3+4, and pAM88A*-5xIR, respectively. All clones were confirmed by DNA sequencing.
| RESULTS |
|---|
|
|
|---|
3-kb segment of pAD1 carrying repA, repB, and repC and a series of iterons (52), the precise location of oriV was not determined. Although the presence of an array of iterons suggested involvement in replication initiation, recent reports of oriV sequences being located within determinants of repA-like sequences raised the possibility that the origin might be located within repA. Identification of an oriV sequence is generally based on its ability to facilitate replication when present on a plasmid that could otherwise not replicate, if we assume that appropriate replication factors (e.g., initiator protein) are provided in trans. To locate the oriV of pAD1, we cloned specific segments of DNA (either an internal region of repA or the iteron region located between the repA and repB determinants) into a plasmid (pAM88) that is incapable of autonomous replication in E. faecalis. pAM88 is an E. coli vector carrying a cat determinant able to express in E. faecalis (22) and therefore useful for testing replication. A 1.01-kb segment of pAD1 DNA carrying repA with a frameshift mutation and devoid of a ribosome-binding site was cloned in pAM88, and the resulting chimera, pAM88A* (Table 1 and Fig. 1), was introduced by electroporation into an E. faecalis JH2-2 host harboring a pAD1::pAD2 cointegrate derivative pAM330 (12). (The frameshift mutation in the cloned segment was to assure that there was no expression of RepA from the resulting chimera.) We utilized the cointegrate because it should not be totally dependent on the pAD1 replication origin; that is, it should be able to make use of the pAD2 origin for replication. Selection with chloramphenicol resulted in transformants at a frequency of 3.1 x 107 transformants per number of cells per µg of DNA (Table 2). In contrast, a pAM88 chimera designated pAM88It carrying the iteron sequences located between repA and repB (0.5 kb, Table 1 and Fig. 1), as well as the empty vector (pAM88), could not be maintained in (i.e., transform) JH2-2/pAM330 cells (Table 2). Homologous recombination was not involved in the process, as judged by an inability to detect PCR products with primers designed to amplify either of the joint regions that would be expected (Fig. 2A). In addition, restriction bands appropriate for separate (not recombined) plasmids were observed by agarose gel analysis (data not shown). The data imply that a replication origin (oriV) is located within repA and that, since RepA could not be produced by pAM88A* (i.e., because of the frameshift mutation), it responds to RepA supplied in trans from pAM330. Furthermore, the iterons are not part of the pAD1 oriV, since they are not essential for replication.
|
|
Of likely relevance to the DR-1 sequences is an observation made relating to the earlier-noted experiments involving E. faecalis JH2-2/pAM330 cells transformed with pAM88A*. In the course of examining these transformants we noticed that selected colonies fell into two categories. Approximately 10% of the colonies were "relatively large" while 90% were "small." Interestingly, when we generated PCR products by using primers flanking repA (primer sequences not located on pAM88A*; see Fig. 2A), we observed that the small colonies gave rise to a product of the expected size (1.5 kb), whereas the large colonies gave rise to a smaller product (1.4 kb). Sequence analysis of the 1.5-kb PCR fragment showed a wild-type repA sequence; however, the 1.4-kb fragment contained a 105-bp in-frame deletion within repA. The deletion removed the region between the two large direct repeats (DR-1), as indicated by the vertical arrows in Fig. 1B. The data indicate that in the large colonies pAM330 had undergone a recombination between the two 33-bp direct repeats (DR-1), leading to a deletion that resulted in the loss of 35 amino acids from an internal portion of RepA. The deletion (pAM330
9) did not affect the ability to facilitate replication of pAM88A*. Indeed, pAM88A* replication efficiency may be enhanced under the circumstances since the cells appeared to grow better (i.e., larger colonies) with the deletion. Furthermore, pAM88A* could transform JH2-2/pAM330
9 cells at a frequency almost 2 orders of magnitude higher than JH2-2/pAM330 cells (Table 2). A widely used shuttle plasmid pAM401 (55) was able to transform both strains equally well, suggesting that the deletion in pAM330
9 decreased incompatibility with pAM88A*, although at this point an elevated expression of the altered RepA also cannot be ruled out.
To determine whether the origin within repA on pAM88A* contributes to incompatibility against pAD1, the ability of a cloned origin fragment to displace a resident pAD1 replicon was tested. For this purpose we used pDAK246
E (53) as the resident plasmid and pAM3314 (3) as the pAD1 oriV-containing plasmid. pDAK246
E is a pAD1 minireplicon (Table 1) encoding erythromycin resistance and deleted for the par-encoded postsegregational killing system (located downstream of repC). pAM3314 is a pAM401 clone containing an internal portion (the 3314 segment; Fig. 1A) of repA including the entire putative oriV. (The E. coli-E. faecalis shuttle vector pAM401 replicates independently of pAD1.) Under selection for the pAM3314-encoded chloramphenicol resistance, pDAK246
E was consistently lost from
80% of cells within 40 generations. In the presence of the empty vector (pAM401), no loss of pDAK246
E was observed (the two plasmids are compatible); this is consistent with the view that oriV acts as an incompatibility determinant.
A reasonable interpretation of the above data is that oriV is located, at least in part, in the region between the 33-bp direct repeats (DR-1) in repA and that deletion of this segment eliminated, or greatly reduced, competition between the two plasmids. To further explore this notion, we cloned into pAM88 a 173-bp restriction fragment (MfeI/RsaI) from within repA that contained both repeats (Fig. 1) and introduced it into JH2-2/pAM330; replication was assayed as described above. The chimera (pAM88oriV) was able to transform (selection on chloramphenicol) at a frequency of
107 (Table 2). (pAM88oriV DNA was not maintained in JH2-2 cells that did not harbor pAM330.) Plasmid isolation from several independently obtained transformants, restriction analyses, and Southern hybridizations (Fig. 2B and C) confirmed the presence of the intact plasmid (pAM88oriV) in JH2-2/pAM330. The data indicate that the cloned 173-bp DNA fragment present in pAM88oriV approximates the minimal segment required in cis to support pAD1 replication.
RepA is the only pAD1-encoded protein necessary for plasmid replication.
We have shown above that pAM330 supplies a "replication factor" able to facilitate in trans the establishment of a chimera carrying the oriV sequence (i.e., the RsaI/MfeI segment). To test the hypothesis that this factor was RepA, we attempted to clone an intact repA gene under a gram-positive bacteriocin promoter (27) in the E. coli vector pAM434b to determine whether autonomous replication occurred in E. faecalis (see Materials and Methods and Table 1). We were not able to clone the repA fragment without generating various mutations in repA; conceivably, expression of repA under these conditions was detrimental to E. coli. However, when we used a vector in which the promoter was altered via mutation in the 10 box (TATAAT changed to CATAAT), we recovered a clone with an intact repA. Assuming that the promoter still functions but is probably less active, we attempted to introduce this derivative, designated pAM434brepA, into E. faecalis JH2-2 cells. Erythromycin-resistant transformants were generated at a frequency of 5.1 x 105 (Table 2) and, as shown in Fig. 3, they contained the plasmid. A similarly generated chimera containing the repA
9 coding sequence (repA gene derived from pAM330
9) was constructed with the same altered 10 box, but this chimera, called pAM434brepA
9, was not able to transform E. faecalis cells (Table 2). The data indicate that repA alone comprises the pAD1 minimal replicon and the repA product recognizes an oriV located within its own reading frame. The inability of the repA
9 DNA to facilitate replication, despite the apparent ability of RepA
9 to recognize an intact oriV (described in the previous section) is again consistent with oriV being located between the two DR-1 repeats in repA. For comparison, transformation values associated with the entire repABC region of pAD1 are also included in Table 2. pAM88ABC (carries the wild-type repABC region) transforms JH2-2 cells, as well as pAM434brepA; however, if a similarly generated clone contains the deletion (pAM88A
9BC), the transformation frequency is 2 orders of magnitude lower. This is consistent with the iteron repeats not being a component of the replication origin, since they are present in both plasmids. However, the low level of transformants that does appear with pAM88A
9BC may in some way relate to marginal replication enabled by stability functions provided by RepB, RepC, and perhaps the iterons.
|
The relative stability of pAM434brepA, its small size (
4.65 kb), and its multiple cloning site (see Fig. 3) make it a good candidate for an E. coli-Enterococcus shuttle plasmid. It appeared at least as stable, if not more stable, than the pAM401 (
10.4-kb) shuttle plasmid. After unselected growth in THB for 30 generations, the percentage of pAM434brepA-containing cells was 40% compared to 15% for pAM401-containing cells.
RepA binds to its own coding sequence. If RepA is the replication initiator, it should bind to the oriV within repA. To investigate such behavior, we purified RepA by using an E. coli system that expressed His tag or Strep tag fusions to RepA as described in Materials and Methods. In initial experiments using a previously reported DNA-protein tag affinity chromatography technique (26), we observed that purified Strep-tagged RepA bound specifically to a 0.9-kb DraI restriction fragment generated by cleavage of pAM2603 (which contains 7.9 kb of pAD1 that includes repA cloned in pBluescript [1; data not shown]). As anticipated, this DraI fragment included the region within repA believed to contain oriV (Fig. 1).
Since the
170-bp MfeI/RsaI DNA fragment within repA (Fig. 1A and 2B) was sufficient in cis to allow replication in vivo, we used this segment (designated oriV) in mobility shift assays to examine RepA binding. Comparisons were also made by using segments of DNA containing the set of 12 iterons located adjacent to repB, designated It3' (see Fig. 1A). As shown in Fig. 4A, purified His-tagged RepA was able to form complexes with the oriV-containing DNA fragment (lane 2) but did not show similar complexes in the case of the iteron It3' DNA (lanes 12 to 16) or unrelated DNA fragments, such as a 200-bp fragment containing the polylinker of pBlueScript (not shown). A protein preparation generated in the same way as for His-tagged RepA but with E. coli BL21 cells containing an empty expression vector (pET30a) showed no binding activity (not shown), implying that the RepA protein specifically binds to oriV DNA. Confirmation was obtained with competition experiments. As shown in Fig. 4A, the unlabeled oriV DNA fragment greatly reduced RepA interaction with the labeled oriV DNA (lanes 3 to 6), whereas cold It3' DNA did not compete (lanes 7 to 10), indicating sequence specificity in the binding of RepA. Experiments shown in Fig. 4B suggested this specificity was associated with the N-terminal domain of RepA (RepA5'; see Fig. 1A). Purified His-tagged RepA5' was able to retard the mobility of the oriV-containing DNA fragment (lane 2) but did not retard an iteron-containing fragment (It5'; lane 5). In contrast to the case observed with the intact ("wild-type") His-tagged RepA, the DNA-protein complexes remained in the well. Although the nature of these aggregates is currently unknown, the fact that a similar binding did not occur with the iteron-containing fragment (lane 5) or a purified C-terminal (His-tagged RepA3'; lane 3) preparation suggests that this interaction is specific.
|
IR-1 sequences in oriV are critical for replication. The mobility shift assays suggested the formation of at least five RepA/oriV complexes. Inspection of the oriV region reveals the octanucleotide sequence AGGGTATG, noted as IR-1 in Fig. 5A, present as five copies (two with one mismatch). Four of these represent two paired inverted repeat sequences. To determine whether this sequence was involved in RepA binding, two synthetic DNA fragments were prepared as described in Materials and Methods. The first one contains the sequence AATCAAGAGGGTATGAAAATCATACCCTGCCAAAA and corresponds to the region that includes the two central IR-1 repeats (representing sequences 3 and 4 in Fig. 5A); it was cloned in pTAd yielding pTAdIR1. A 155-bp XbaI/HindIII fragment containing the sequence of note was then used in mobility shift experiments. A second chimera, pTAdIR1*, containing the same sequence but with four altered positions in each one of the IR-1 repeats (IR1*),AATCAAGAGCCTTTCAAAATGAAAGGCTGCCAAAA,was also generated in the same way and was designed to conserve both the dyad structure and the G+C percentage. As shown in Fig. 5B, RepA bound to the segment carrying the wild-type (IR1) sequence but did not bind to the DNA carrying the mutated (IR1*) sequence. An additional control DNA representing three iteron sequences cloned in the same way, giving rise to pTAdItC, also did not bind RepA. The data suggest that the IR1 sequence is part of the oriV site to which RepA binds. However, we observed that an excess of unlabeled IR1 DNA did not compete with RepA binding to the larger oriV fragment containing all five IR1 sequences nor did IR1* (Fig. 5C). This suggests that cooperativity or conformation dictated by the presence of more than two IR-1 sequences may play an important role in RepA binding to oriV.
|
9 by electroporation with pAM88A* and pAM401 serving as positive controls. (In contrast to the strain harboring pAM330, this strain was shown above to be much more efficiently transformable by pAM88A*.) The transformation frequencies are shown in Table 3. PCR experiments were done to confirm the presence of autonomously replicating plasmids as per the experiments shown in Fig. 2. Only pAM88A*-IR1+2 and pAM88A*-IR3+4 could be observed as independent plasmids; whereas, transformants deriving from pAM88A*-IR1+2+3+4 and pAM88*-5xIR were the results of cointegration of pAM330
9 and the pAM88A* derivative. The much higher transformation frequency exhibited by pAM88A*-IR3+4 (similar to that of the wild-type pAM88A*) compared to pAM88A*-IR1+2 (reduced by 2 orders of magnitude) suggests a more important cis-acting role in replication for the repeats related to the latter derivative. However, substitution of both pairs of IR-1 sequences resulted in complete loss of ability to replicate autonomously. The data are consistent with the in vitro binding studies and show a dependence on IR-1 sequences in cis for replication.
|
| DISCUSSION |
|---|
|
|
|---|
Of additional significance was our observation that a spontaneous recombinational event between the two 33-bp DR-1 repeats within repA resulted in an in-frame deletion giving rise to a protein (RepA
9) that, despite the absence of 35 centrally located amino acids, remained able to facilitate (in trans) replication of DNA (plasmid suicide vector) containing an intact oriV. The sequence deleted in repA
9 contained a significant portion of the oriV sequence; thus, it was not surprising to find that, in contrast to the above-noted pAM434brepA that was able to replicate stably in E. faecalis, a variant containing the repA
9 sequence (pAM434brepA
9) was not able to generate transformants. When the same deletion (repA
9) was present, together with repB, repC, and the iterons (i.e., in the case of pAM88A
9BC), however, some degree of replication occurred, presumably owing to stabilizing effects conferred by RepB, RepC, and possibly the iterons, on a remaining portion of oriV. (An analysis of the RepB and RepC roles will be reported elsewhere.)
Our earlier analyses of the pAD1 replicon suggested that RepA was likely to bind to the extensive iterons upstream of the repA coding sequence (32, 52). However, the data of the current study show that RepA targets DNA outside of the iterons. Our in vitro studies showed that RepA (but not a protein [RepB] suspected of functioning in stable inheritance [52]) bound at multiple sites within oriV (inside the repA coding sequence) and indirectly suggested recognition of the five IR-1 sequences. Indeed, an artificially constructed fragment containing one pair of IR-1 sequences bound to RepA, whereas a similar segment containing alterations in the IR-1 did not bind. Subsequent genetic analyses with plasmid constructs with altered IR-1 pairs supported the notion that these sequences play an important cis-acting role in plasmid replication. The likelihood that some degree of cooperativity is involved in RepA binding was apparent from the inability of a DNA fragment containing two copies (inverted) of IR-1 to compete with the
170-bp oriV fragment which contained five copies of the repeat. The data would also be consistent with a preferential role for one of the IR-1 pairs, as was indeed suggested from the genetic analyses. Another explanation, also consistent with the in vivo data, could relate to recognition of a specific DNA structure in addition to the IR-1 sequences. We note that numerous attempts at DNase I footprinting experiments under different conditions (unpublished) were not able to identify a region occupied by RepA. Certain DNA-binding proteins that bind within the minor groove of B-type DNA are known to yield poor footprinting data (18); indeed, several replication initiator proteins have been shown to contact DNA via the minor groove (29, 57). Whether RepA is such a protein is not currently known. Our in vitro DNA retardation data showing binding to repeat sequences within repA is to our knowledge the first such evidence for this family of initiator proteins.
The initiation of replication has been widely studied in gram-negative bacteria, in which plasmids utilizing a theta mechanism frequently carry a series of iterons to which the initiator binds (14, 17, 33). A well-known exception is the R1 plasmid, in which the initiator binds to inverted repeat sequences (30). This is also the case for coliphage lambda (15). Interestingly, in the case of lambda phage, as well as several bacteriophages from gram-positive bacteria, oriV sequences are located within the coding sequence of their respective initiator proteins (44, 45, 57). These initiation proteins, however, are not related to those addressed in the present study, despite the fact that certain RepA homologues are associated with bacteriophages. In addition, certain linear plasmids from Streptomyces species have been shown to contain origins within a rep determinant (7).
Initiator proteins of a wide variety of plasmids create a localized melting in an AT-rich region close to their DNA-binding site in the origin (14, 17). It is noteworthy that in the case of pAD1 one of the IR-1 sequences within oriV believed to bind RepA actually overlaps a highly GC-rich sequence (CCCACCCCCC) that only appears once in the entire plasmid and resembles a transfer origin "nick site" of IncF-like plasmids (gram negative) (3). Interestingly, this site is immediately adjacent to a highly AT-rich sequence (AAAGAAAAACAAA), but whether or not this junction of high and low GC content plays a key role in initiation remains to be determined. In addition to binding specifically and facilitating melting within the replication origin, the ability of RepA to strongly bind nonspecifically to ssDNA suggests a possible role in stabilizing a "melted" conformation important in assembly of the replisome. Such a process has been proposed with respect to the E. faecalis plasmid pAMß1 initiator protein RepE, which has also been shown to bind to ssDNA (43).
RepA of pAD1 is member of a recently described family of replication proteins initially found encoded by the B. natto plasmid pLS32 (48). Generally associated with a theta-type replication mechanism, these "Rep proteins" are encoded by plasmids in gram-positive genera, including Enterococcus, Lactococcus, Lactobacillus, Bacillus, and Staphylococcus and are also associated with the genomes of a number of bacteriophages from Streptococcus spp. (5, 19, 28, 41, 48). Only one sequence in the family relates to a gram-negative bacterium (Fusobacterium nucleatum [EAA24086]). Another member of this family, interferon response binding factor 1 (IREBF-1), appears to have a nonbacterial origin (56); however, Berg et al. (5) have suggested that IREBF-1 may actually represent a contaminant of bacterial origin that was present in the mouse cDNA library.
RepA (pAD1) consists of 336 amino acids, and most members of the related family are similar in size. There are no recognizable motifs such as those representative of ATPase, helicase, or specific HTH-binding domains. There is strong conservation of a number of residues in the N-terminal region, and five amino acidsY41, D58, L90, L95, and Y116are absolutely conserved in all of 38 RepA homologues compared (see Fig. 6). These conserved amino acids may relate to a key function and/or represent parts of an active center. (It is worth noting that the spontaneous mutations generated when we attempted to clone repA under the wild-type bacteriocin promoter involved this 5' region [data not shown].) It has been suggested (56) that this region contains the DNA-binding domain, and binding experiments carried out with the N-terminal domain of RepA were in agreement with this notion. The central region of the RepA homologues shows a conserved presence and organization of repeats, whereas the corresponding amino acid sequences are highly variable (Fig. 6). This is true even among the E. faecalis pheromone-responding plasmids. In the C-terminal part of the RepA family proteins, at least three subgroups are distinguishable, with a number of amino acids uniquely conserved in each group (Fig. 6). These subgroups (Enterococcus, Staphylococcus, and Lactococcus or Lactobacillus) conceivably reflect the different hosts in which the plasmids were originally isolated and may relate to specific functions shared by closely related bacterial species. Although direct evidence for this is not yet available, comparison of the amino acid sequences of different initiator proteins is suggestive of such a possibility. For example, the corresponding Rep proteins from the E. faecalis plasmids pAD1 (A47092) and pCF10 (A53309) exhibit 43% identity in the N-terminal region and 80% in the C-terminal region; whereas, in contrast, a "RepA" protein from E. faecium (ZP_00037682) exhibited 72% identity to that of pAD1 in the N terminus but only 30% identity in the C terminus. Importantly, the greater identity observed in the N-terminal domains of the latter two proteins also correlated with a high conservation in DNA sequence of the direct repeats in the central (oriV) region, which are 92% identical (only 17 differences over 155 bp, and 16 of the 17 differences are outside the DR-1 and IR-1 repeats), pointing again to a DNA-binding function for the N-terminal domain. One might expect that highly conserved DNA-binding domains would share an affinity for DNA sites that are highly homologous; indeed, one can easily envision these two features evolving together. It has been suggested previously (28, 48) for several plasmids (e.g., pLS32 and pSX267), albeit without direct evidence, that the centrally located repeats corresponds to part of the replication origin. Our in vitro and in vivo data, however, represent strong evidence that this is indeed the case for pAD1 and probably the related E. faecium system noted above.
|
| ACKNOWLEDGMENTS |
|---|
We thank all members of our laboratories for helpful discussions.
| FOOTNOTES |
|---|
Present address: Department of Microbiology, Gunma University School of Medicine, Maebashi, Gunma, Japan. ![]()
| REFERENCES |
|---|
|
|
|---|