Previous Article | Next Article ![]()
Journal of Bacteriology, July 2006, p. 4681-4689, Vol. 188, No. 13
0021-9193/06/$08.00+0 doi:10.1128/JB.00332-06
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Maria Carmela Latella,1,
Mariassunta Casalino,2
Mauro Nicoletti,3
Stefano Michienzi,2 and
Bianca Colonna1*
Dipartimento Biologia Cellulare e dello Sviluppo, Università La Sapienza, 00185 Rome, Italy,1 Dipartimento Biologia, Università Roma 3, 00146 Rome, Italy,2 Dipartimento Scienze Biomediche, Università G. D'Annunzio, 66100 Chieti, Italy3
Received 7 March 2006/ Accepted 10 April 2006
|
|
|---|
|
|
|---|
Several mobile DNA elements of both prokaryotic and eukaryotic origin have been observed as circular intermediates after excision (23). The molecular mechanism underlying the transposition strategy has been extensively studied using two members of the IS3 family as models, IS911 (10, 36) and IS2 (19, 20). The formation of an IS minicircle has been shown to occur by a variation of the "cut-and-paste" transposition pathway (10, 20). This strategy involves generation of a "figure eight" intermediate resulting from Tpase-mediated cleavage of the IS 3' end and from transfer of this end to a site on the same DNA strand a few nucleotides outside the other IS end. The intermediate is then converted into an IS minicircle by replication and/or by host repair activities, thus reproducing the initial replicon carrying the parental IS. The junction formed by the abutted IS ends in the minicircle is a transpositionally hyperactive substrate, which can be readily nicked by the Tpase, giving rise to a linear IS element.
A peculiarity of ISs which transpose using a reactive junction is that the abutted IRs are always separated by a short DNA linker derived from nucleotides flanking the target end in the parental insertion (22, 23, 26, 29). Another feature of several transposable elements which undergo circularization is that the junction between the left IR (IRl) and the right IR (IRr) results in assembly of a junction promoter (Pjunc) capable of efficiently driving Tpase synthesis, as is the case in IS911 (36). Junction promoters are generally stronger than the indigenous IS promoters (PIRL), which are usually located partially within the IS sequence upstream of the Tpase gene, and their formation leads to a burst of Tpase synthesis that results in efficient integration of the IRr-IRl junction (27). In IS911, as in other IS elements, the formation of a minicircle by means of a short intervening nucleotide stretch (3 to 6 bp) creates a promoter by placing a 35 element present in the terminal IRr at an optimal distance from a 10 box located in the abutted IRl (20, 36). In other cases, including the IS1111 transposon family, when the ends of the IS element are brought together to form the circular intermediate, the interstitial sequence itself represents a 10 box, which is correctly positioned with respect to the 35 region located near the right end of the IS element (26, 29).
Although integration of transposable elements often seems to occur at random, sequence-specific insertions have been observed for several elements (9). Thus, Tn7 integrates at a high frequency into a single locus (attTn7) of the Escherichia coli chromosome (21). IS1630 seems to integrate preferentially into palindromic sequences that resemble transcription terminators (4), while IS621 and ISPpu10 transpose within the repetitive extragenic palindromic sequences of E. coli and Pseudomonas putida, respectively (7, 32). In some cases, like IS117 and IS1383, which are able to generate a circular intermediate, the target sequence exhibits homology to the interstitial junction sequence, indicating that insertion may involve a site-specific recombination process (2, 26).
Here we describe identification and functional characterization of a new insertion sequence, ISEc11, which exhibits high levels of homology with elements belonging to the IS1111 family (29). ISEc11 was isolated from enteroinvasive E. coli (EIEC) strain HN280, in which it resides both on the chromosome and on the large (260-kb) F-type virulence plasmid (pINV). We show here that ISEc11 is able to induce formation of circular intermediates, is transpositionally active, and has target specificity. Minicircle formation results in the assembly of a Pjunc containing a 10 box in the interstitial sequence. Moreover, we present data indicating that the plasmid copy of ISEc11 (ISEc11p) is a spontaneous variant containing an internal 39-bp duplication of the right end region. The presence of this duplication is responsible for generation of a much more efficient junction promoter upon circularization, suggesting that there are cis-acting elements in the 39-bp duplication, which facilitate the binding of RNA polymerase to the promoter.
|
|
|---|
ß-Galactosidase assays were performed with sodium dodecyl sulfate-chloroform-permeabilized cells grown in LB medium supplemented with ampicillin. ß-Galactosidase activity was determined as described by Miller (25), and the results were expressed as averages of three independent experiments.
DNA manipulations.
Extraction of total and plasmid DNA, restriction digestion, electrophoresis, and purification of DNA fragments were carried out as described previously (12, 31). Southern hybridization was performed using
-32P-labeled DNA probes essentially as described by Sambrook and Russell (34). Details concerning the primers used in this study are shown in Table S1 at http://w3.uniroma1.it/bcolonna/docs/jb06_tabs1.pdf. The IS probe was generated using oligonucleotides S1 and S2 to amplify an internal region of the ISEc11 tnpA sequence. While PCRs were routinely performed using Taq polymerase, amplicons for cloning were obtained with high-fidelity Pwo polymerase.
Plasmid construction. Plasmids used in this study are listed in Table 1. Plasmid pGEc11, containing the ISEc11 copy inserted into the yjgZ-yjzY chromosomal locus, and plasmid pGEc11p, containing the ISEc11 plasmid variant (ISEc11p), were generated by cloning the amplicons obtained with primers CS1 and CS2 or with primers PS1 and PIS2 from genomic DNA of EIEC strain HN280 into pGEM-T (Promega). Plasmids pGJC1 and pGJP3, containing the IRl-IRr junction sequences of ISEc11 and ISEc11p, were generated by cloning into pGEM-T fragments obtained by amplifying HN280 DNA with primers PC1 and PC2 in the opposite orientations and corresponding to ISEc11 sequence positions 1062 to 1079 and 715 to 699, respectively.
|
View this table: [in a new window] |
TABLE 1. Plasmids used in this study
|
All plasmids containing the TnpA-LacZ fusion were constructed by cloning PCR-generated fragments of the tnpA regulatory regions into the multiple cloning site of the 'lacZYA translational fusion vector pRS414. Plasmids pRINc1 and pRINp2, containing the regulatory region of the Tpase gene of the ISEc11 and ISEc11p linear copies (PIRL), were obtained using pGEc11 and pGEc11p, respectively, as the templates and primers FL2 and FL3 as the forward and reverse primers, respectively. Plasmids pRJUc1 and pRJUp3 were obtained by amplifying fragments containing the entire IRr-IRl ISEc11 and ISEc11p junctions, using pGJC1 andpGJP3, respectively, as the templates and primers FL1 and FL3 as the forward and reverse primers, respectively. The FL1 and FL2 primers contain an EcoRI site, while reverse primer FL3 contains a BamHI site. The fragments obtained (PIRL [428 bp] derived from pGEc11 or pGEc11p; PjuncC [668 bp] derived from pGJUc1; and PjuncP [707 bp] derived from pGJUp3 [see Fig. 3A]), which contained the N-terminal portion of the tnpA gene, were fused to the 'lacZ gene of the EcoRI-BamHI-linearized pRS414 vector.
![]() View larger version (22K): [in a new window] |
FIG. 3. Analysis of the strength of the Tpase promoters of ISEc11 and ISEc11p. (A) The fragments containing the tnpA promoter of ISEc11 or ISEc11p in the circular form (PjuncC andPjuncP, respectively) or in the linear form (PIRL) were obtained by PCR amplification with oligonucleotides FL1 and FL3 or with oligonucleotides FL2 and FL3. The PIRL-containing regions of ISEc11 and ISEc11p are identical. All fragments extend up to Tpase amino acid Asp92 (TnpA'). (B) ß-Galactosidase activity of the TnpA-LacZ translational fusions obtained by cloning the fragments described in panel A into vector pRS414. Activity is expressed in Miller units, and the values are averages of three independent experiments (PjuncP, 3,170 ± 105 Miller units; PjuncC, 109 ± 8 Miller units; PIRL, 5 ± 1 Miller units). (C) The affinity of RNA polymerase (RNApol) for PjuncC and PjuncP was quantified by real-time PCR as described in Materials and Methods. The fragments containing PjuncC and PjuncP described in panel A were cloned into the pKK232-8 vector upstream of a cat reporter gene. Transcripts were generated in vitro in a 60-µl reaction mixture using different amounts of RNA polymerase (0.1, 0.25, 0.5, and 1 U). The PjuncP/PjuncC affinity ratios are the averages for three independent assays.
|
The sequences of PCR-generated fragments were checked by the dideoxy chain termination method (34).
Transposon mobility. Transposition of ISEc11 and ISEc11p was tested using the pTRIS0 plasmid. pTRIS0 is an entrapment vector containing the ISEc11 insertion consensus (5'-GTGAAAATACTG-3') upstream of a cat reporter gene (Apr Cms) which is silent because of an incomplete 10 box and because of the absence of a 35 box (Table 1). Insertion of ISEc11 or ISEc11p into the pTRIS0 target site (giving rise to plasmids pTRIS11 and pTRIS11p, respectively) was monitored by PCR analysis using primers PC1 and TR3 with total DNA from 120 HN280 Apr Cmr colonies. Since chloramphenicol resistance occurs only if the IS element is inserted into its target site, the efficiency of transposition was calculated by averaging the results of three independent experiments and was expressed as the Cmr/Apr ratio.
Primer extension.
Total RNA from strains DH10b(pRJUc1) and DH10b(pRJUp3) grown to an optical density at 600 nm of 0.6 was extracted by using a modification of the hot phenol method and was quantified spectrophotometrically (31). Primer PE1 was 5' end labeled with [
-32P]dATP using T4 polynucleotide kinase and was hybridized with 50 µg of total RNA. Reverse transcription experiments were carried out at 42°C using cMaster RT (Eppendorf) according to the manufacturer's instructions. The resulting cDNAs were analyzed on denaturing 6% polyacrylamide gels, along with a sequencing ladder that was generated by using the same primer and pRJUp3 as the template. Sequencing reactions were performed with a T7 DNA sequencing kit (US Biochemicals) and [
-32P]dATP. Primer extension experiments with transcripts synthesized in vitro (see below) from pRJUc1 or pRJUp3 were performed under the same conditions.
Quantitative PCR.
Plasmids pKJEc11 and pKJEc11p, derived from pKK232-8, were used as templates in in vitro transcription assays, which were performed at 37°C in 60-µl portions of buffer containing 25 mM Tris-HCl (pH 7.9), 0.1 mM dithiothreitol, 10 mM MgCl2, 15 mM KCl, 1 mM EDTA, each ribonucleoside triphosphate at a concentration of 2.5 mM, and different amounts (0.1, 0.25, 0.5, and 1 U) of RNA polymerase (Epicenter). While 20 µl of each reaction mixture was stored without further treatment (aliquot 1), the remaining volume was digested with RNase-free RQ1 DNase (Promega). Twenty microliters of each digest was stored without further treatment (aliquot 2), and the remaining 20 µl (aliquot 3) was used as a template for a reverse transcriptase PCR (TaqMan reverse transcriptase reagents; Applied Biosystems). Aliquots 1, 2, and 3 were used as templates for quantitative PCR performed with an ABI PRISM 7300 detection system, using SYBR Green PCR Master Mix (Applied Biosystems) and primers RTC1 and RTC2, corresponding to sequences internal to the pKK232-8 cat gene. A variant of the 
CT method was used to calculate the final results. In particular, experiments were performed in triplicate, and the
CT value was obtained by subtracting the average cycle threshold (CT) value of aliquot 1 from the average CT value of aliquot 3. Calculation of 
CT involved subtracting pKJEc11-related
CT values from pKJEc11p-related
CT values. The average CT value for aliquot 2 was used to evaluate the efficiency of the RNase-free DNase treatment. Primer pair efficiency was examined by looking at how
CT varied with template dilution, according to the suggestions of Applied Biosystems.
Sequence analysis. DNA sequence data were compared to known nucleotide and protein sequences using the BLAST Server (National Center for Biotechnology Information, Bethesda, Md.) and the IS finder database (http://www-is.biotoul.fr/is.html). Searches for Tpase functional motifs were carried out with the CD-Search program (24). Computer predictions of intrinsic bending were obtained as previously described (12, 31).
Nucleotide sequence accession numbers. The IS element described in this paper was assigned a designation (ISEc11) by the IS finder database (http://www-is.biotoul.fr/is.html). The sequences of ISEc11 and its plasmid variant, ISEc11p, have been deposited in the GenBank database under accession numbers DQ361537 and DQ361536, respectively.
|
|
|---|
![]() View larger version (18K): [in a new window] |
FIG. 1. Structure of ISEc11 and its plasmid variant, ISEc11p. ISEc11 is a 1,443-bp IS element with a perfectly conserved 13-bp subterminal inverted repeat (arrowheads), flanked by 3-bp terminal sequences (gray shading). The left TS (5'-GTA-3') differs slightly from the right TS (5'-ATA-3'). The 4-bp sequence 5'-AAAT-3' between the abutted ends of ISEc11/ISEc11p in the circular forms is indicated by gray letters to the left of the left TS. The open arrow indicates the transposase gene (tnpA). ISEc11p is a 1,482-bp IS element that was originally identified on the pINV plasmid of EIEC strain HN280. It differs from ISEc11 by a 39-bp duplication at the right end of the element. The duplication (ISEc11 nucleotides 1405 to 1443) contains a copy of the right IR and a copy of the right TS, as well as an upstream stretch.
|
|
View this table: [in a new window] |
TABLE 2. Distinctive features of ISEc11/ISEc11p and other members of the IS1111 family
|
In silico analysis with the CD software (24) revealed that besides a DEDD motif in the N-terminal portion, the ISEc11 Tpase exhibits, between amino acids 186 and 284, homology with a structural motif of pfam02371 (transposase family 20). Furthermore, between residues 76 and 145, there is partial homology with a domain typical of pfam1548 (transposase family 9). It has recently been reported that these two motifs are also present in the Tpase of ISPpu10, a member of the IS110/IS492 family (32).
Target specificity of ISEc11. Southern hybridization analysis was performed to determine the exact number of copies of ISEc11 present in the EIEC strain HN280 genome. Since HN280 contains, besides the large virulence plasmid (pHN280), a large cryptic plasmid (pHN281) (Table 1) (8), we probed EcoRI digests of HN280 total DNA, as well as EcoRI digests of pHN280 and pHN281, with the ISEc11 transposase coding region. Interestingly, HN280 harbors five copies of ISEc11; four of them are located on the chromosome, and one is located on the pHN280 virulence plasmid. No hybridization was detected with pHN281 DNA.
To detect spreading of ISEc11 in EIEC strains and in the closely related pathogen S. flexneri, total DNAs from nine EIEC strains (HN280, HN11, HN13, HN280, HN300, 13-80, 6.81, 53638, and 4608) and from seven S. flexneri strains (SFZM43, SFZM46, SFZM49, SFZM50, SFZM53, M90T, and YSH6000), belonging to different serotypes and isolated in different geographic areas (5, 28), were digested with EcoRI and hybridized with the ISEc11 transposase probe. While the plasmid copy of ISEc11 is present on the same restriction fragment (5.1-kb EcoRI fragment) in all of the EIEC strains analyzed, there are differences in the number and the relative location of the chromosomal copies. In particular, while the five EIEC strains from eastern Africa (HN280, HN11, HN13, HN19, and HN300) harbor four ISEc11 copies in the same position, in the remaining four EIEC strains the number of ISEc11 chromosomal copies varies from none to four. In contrast to EIEC, none of the S. flexneri strains analyzed harbor regions homologous to the ISEc11 transposase-encoding gene (data not shown).
Alignment of the sequences adjacent to the ends of the HN280 ISEc11 copies revealed that the IS has a potential target site (5'-GTNAAAANANTG-3') consisting of a central stretch of four A residues bounded by a GTN trinucleotide on the 5' end and by an NANTG stretch on the 3' end. The asymmetric structure of the target site could account for the fact that all of the ISEc11 copies analyzed were found to be inserted in the same orientation. A few members of the IS1111 family are known to have preferred target sites into which they insert in only one orientation, and often the target sites are represented by the IRs of other transposable elements (26, 29).
ISEc11p, the virulence plasmid variant of ISEc11. Sequence analysis revealed that the single copy of ISEc11 carried by pHN280, which is 4.6 kb upstream of the ospC3 plasmid gene, contains a 39-bp additional sequence downstream of the right end of the element. This short DNA stretch is a duplication of the outermost 39 nucleotides of ISEc11 (from position 1405 to position 1443) and therefore contains another copy of the IR element (IRr) and the TSr sequence (Fig. 1). The resulting structure of this ISEc11 variant, which we designated ISEc11p (GenBank accession number DQ361536), could have originated from insertion of ISEc11 into itself, followed by a rearrangement-excision process that left only remnants of the inserted IS.
A PCR analysis was performed in order to check for the presence of ISEc11p in the pINVs of all EIEC strains used in this study (see Materials and Methods). To do this, we used primers PC1 and PIS2, corresponding to a tnpA internal sequence and a region downstream of the ISEc11p insertion site, respectively. In all pINVs an insertion appeared to have occurred at the same locus, either as ISEc11 or as ISEc11p. In particular, the variant form ISEc11p is present only in pINVs of strains HN11, HN13, HN19, and HN300, which were isolated (like HN280) during an epidemic outbreak in Somalia (5, 28). Since there is a high level of sequence homology and since the pINVs of Shigella and EIEC are interchangeable (17), using the ShiBASE database (www.ngc.ac.cn) (38), we verified in silico that ISEc11 or remnants of ISEc11 were present in Shigella spp. pINVs. This analysis revealed that the presence of an entire ISEc11 element in pINV is a hallmark of EIEC strains, while pINV plasmids of S. flexneri, Shigella dysenteriae, and Shigella boydi contain only a fragment corresponding to nucleotides 1 to 220 of the ISEc11 sequence. The S. dysenteriae pINV also harbors a larger fragment, containing internal ISEc11 sequences (nucleotides 275 to 698). The extremely high IS content of pINV plasmids (38) and the peculiar location of ISEc11 in the pINVs of EIEC strains (i.e., an IS-rich region) could have favored a series of recombination events that left only remnants of ISEc11.
ISEc11 and ISEc11p form circular intermediates. The sequence data strongly suggest that ISEc11 is one of the ISs which have the capacity to generate transpositionally active circular forms as a consequence of the IRl-IRr junction. Therefore, we looked for evidence of circular intermediates of ISEc11 and of its variant ISEc11p by performing a PCR analysis. Total DNA of EIEC strain HN280 was used as a template for a PCR with ISEc11 primers PC1 and PC2 oriented toward the ends of the element, so that an amplification product could occur only if the inverted repeats abutted each other as a consequence of circularization of the IS. An ca. 1.1-kb product was detected, confirming the ability of ISEc11 to generate circular forms. To ascertain whether the amplicons could have arisen from circular intermediates of ISEc11 or ISEc11p (or both), we separated the PCR products on 1.2% agarose gels. Two fragments were detected, and sequence analysis confirmed that they correspond to amplification of the circular intermediate forms of ISEc11 (1,097 bp) and ISEc11p (1,136 bp).
The two amplicons consist of the right and left ends of ISEc11 or ISEc11p joined by the abutted IRs. In particular, the right and left 13-bp IRs (IRl and IRr) were separated by a 10-bp sequence, IRr-ATAAAATGTA-IRl, that comprised the abutted terminal sequences of ISEc11 and ISEc11p (Fig. 2B). Considering the 3-bp terminal sequence (TSs) part of the IS, we concluded that they are separated by a 4-bp 5'-AAAT-3' spacer sequence. This tetranucleotide flanks each of the IS copies on both sides. Therefore, the interstitial DNA stretch between TSl and TSr at the junction is identical to the direct repeats flanking the element in its integrated form. We hypothesize that all of the 5'-AAAT-3' spacer could have been derived from either the left or right flanking sequence or could consist of 1, 2, or 3 bases from the left end and 3, 2, or 1 bases from the right end of the direct repeats generated by the IS upon insertion. A comparison of the 5'-AAAT-3' spacer to the target site (5'-GTGAAATACTG-3') (Fig. 3A) further suggested that integration of the circular IS form may have occurred by a site-specific recombination process in which the spacer tetranucleotide was used as the central recombination site.
![]() View larger version (23K): [in a new window] |
FIG. 2. Structural and functional analysis of the Pjunc promoters of ISEc11 and ISEc11p. (A) Primer extension analysis of the transcripts generated under control of the Pjunc promoters of ISEc11 (PjuncC) and ISEc11p (PjuncP). The autoradiograph is the result of a typical experiment performed with the PE1 primer and RNA extracted from strain MC4100 transformed with pRJc1 (PjuncC) or RJp3 (PjuncP). Lanes G, A, T, and C show the sequencing ladder generated with the same primer. The arrow indicates the end point of the extended product. (B) Sequence of the ISEc11 and ISEc11p minicircle junction. Inverted repeats are indicated by solid arrows and by open arrows. The spacer sequence acts like a 10 box (gray background) of the Pjunc promoter. The outwardly directed 35 box upstream of the right IR contributes to generation of the Pjunc promoter. The transcription start site (+1), mapped by primer extension (panel A), is located in the left IR. ISEc11p contains an additional 39-bp stretch, generated by duplication, and it also contains a second copy of the right IR (IRr*) (see the legend to Fig. 1).
|
To analyze the functionality of the predicted Pjunc promoters and to compare their activity with that of the native Tpase promoter, we introduced into pRS414, a vector suitable for creating LacZ translational fusions (Table 1), fragments containing the upstream region of the Tpase gene when ISEc11 and ISEc11p are in the linear or circular form (Fig. 3A). Assuming that the native Tpase promoter is very likely located around the IRl, we amplified the whole left IS region using oligonucleotides FL2 and FL3 and the DNA of pGEC11 or pGEC11p (Table 1) as the template. The amplicons, which contained the same nucleotide sequences, were cloned into EcoRI-BamHI-linearized pRS414 to obtain plasmids pRINc1 and pRINp2 (Table 1). The fragments carrying the Pjunc promoters, obtained by amplifying pGJC1 and pGJP3 (Table 1) with oligonucleotides FL2 and FL3, were cloned into pRS414, giving rise to plasmids pRJUc1 and pRJUp3, respectively (Table 1). All the pRS414 derivatives (pRINc1, pRINp2, pRJUc1, and pRJUp3) encode a Tpase-LacZ hybrid protein whose expression is controlled by either indigenous (PIRL) or junction (PjuncC from ISEc11, PjuncP from ISEc11p) promoters.
The results of ß-galactosidase assays (Fig. 3B) indicated that the level of expression of the TnpA-LacZ fusion under the control of PIRL (containing the regulatory region of the tnpA gene when ISEc11 and ISEc11p were integrated) was very low (5 ± 1 Miller units). This presumably reflects very low efficiency of the Tpase promoter and is in agreement with the known inefficiency of many indigenous Tpase promoters (27). In contrast, the presence of the junction sequences activated the expression of the Tpase-LacZ fusion in pRJC1 and pRJP3, confirming that the predicted Pjunc promoter is functional. Surprisingly, the ß-galactosidase activity under the control of the PjuncP promoter (pRJp3) was 30-fold higher (Fig. 3B) than the ß-galactosidase activity elicited by PjuncC (pRJc1).
To ascertain whether the difference was due to increased activation of PjuncP or to the use of an alternative promoter, we analyzed the transcription start position of the tnpA-lacZ mRNA. Total RNA extracted from exponentially growing cultures of strains carrying pRJc1 and pRJp3 was reverse transcribed using an end-labeled oligonucleotide primer (PE1) complementary to nucleotides 182 to 161 of ISEc11/ISEc11p. The results (Fig. 2A) show that there was only one tnpA transcript, starting downstream of the 10 box located in the interstitial sequence, at the G nucleotide (G10) located in the IRl element. This indicates that transcription of the Tpase gene is dependent on the Pjunc predicted in silico and rules out the possibility that there is activity of alternative promoters. Under the conditions used, the signal from pRJc1 was very low, as expected considering the large difference between PjuncC and PjuncP observed in the ß-galactosidase assays (Fig. 3B).
Comparative in vitro activity assays of PjuncC and PjuncP. The higher efficiency of PjuncP may well be explained on the basis of the upstream structural difference between the two promoters (i.e., the 39-bp duplication), which mediates either stronger binding of the RNA polymerase or better promoter accessibility for other positive regulatory factors. The latter possibility is unlikely since the higher PjuncP activity was also confirmed by primer extension assays (not shown) using in vitro transcripts originating from PjuncC (pRJUc1) and PjuncP (pRJUp3).
To determine whether the 39-bp duplication might mediate more efficient RNA polymerase binding to the promoter, we compared in vitro transcripts obtained under the control of PjuncC and PjuncP. Plasmids pKJEc11 and pKJEc11p, obtained by cloning fragments containing the Pjunc promoter of ISEc11 or ISEc11p (Fig. 3A) into pKK232-8 (Table 1) upstream of a cat reporter gene, were transcribed in vitro using different amounts of RNA polymerase. The transcripts were then quantified by real-time PCR in the presence of the SYBR Green dye. As shown in Fig. 3C, at a low RNA polymerase level (0.1 U) the PjuncP activity was about 20-fold higher than the activity of PjuncC, and the difference decreased with increasing RNA polymerase levels.
These in vitro observations strongly suggest that no other regulatory factors are involved in the enhanced transcriptional activity elicited by PjuncP and that, at limiting RNA polymerase concentrations, the transcriptional stimulation strictly depends on the presence of the 39-bp duplication. Therefore, in this region we looked for cis-acting elements known for their potential to stimulate promoter activity and enhance RNA polymerase binding, intrinsic curvature of the DNA molecule (30, 37), and the presence of UP elements (11, 33).
Intrinsically curved (bent) DNA molecules most frequently appear when recurrent short sequences of A residues occur in phase with the B-DNA helical repeat (37). A stimulatory effect of bent DNA on promoter activity has been demonstrated in a number of cases (30). It appears unlikely that DNA geometry might appreciably influence the efficiency of the Pjunc promoters of ISEc11/ISEc11p, since computer-generated bending predictions, obtained as described previously (31), indicated that the overall intrinsic curvature of a region encompassing the PjuncC and PjuncP promoters (from 300 bp upstream to 200 bp downstream of the transcription start) is weak and is not significantly affected by the presence of the 39-bp duplication (data not shown).
In many bacterial and phage promoters significant stimulation of the transcriptional activity has been shown to depend on the binding of the carboxy-terminal domain of the RNA polymerase
subunit to upstream sequences. These so-called UP elements are usually located between position 38 and position 59 with respect to the transcription start site and consist almost exclusively of A residues (11, 33). In terms of both nucleotide sequence and relative position, the presence of an A-rich stretch in the 39-bp duplication of PjuncP and its location (nucleotides 41 to 51) (Fig. 2B) correspond well to a potential role as an UP element.
Transposition of ISEc11 and ISEc11p into the target sequence. As previously shown, analysis of the sequences flanking the ISEc11 element revealed the presence of a preferential target site. To demonstrate transposition of ISEc11 and ISEc11p, we constructed a specific entrapment vector, pTRIS0 (Apr) (Table 1), carrying the target sequence (5'-GTGAAATACTG-3') and a cat reporter gene which cannot be expressed, since its promoter is incomplete (Fig. 4A) due to the lack of the 35 consensus and to replacement of the 10 box with the target sequence. Hence, the Cmr phenotype can occur only if the regulatory region is restored by supplying a properly spaced 35 box together with a functional 10 box. On the basis of the sequences of ISEc11 and ISEc11p, insertion of one of these ISs into the pTRIS0 target site is expected to supply a 35 region and to create a functional 10 box, thus inducing expression of the cat gene (Fig. 4B).
![]() View larger version (25K): [in a new window] |
FIG. 4. Structure of entrapment vector pTRIS0. (A) Plasmid pTRIS0 is a pRS414 derivative that was created to analyze the transposition of ISEc11 and ISEc11p. The 5'-GTGAAAATACTG-3' nucleotide stretch (gray background) corresponding to the target site in the yjgY-yjgZ locus was used as a substrate for ISEc11 or ISEc11p insertion. The target site was inserted upstream of a defective promoter of the vector-encoded cat gene. The cat gene was silenced due to the lack of a complete promoter region (+1 indicates the potential transcription start). (B) Integration of ISEc11 or ISEc11p created a functional promoter by supplying a 35 box, located just upstream of the right IR, and by completing a 10 box by insertion of right end terminal sequences. Integration of ISEc11 or ISEc11p restored expression of the cat gene.
|
|
|
|---|
ISEc11 is a new IS element that is widespread in enteroinvasive E. coli strains. We found that in these strains, which together with Shigella strains are the etiological agents of bacillary dysentery (15), the ISEc11 element is always present on the pINV plasmid and also is very frequently present on the chromosome. pINVs of Shigella and EIEC strains are large (220-kb to 260-kb) F-type plasmids that contain all the genes required for invasion and for intra- and intercellular spread, including their positive activators (8, 12, 31). Despite the fact that pINVs are functionally interchangeable between Shigella and EIEC, the presence of a complete ISEc11 on the pINV plasmids is limited to EIEC, whereas Shigella pINVs harbor only remnants of this IS element. While the pINVs of four of nine EIEC strains (4608, 53638, 13.80, and 6.81) (6) used in this study harbor a native copy of ISEc11, in the remaining five strains (HN280, HN11, HN13, HN19, and HN300) (28) the pINV-located ISEc11 is represented by a natural variant, ISEc11p. Both ISs are integrated into the same plasmid locus. Compared to ISEc11, ISEc11p contains a 39-bp duplication, probably generated by insertion of the IS element into itself, followed by an excision event leaving only part of the right end of the excised IS (Fig. 1). Indeed, the duplication contains a perfect copy of the rightmost 39 bp of ISEc11, including the IRr and TSr elements.
ISEc11 and ISEc11p are able to form minicircles and contain subterminal 13-bp IR elements flanked by slightly different (3 bp) terminal sequences (Fig. 1 and Table 2). Upon circularization both elements give rise to a functional Pjunc promoter consisting of a 10 box (5'-ATAAAA-3'), generated by joining of the TS through a 4-bp spacer (5'-AAAT-3'), and a 35 box (5'-TTGCAAA-3') located in the IRr element. The AAAT sequence between the abutted TS elements is derived from the tetranucleotide direct repeats flanking the IS ends in the integrated form (Fig. 1). Although upon circularization ISEc11 and ISEc11p form a junction promoter carrying identical 10 and 35 boxes, ß-galactosidase assays (Fig. 3B) revealed that the junction promoter of ISEc11p (PjuncP) is able to induce a 30-fold-higher level of TnpA-LacZ synthesis than the ISEc11 junction promoter (PjuncC) induces. The presence of only one starting point in primer extension experiments (Fig. 2A) rules out the possibility that the 39-bp duplication in PjuncP contains an alternative promoter.
To investigate the dependence of the enhanced PjuncP activity on trans-acting regulatory factors or on additional cis-acting elements, we analyzed in vitro transcripts generated by PjuncP and PjuncC. As shown in Fig. 3C, real-time PCR assays confirmed that PjuncP exhibits promoter activity that is up to about 20-fold greater than the promoter activity exhibited by PjuncC. Taking into account the fact that the difference is particularly significant at a low RNA polymerase concentration, it is likely that the presence of the 39-bp duplication strongly increases the affinity of PjuncP for the RNA polymerase. Recently, it has become increasingly evident that besides the classical 35 and 10 boxes, many promoters contain one or two recognition boxes for the carboxy-terminal domain of the RNA polymerase
subunit. These so-called UP elements are characterized by a high A content and are known to stimulate transcription up to 100-fold by facilitating the initial binding of RNA polymerase and the subsequent transcription initiation steps (11). In silico analysis revealed that the 39-bp duplication contains A stretches (from position 41 to position 51) which closely mimic the features of UP elements (Fig. 2B) and therefore could have the potential to act as transcription-stimulating cis elements.
Helically phased upstream A-rich regions have also been shown to play a critical role in altering the architecture of a promoter through creation of an intrinsically curved DNA structure that is potentially able to affect transcription (30, 37). Computer-based DNA curvature predictions (31) for a 500-bp region encompassing the junction promoters of ISEc11 and ISEc11p did not reveal significant overall curvature, irrespective of the presence of the 39-bp duplication (data not shown), making it unlikely that DNA bending contributes significantly to the efficiency of the Pjunc promoters of ISEc11 and ISEc11p.
The role of the 35 box (5'-TTGCAA-3') located near the IRr of ISEc11 is quite interesting in the case of the integrated IS. By means of transposition experiments we showed that ISEc11 and ISEc11p can transpose into a specific entrapment vector containing the target sequence. Insertion of these IS elements promotes the transcription of a reporter gene through the generation of a new promoter. This is determined both by the presence of an available 35 box in the IRr and by the creation of a new 10 region resulting from the addition of a short AT stretch to the target site (Fig. 4B). It is plausible that the promoter-promoting potential of ISEc11/ISEc11p might have been exploited during evolution to shape the transcription profiles of EIEC strains, enhancing their adaptability to new environmental conditions. By using in silico analysis we found that ISEc11 is homologous to putative IS elements present on the chromosome of pathogenic strains related to E. coli, like Shigella strains (38). In particular, Shigella sonnei harbors three complete copies of ISEc11 (one of them is interrupted by an IS1 element), while two complete copies are present in both S. dysenteriae and S. boydi. The presence of remnants of ISEc11 in the pINV plasmids of all Shigella spp. strains strongly suggests that this IS has played an active role in the evolution of pINVs. Such a strong association between an IS element and its host has been reported previously for the IS605 and IS607 elements and Helicobacter pylori, leading to the hypothesis that there is a dynamic coevolutionary process involving these ISs and their hosts (14, 16).
Several features indicate that ISEc11 and its variant, ISEc11p, belong to the IS1111 family; besides the presence of subterminal IRs and the homology of these IRs with those of other IS1111-type elements (Table 2), a relevant trait is the existence of a specific target site into which the IRs insert in a preferred orientation (26, 29). Another strong common feature is the presence of a Tpase lacking the classical DDE motif and exhibiting relevant similarities to the Piv protein, a site-specific invertase of Moraxella lacunata (18, 29). Recently, it has been shown that the catalytic domain of the Piv invertase contains a DEDD motif and that the four residues are required for inversion catalysis and for intramolecular recombination (3). Our in silico studies (Table 2) revealed that a DEDD motif is also present in the ISEc11/ISEc11p Tpase and that it is highly conserved (D9E52D89D92) in several members of the IS1111 family. The similarity of this Tpase to DNA recombinases belonging to the Piv/MooV family and the fact the 5'-AAAT-3' tetranucleotide that acts as a spacer between the IS ends in the circular form is also present on both sides of the element in its integrated form strongly suggest that integration could occur by site-specific recombination rather than by transposition.
The high number of transposable elements identified so far and the availability of in vitro transposition assays have highlighted the variety of transposition strategies (1, 10, 13, 23, 32). A transposition mechanism involving site-specific recombination at a 3-bp core site, located within the spacer separating the IS ends in the circular form, has been described for IS1383 and IS117, two other elements that form circular intermediates (2, 26). To explain the presence of a 4-bp direct duplication in ISEc11/ISEc11p, it is possible to envisage that the circular form is generated by a cut at the end of the target site, leaving short single-stranded sticky ends which could be joined together to form a circular intermediate. Following the circularization event integration into a new target site may occur around a central recombination site consisting of the homologous tetranucleotides located within the target site, as well as within the spacer separating the TSs in the circular form.
Integration at specific target sites and the sophisticated complexity of the regulatory networks underlying Tpase expression stress the importance of the compromise evolutionary strategies used by transposable elements to cope with conflicting requirements, like the management of a potentially damaging transposition activity and the maintenance of the functional integrity of the host genome. Although the evolutionary advantage conferred by the increased Tpase levels associated with the presence of the ISEc11p is not clear yet, the structure of PjuncP nicely exemplifies how a highly efficient promoter can be generated by a comparatively simple event, such as the joining of small DNA consensus elements at suitable reciprocal distances.
This work was supported by grants from MIUR (PRIN, Progetti Ateneo) to B.C., M.C., and M.N. and in part by the Foundation "Istituto Pasteur-Fondazione Cenci Bolognetti."
G.P. and M.C.L. contributed equally to this work. ![]()
|
|
|---|
subunit. Genes Dev. 13:2134-2147.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»