Previous Article | Next Article ![]()
Journal of Bacteriology, July 2008, p. 4959-4970, Vol. 190, No. 14
0021-9193/08/$08.00+0 doi:10.1128/JB.00229-08
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
School of Molecular and Microbial Biosciences, University of Sydney, Sydney, New South Wales, Australia
Received 15 February 2008/ Accepted 6 May 2008
|
|
|---|
|
|
|---|
Insertion sequences are the smallest autonomously mobile elements (generally <2.5 kb). The definition of what constitutes an IS remains relatively broad, and the features that unite them are simple organization, the capacity to insert at multiple sites in a target DNA molecule (degree of specificity varies), and a phenotypically cryptic nature (encoded proteins have no function other than mobility). Most characterized IS can be divided into about 20 families (33, 48). IS can exert impact directly, either through insertional inactivation of genes or influencing the expression of adjacent genes (1, 28, 47, 50). Of greater interest, however, are impacts that emerge through their interaction with other elements. Acting in concert, IS can further alter bacterial genomes by forming compound transposons which can excise, flip, or move, leading to gene deletion, inversion, or rearrangement. By interacting with other types of genetic elements, IS can have even broader consequences, and they are clearly involved in arranging genes into complex modular assemblies that have higher-order properties (48). Here we focus on their interaction with integrons.
The interaction between integrons and mobile elements underlies many questions regarding integron biology. Integrons were discovered through their role in the assembly of antibiotic resistance genes, and there are now approximately 100 distinct integron classes known (4). Five classes are known to confer antibiotic resistance phenotypes and typically occur as part of complex mosaic elements with transposons and plasmids. In stark contrast, the vast majority of integron classes occur on chromosomes and have no obvious phenotype. Since all integrons thus far experimentally tested have been shown to have similar functions (3, 12, 14, 15, 24, 26, 30), the implication is that there are strong biological consequences from interaction between integrons and mobile elements. This has led many workers to distinguish between mobile resistance integrons (MRIs) and chromosomal integrons (CIs). (For an exchange of opinions as to the significance of differences between integrons, see references 22 and 34.)
Interactions between integrons and IS elements are a likely forerunner to incorporation of either the integron or assemblies of gene cassettes into larger mosaic structures with higher-order biological properties (7, 38). They potentially explain the origin of MRI involvement in multiple antibiotic resistance and could also explain various aspects of the biology of CIs. Of the various IS reported in association with integrons, one of the most interesting is ISPa21 (37). Since this element is a member of an IS grouping (IS1111 section within the IS110 family) considered to typically show target site specificity, it raises the prospect of specific IS-integron interaction (36). We discovered another member of this group, ISPst6, in a Pseudomonas stutzeri strain. Interestingly, ISPst6 was within an integron that is a member of a family of CIs strongly associated with Pseudomonas species (26, 54). Such genus-associated CI families are predicted to have been continuously present and strongly influenced the evolution of their respective genera (18, 44). The P. stutzeri species complex is a well-studied model for bacterial evolutionary genetics, and numerous distinct genomovars have been described. These subtaxa are genomically distinct subgroups within the species complex with well-defined evolutionary relationship (2, 19, 29, 43). Consequently, sampling P. stutzeri genomovars provides a unique opportunity to explore the outcomes of interaction between chromosomal integrons and insertion sequences over evolutionary significant periods of time. We confirmed that ISPst6 and relatives specifically target cassette arrays and show that these interactions have influenced the fate of cassette arrays in pseudomonads. This direct interaction between distinct genetic entities is the result of convergent evolution between distinct recombinases for the same target site. We postulate this may represent a general principle contributing to assembly of more complex mosaic elements.
|
|
|---|
DNA manipulations. High-molecular-weight DNA was extracted from all strains by using a CTAB (cetyltrimethylammonium bromide)-phenol-chloroform method, with a preliminary lysozyme and RNase A treatment (46). Plasmid preparations were carried out according to a standard alkaline lysis protocol (46). Restriction enzymes were used according to the manufacturer's instructions (New England Biolabs). PCR amplifications were carried out by using Taq DNA polymerase (New England Biolabs) in 25-µl reactions, unless otherwise stated. Primer sequences are given in Table 1. DNA purification was carried out with Qiaquick PCR purification or gel extraction kits (Qiagen). Automated sequencing was carried out by the Australian Genome Research Facility (Brisbane, Australia) on an ABI 3730xl 96-capillary automated DNA sequencers using BigDye terminator chemistry.
|
View this table: [in a new window] |
TABLE 1. PCR primers
|
Promoter activity assays. Three sections of ISPst6 from ATCC 17595 were tested for promoter activity: the 336-bp ISPst6 junction region (this section contains 49 bp of the transposase gene and the 287 bp upstream in circular form, 72 bp from left-hand side [LHS] and 215 bp from right-hand side [RHS]; ST89 to ST123); a subsection of this section containing only PIRL (49 bp of transposase gene and 67 bp from LHS [ST123 to ST117]); and this same 116-bp region in the opposite direction, which would affect expression of downstream sequence (Pout). All PCR products were amplified in 200-µl reactions with Phusion polymerase (New England Biolabs) and ligated into the SmaI site of the promoter analysis vector pKK232-8 (6), upstream of the promoterless chloramphenicol acetyltransferase gene to create pUS84 (Pjunc+IRL), pUS90 (PIRL), and pUS92 (Pout). The E. coli lacZ gene promoter was similarly cloned into this vector in forward and reverse orientation as positive and negative controls (pUS85 and pUS86). The insert regions of all constructs were confirmed by sequencing. The chloramphenicol MIC was determined by serial twofold dilution in Mueller-Hinton agar (BD), following the guidelines of the National Committee for Clinical Laboratory Standards. Inocula of 104 CFU per spot were applied with an antibiotic sensitivity replicator. After 16 h of incubation at 37°C, the chloramphenicol MIC was defined as the lowest concentration that prevented the visible growth of colonies on the plate.
Transcript mapping. For pUS84, where more than one putative promoter was present within the construct, transcript start points were mapped by using the method of Lloyd et al. (31) with minor alterations to the reverse transcription (RT) reaction. Briefly, RNA was extracted from log-phase E. coli cells carrying these constructs by using Qiagen RNeasy mini-columns according to the manufacturer's standard protocol (Qiagen). Approximately 5 mg of RNA was used in RT-PCR with a FAM-labeled reverse primer situated in the chloramphenicol acetyltransferase gene (see reference 31 for the sequence) using 400 U of Superscript III (Invitrogen) and a 3-h incubation at 50°C. The products were treated with RNase H (New England Biolabs) for 20 min and then ethanol precipitated and resuspended in 2 µl of distilled water. The size of the fluorescently labeled RT products was directly determined by using an ABI Prism 377 GeneScan analyzer (Macquarie University Sequencing Facility, Sydney, Australia) and compared to the insert sequence to determine the transcript start point.
Southern hybridizations. PstI and PvuII digests of approximately 5 µg of genomic DNA of each strain were separated via electrophoresis in a 0.8% agarose gel, visualized, and then transferred to a nylon membrane (Hybond N+; Amersham) using capillary transfer under alkaline conditions. Probes for Southern hybridization were labeled with digoxigenin-6-dUTP by PCR according to the Roche protocol. All hybridizations were performed in DIG Easy-Hyb buffer (Roche) containing 25 ng of digoxigenin-labeled probe/ml, followed by incubation overnight at 55°C. Detection and visualization of hybridization products was performed by using a Roche DIG detection kit according to the manufacturer's instructions, with washing conditions of 68°C in 0.5x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-1% sodium dodecyl sulfate. Two IS probes were tested; the first consisted of 1,365 bp of the element amplified from ATCC 14405 genomic DNA with the ST87-ST117 primer pair. The second probe consisted of 194 bp from the 3' end of ISPst6 amplified from ATCC 14405 with the primers ST87 and ST88; this probe was designed to bind to only one fragment of ISPst6 digested with PvuII, so that the copy number could be estimated.
Construction of attC containing plasmid as an ISPst6 "trap." A 219-bp section of P. stutzeri ATCC 17587 (gv2), containing a typical Pseudomonas-type attC (also known as the PAR element) (54), with a small amount of flanking sequence was amplified by PCR (ST139 and ST96) and then cloned into pGEM-T Easy vector (Promega) according to the manufacturer's instructions to make pST001. This construct was transformed into electrocompetent E. coli Epi300 (Epicenter) cells, together with a large insert Fosmid clone carrying multiple copies of ISPst6 within ATCC 14405 array 1. Cells were plated out on LB-ampicillin-chloramphenicol agar. A small number of clones containing both Fosmid and pSGT001 were recovered; one was chosen for subsequent work. Cells were screened for transposition events via PCR with one primer in pGEM-T backbone and one in ISPst6 (pgemR-ST123). Once PCR products indicated transposition had occurred, dilutions were plated onto LB-ampicillin-chloramphenicol agar to screen individual colonies. After 1 week at 37°C followed by 3 weeks at 4°C, individual colonies gave PCR products spanning the IS-plasmid boundary; these products were sequenced confirming the presence of ISPst6 in pST001. PCR with primers spanning the other boundary (pgemF-ST148) did not give a product with either the broth or the single colony template.
Sequence analysis. The nucleotide sequence of the putative IS were used to search the National Centre for Biotechnology Information BLAST website (www.ncbi.nlm.nih.gov:80/BLAST) utilizing both BLASTN and BLASTX programs. In addition to searching the default NCBI database, searches were made of the env_nt and unfinished genome databases. The analysis function in the IS finder database (http://www-is.biotoul.fr/is.html) was used to search for characterized IS with homology to the ISPst6 element and its putative transposase. The ISPst6 sequence in integrated, and the circular form was screened for potential promoter elements by using the Neural Network Promoter Prediction service for prokaryotes (41), found on the Berkeley Drosophila Genome Project website (http://www.fruitfly.org/seq_tools/promoter.html), with a minimum score of 0.7 for the promoter prediction. Vector NTI (version 10.0, Invitrogen) was used to view and analyze sequence information.
Multiple sequence alignment was performed using the default settings of the AlignX module of the Vector NTI software package, which is based on the CLUSTAL W algorithm. Alignments were manually optimized in the GeneDoc alignment editor. Phylogenetic analyses were performed by using programs from the Phylip package through Biomanager. Trees constructed from protein alignments were based on a restricted alignment that removed positions affected by frameshifts in ISUnCu1 and ISAvX1. Trees constructed from nucleotide sequence alignments included these corresponding regions.
GenBank accession numbers. Sequences generated in the present study have been deposited in GenBank under accession numbers EF439838, EF648209, EF648210, EF648212, EF648211, EF648213, EF648215, EF648214, and EF648216. Other sequences cited here are given in Table 2.
|
View this table: [in a new window] |
TABLE 2. IS1111-attC subfamily elements
|
|
|
|---|
ISPst6 is active and has all of the typical features of the IS1111 family. IS1111 family elements are typically comprised of a single ORF encoding a transposase flanked by relatively long noncoding sequences, and bounded by conserved subterminal 12- to 13-bp inverted repeats (consensus, ATGGACgCcCCC). In ISPst6 the noncoding regions were 53 and 240 bp at the left and right ends, respectively. The inverted repeat boundaries were precisely determined to be 5'-GGCTGTAATGGACTCTCCC-3' (LHS) and 5'-GGGAGAGTCCATATA-3' (RHS). The transposase of ISPst6 does not show a particularly close relationship to IS1111A, the archetypical member of the family (only 35% amino acid identity), but the percent identity to another previously described member of the family, ISPa21 (37) was 84%. Notably, ISPst6 possesses the conserved D-E-D-D motif (D7 E51, D88, D91) that has been demonstrated to be essential for activity in the Piv/MooV family of DNA recombinases (8, 52) and is inferred to be essential for IS1111, IS110/IS492 family members (10, 36).
Previously studied members of the IS1111 family have been shown to move via circular intermediates (36, 39). We looked for similar ISPst6 minicircles by PCR and sequenced products of the expected size from two independent P. stutzeri strains containing ISPst6 (ATCC 17587 and ATCC 17595). In each case the recovered ISPst6 sequence contained the expected junction of 10 bp between the two inverted repeats and no additional sequence, a finding consistent with amplification from a circular intermediate (Fig. 1.). Interestingly, another typical IS1111 family feature was seen in this junction sequence: the formation of a putative sigma 70 promoter (36, 39). The region upstream of the transposase gene in the circular form contains two putative promoters, one formed in the junction region (Pjunc) and one entirely at the LHS, expected to drive the expression of the gene in the integrated form (PIRL). This section was recovered by PCR and tested for promoter activity in the promoter analysis vector pKK232-8. The pUS84 construct containing both putative transposase promoters showed significant chloramphenicol resistance (MIC = 128 mg/liter). Transcript start-point mapping indicated that the junction promoter Pjunc was responsible for the expression of the downstream gene in this construct (data not shown). A subsection containing only PIRL was also tested to determine whether expression in the circular and integrated forms is likely to be different for this element. Neither the PIRL-only construct (pUS90) nor another putative promoter POUT (pUS92) conferred significant chloramphenicol resistance.
![]() View larger version (14K): [in a new window] |
FIG. 1. Structural features of ISPst6 in integrated and circularized forms. (A) Schematic representation of a segment of an integron-assembled gene cassette array showing the inverted repeat and PIRL sequences for an inserted IS1111-attC element. Note that the IS inserts in reverse orientation and the displayed sequence are for the sense strand of the IS but written 3' 5' (i.e., as shown in panel B). Cassette-associated genes are represented by block arrows, and attC sites are represented by triangles. Components of the first gene cassette are colored gray, and components of the second gene cassette are colored black. The integron (not shown) is located upstream of the orfA gene cassette. (B) Linear diagram of ISPst6 as it occurs in cassette arrays. The location and orientation of the putative promoters PIRL and Pout, which were tested for activity in pUS90 and pUS92 constructs, are also shown. The sequence of the junction formed between IRR and IRL upon excision of ISPst6 as a circle is shown below. The recombination junction between the left and right termini (slash), the subterminal nature of the IR sequences (boldface letters with underline arrows), and the putative –35 and –10 sequences of the junction promoter (overlined) are all indicated. The start of the major transcript mapped in the pUS84 construct lies within the sequence CCCTG at the end of IRL.
|
![]() View larger version (36K): [in a new window] |
FIG. 2. Detection of multiple copies of ISPst6 in P. stutzeri genomovar 2 strains by Southern hybridization. PvuII digests of chromosomal DNA from genomovar 2 strains ATCC 17587, ATCC 17595, and ATCC 14405 are in lanes A, B, and C, respectively. The probe used was a 194-bp sequence from within the RH noncoding region of ISPst6. This probe is predicted to yield only one band per IS copy when DNA is digested with PvuII. A single very faint band was also seen in PvuII digests of the four other strains (see the text).
|
![]() View larger version (23K): [in a new window] |
FIG. 3. Dendrograms showing the relationship of the IS1111-attC group to other members of the IS1111 family. The numbers shown at the nodes are those from a bootstrap analysis of 100 replicates. Tree topologies with respect to the indicated groups were identical in parsimony and maximum-likelihood analyses. (A) Phylogeny based on the transposase sequence. All sequences associated with attC sites comprise a set (IS1111-attC group) within the strongly supported subgroup 1 (indicated by heavy lines). Note the predicted sequence of ISUnCu1 after correction for frameshifts was used. The tree was from an alignment of 308 amino acids. The tree was rooted using IS110 as an outgroup. (B) Phylogeny based on an alignment of 305 bp from the right hand noncoding region of the IS. All IS-like fragments associated with attC sites cluster with sequences that were within the IS1111-attC group in transposase trees. The tree was rooted by using ISMlo1as an outgroup.
|
The IS1111-attC subgroup specifically target gene cassette attC sites. Target specificity is thought to be a general feature of the IS1111 family (36). Although only ISPst6 and ISPa21 were observed in multiple sites, a good sample of target sites used by the IS1111-attC subgroup was obtained since collectively the 16 different IS1111-attC subgroup elements (or fragments thereof) were observed in at least 27 independent loci (Table 2). All intact elements were inserted within the recombination site of a gene cassette (attC) in inverse orientation with respect to the gene cassette array (Fig. 1). Our data set also included a number of partial sequences and degenerated sequences. In all cases where an IS boundary (IRR or IRL) was still identifiable the adjoining sequence was within an attC site. Two degenerated elements (ISPstX4a and ISPstX4b) had lost both IS boundaries. These elements were located within 300 bp of a cassette array and are likely to also derive from an ancestral element that inserted into an attC site. Significantly, a total of seven degenerated elements had lost the left boundary (IRL), and in all cases this sequence fragment marked the boundary of the cassette array. The adjoining sequence was a chromosomal framework gene. The insertion sites of the most closely related outgroups of the IS1111-attC group (ISHhaX, ISPpu11, and ISPnaX) were also examined. None of these were located in the proximity of integron features (intI homolog or gene cassette array).
The attC sites of gene cassettes are unambiguously recognizable by their imperfect inverted repeat (pseudo-palindrome) structure with two characteristic "flipped-out" positions (21, 26, 32, 49). They are however highly variable in length and sequence. Length variation (range, 57 to 145 bp) arises due to one or more insertions to the common pseudopalindromic backbone structure of 51 bp. Broad patterns in attC diversity can be recognized according to the location (left, central, right, none, etc.) and the sequence of these insertions. Of the 25 attC sequences harboring IS, 10 were only partial sequences. The 15 sites for which the complete sequence could be inferred include examples with left and central insertions of various length and sequence. We categorize them as six aadB-like sites (
60 bp, with a 9-bp insertion between positions s and S), seven Pseudomonas-like sites (
76 bp, with
18-bp insertion at m-n and an 8-bp insertion at s-S), one blaGES-like site (
110 bp with a 59-bp insertion at s-S), and one blaOXA-like site (111 bp with a 18-bp insertion at n-o and a 42-bp insertion at s-S). The pairwise divergence between two members of different subcategories (e.g., aadB-like and Pseudomonas-like) is equivalent to the extremes of divergence seen for all attC sites. In all cases the IS element had precisely inserted into an equivalent position within the attC site, corresponding to between positions h and i (Fig. 4). In the five cases where we could not reconstruct the complete attC site, we could identify an IS-attC junction consistent with position h (unambiguous identification of position h is not always possible from a partial sequence [error of ±1 nucleotide]). Position h was also the immediately adjacent nucleotide to five of the degenerate IS-like fragments. The sequences immediately surrounding ISHhaX, ISPpu11, and ISPnaX did not show the diagnostic secondary structural features of attC sites, nor did they show local primary sequence conservation (Fig. 4).
![]() View larger version (33K): [in a new window] |
FIG. 4. Comparison of all available sequences harboring IS1111-attC elements (or fragments thereof) aligned according to secondary structure. The 51 conserved positions in the attC pseudopalindrome are indicated; upper- and lowercase letters are complementary. Agreement with the inverted repeat predicted by the pseudopalindrome model is indicated by shading (note that positions F-A derive from the next attC site in the array, so mismatches between a-f and F-A are expected [see also Fig. 1]). Partial sequences inferred to derive from attC sites are also shown. The sequence surrounding the most closely related nonmember of IS1111-attC (ISHhaX1) is shown below the alignment to illustrate the lack of relationship to attC sites. Dots represent unavailable sequence, dashes represent alignment gaps, and a space represents the absence of a gene cassette-related sequence (array truncation). The numbers shown in the blaGES-9 and blaOXA-17 attC sites represent additional nucleotides not shown.
|
|
|
|---|
70 to 80% conservation in most characterized examples) (22). Therefore, the ability of IS1111-attC elements to exploit integrons, and particularly MRIs, will be dependent on their activity in the host bacterial species and their ability to tolerate attC sequence variation. Integron integrases (IntIs) appear to tolerate attC variation by recognition of a specific structure rather than a specific sequence (5, 32). All known attC sites form a similar stem-loop with conserved structure and extrahelical bases (25, 49), and it is proposed that the key to IntI-mediated recombination is a DNA template formed by self-pairing of a uniplex DNA molecule. This mechanism is consistent with integron biology since the majority of DNA is acquired in single-stranded form (34). IS elements also seem to tolerate remarkable levels of attC variation, but the notion of a single-stranded DNA target is more difficult to reconcile with IS biology. ISPst6 was found in nine distinct target sites that showed up to 23% sequence divergence, ISPa21 was found in three target sites (encompassing 70% divergence), and collectively the target sites of the family encompassed attC site variation equivalent to that of IntIs. It is probable that all IS1111-attC elements are capable of specifically inserting into a wide range of attC sites. However, they may only be active under specific conditions since their distribution was limited and we could not demonstrate full function in E. coli.
Impact of IS1111-attC elements on integron-related phenotypes. Integrons depend on a host replicon for survival. Where an integron's maintenance in its host replicon is dependent on selective advantage conferred by integron-related phenotypes, IS insertion in gene cassettes is potentially detrimental. Integron-related phenotypes result from their ability to capture, and then direct the expression of, cassette-associated genes through their PC promoter. Integrons can coexpress multiple cassette-associated genes via readthrough transcription from PC to at least some extent (13), and it is therefore useful to distinguish between simple and complex integron-related phenotypes. Simple phenotypes are those that result from activation of a single cassette-encoded gene. The direct interaction between IS1111-attC and gene cassettes is predicted to have minimal impact on simple phenotypes. By specifically targeting the attC site, IS1111-attC elements avoid disruption of the ORF and any direct interference with its transcription. This elegant strategy is reminiscent of the interaction between group II retroelements and terminators (42), and it is noteworthy that a subset of group II retroelements also targets attC sites (9, 40).
Complex phenotypes are those that are encoded by multiple genes. These could be additive, as in the case of multiple drug resistance, or emergent as in the case of multicomponent enzymes or degradative pathways. The expression of complex phenotypes is dependent on the simultaneous expression of cassette genes (13), and IS insertion is highly likely to uncouple PC-directed expression of downstream gene cassettes. An IS could minimize this impact by restoring simultaneous expression if it carried outward-facing promoters (1, 28, 47, 50). The presence of such a promoter was proposed in ISPa21, although no confirmation of expression of the downstream aminoglycoside resistance genes was reported (37). In the case of ISPst6, an equivalent putative POUT promoter was tested and found to be inactive in E. coli (pUS92 construct). In the P. stutzeri arrays examined here no phenotype is known, so further testing of this aspect was not feasible. In summary, the IS are unlikely to impact simple phenotypes but are predicted to influence complex phenotypes. To the extent that complex phenotypes are important to the maintenance of an integron, IS1111-attC elements may impact its evolutionary fitness. It is important to recognize here that complex phenotypes are a product of integron function over time, so we also need to consider ongoing activity of the integron.
Impact of IS1111-attC elements on integron functions. There is a strong correlation between genomic context and observable complex phenotypes in integrons. Multiple resistance phenotypes have only been reported from integrons on mobile elements, despite CIs typically containing more gene cassettes. This may reflect that for MRIs, their success is linked to that of their host plasmids. Resistance plasmids exploit a niche of frequently changing antibiotic challenges. In this selective context, it is important not only to express resistance genes but also to maintain the capacity to rapidly change the resistance phenotype. Indeed, it has been proposed that superior integron functions contribute to the relative success of class 1 integrons over class 2 and class 3 integrons in environments of intensive antibiotic use (23, 24).
The insertion of IS1111-attC elements is predicted to impact integron functions via the disruption of cassette mobility. It was therefore surprising to find examples where identical (ISPaX3_blaOXA-17) or near-identical (IsUnCu1_aadA1) copies of an IS-gene cassette fusion existed in two distinct integron contexts (Table 2). These observations could reflect either IntI-mediated mobilization of the fusion cassette or, alternately, integron-mediated capture or excision of adjacent gene cassettes. Irrespective of the mechanism, we note that the presence of a single fusion cassette has not prevented the capacity to develop new combinations of gene cassettes.
Long-term outcomes of IS-integron interaction. We also considered that the impact on integron function may increase as occupancy rates of attC sites increases or only become evident over long periods of time in a suitable host for IS activity. Our sample design enabled us to examine long-term outcomes by comparison of orthologous CIs in Pseudomonas strains separated by a range of evolutionary distances. We detected evidence for eight distinct IS elements (at the 95% identity level), each of which was confined to a single Pseudomonas lineage. We interpret this as independent and recent acquisition of a distinct IS in each lineage. Nonfunctional versions of all eight elements were found, and only for ISPst6 did we also find fully functional copies. Multiple near-identical copies of ISPst6 were present in nonidentical gene cassettes. We interpret this as ISPst6 acquisition by a common ancestor of the three genomovar 2 strains, followed by independent expansion to available attC sites in each array. A consequence of this expansion is that the operon-like structure of the cassette array is heavily disrupted. Perhaps the most significant observations were that in all cases where an integron contained an IS1111-attC element the cassette arrays were different and that in all cases where the available sequence information extended to the chromosomal framework at the right-hand end of the integron, a degenerated IS1111-attC element marked the cassette array boundary.
When viewed collectively, these data suggest that a cycle of "infection," expansion, and extinction of IS1111-attC elements occurs within CI arrays (Fig. 5). The IS invasion of CIs does not directly result in loss of the integron or stabilization of the cassette array. It also does not result in loss of simple phenotypes, but it is expected to influence the expression of complex phenotypes (if these exist). The eventual outcomes are extinction of the IS, truncation of the cassette array, and modification of the right-hand boundary of the integron. This extinction/truncation process may involve other genetic elements (five unrelated IS elements were also seen) and is sufficiently rapid that it can be completed much faster than speciation in the P. stutzeri complex (Fig. 5).
![]() View larger version (19K): [in a new window] |
FIG. 5. Representation of the cycle of interaction between IS1111-attC elements and chromosomal integrons in Pseudomonas. A schematic 16S rRNA phylogenetic tree is shown on the left with four P. stutzeri genomovars and selected other species shown (the evolutionary distance within genomovars is exaggerated). (A) The preinfection phase was observed in genomovar 8 strains. In all five strains, no IS1111-attC sequences were seen, and integrons had the typical CI array structure. A small number of ORFs of unknown origin separate the last cassette from chromosomal framework genes. (B) The expansion phase was observed in genomovar 2 strains. This process is probably rapid since the IS must have been acquired after divergence from Gv8 and has given rise to multiple gene cassette/IS1111-attC element fusions by independent expansion in all three strains in the time since acquisition. (C) The postextinction state was observed in genomovars 1 (two of three strains) and 7 (one of five strains) and in P. mendocina (one of two strains) and P. alcaligenes (one strain). No intact IS were observed, and IS fragments defined an abrupt boundary between the last cassette and chromosomal framework genes.
|
Conclusions. The "DNA translocases" of integrons and IS1111-attC are heterologous proteins with different catalytic mechanisms. Comparison of the structural basis of their binding with the same recombination site is likely to shed significant insight into the nature of specific DNA-protein interactions. We consider that the recognition of gene cassette attC sites by both enzyme families represents convergent evolution. Both the IS1111-attC transposases and the IntIs are monophyletic groups in which all known members share target specificity for attC sites and the most closely related outgroups of the same protein superfamily do not. This implies that attC specificity is an ancestral feature of both the IS1111-attC and IntI clades that has retained its biological relevance to both types of genetic element over the course of their evolutionary radiation. The advantage to IntIs of targeting attC sites is obvious given the relationship between integrons and gene cassettes. At first impression, the advantage to IS1111-attC transposases would seem to be that it gives access to large numbers of insertion sites. However, given the extinction cycle of IS-integron interactions and the rarity of functional IS, this is arguably not a successful strategy. We postulate that this unusual example of convergent evolution represents a broader biological theme, namely, that there is long-term selective advantage in interaction between genetic elements, and shared recognition of specific, extragenic target sites enables this. Insertion sequences are known to be involved in the formation of mosaic genetic elements (48), and the interaction between integrons and IS1111-attC group elements in P. stutzeri represents a model for how IS elements may initially come into contact with diverse assemblies of genes. As genomic data sets with greater replication at multiple levels of evolutionary divergence are assembled and analyzed, we predict that further examples of families of simple genetic elements sharing target sites will be recognized and ultimately shown to contribute to the formation of mosaic genetic elements.
Strains used in this study were provided by Johannes Sikorski and Neil Wilson. We acknowledge Nick Coleman and Ruth Hall for helpful discussions.
Published ahead of print on 16 May 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»