Previous Article | Next Article ![]()
Journal of Bacteriology, March 2007, p. 2443-2459, Vol. 189, No. 6
0021-9193/07/$08.00+0 doi:10.1128/JB.01688-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Lutz Wiehlmann, and
Burkhard Tümmler*
Klinische Forschergruppe, OE 6711, Medizinische Hochschule Hannover, D-30625 Hannover, Germany
Received 1 November 2006/ Accepted 8 January 2007
|
|
|---|
|
|
|---|
Genomic islands are chromosomal regions that are typically flanked by direct repeats and inserted at the 3' end of a tRNA gene. They contain transposase or integrase genes that are required for chromosomal integration and excision and other mobility-related genes. Genomic islands are clone or strain specific and are never found in all clones of a taxon. Most islands are easily differentiated from the core genome by their atypical G+C contents and atypical oligonucleotide compositions, with steep gradients at their boundaries (37, 38). First identified in pathogenic bacteria (pathogenicity islands), genomic islands have since been detected in numerous nonpathogenic species. Genomic islands may confer fitness traits, increase metabolic versatility or adaptability, or promote bacterium-host interaction in terms of symbiosis, commensalism, or virulence (10).
The ubiquitous and metabolically versatile Pseudomonas aeruginosa is an important opportunistic pathogen for humans, plants, and animals (34). Several large genomic islands have been detected in strains from human infections and aquatic habitats. All known large genomic islands of P. aeruginosa but one (28) are integrated into tRNA genes. Two different types were identified, the islands PAGI-2/PAGI-3 (25) and pKLC102 (21)/PAPI-1 (16), respectively. PAGI-2 and PAGI-3 were sequenced in strains C and SG17M of the major clone C (41), an isolate from the lungs of a patient with cystic fibrosis and an isolate from a river. PAGI-2 and PAGI-3 integrate into tRNAGly genes adjacent to the PAO homolog PA2820. In both islands, the first open reading frame (ORF) adjacent to the tRNAGly gene encodes a bacteriophage P4-related multidomain integrase. PAGI-2 and PAGI-3 have a modular bipartite structure. The first part (the cargo region) adjacent to the tRNA gene consists of strain-specific ORFs encoding metabolic functions and transporters, the majority of which have homologs of known function in other eubacteria. The second part (the conserved part) is made up of a syntenic set of ORFs, the majority of which are either classified as conserved hypotheticals or related to DNA replication or mobility genes. Forty-seven of these ORFs are arranged in the same order in both islands with amino acid identities of 35 to 88%.
The other known large genomic islands are integrated into one of the two identical tRNALys genes adjacent to PAO1 homologs PA0976 and PA4541. The sequenced islands that integrated adjacent to PA4541 are the pathogenicity island PAPI-1 of strain PA14 (16) and the mobile genetic element pKLC102 of the clone C strain SG17M (21). The 104-kb pKLC102 and the 108-kb PAPI-1 share a phage module that confers integrase, the att element, and a syntenic set of conserved genes, similar to those detected in PAGI-2 and PAGI-3 (21). The other tRNALys gene adjacent to PA0976 is targeted by genomic islands of various sizes (4 to 81 kb in six sequenced strains) and with various gene contents (16, 21, 24). These islands encode the type III secretion effector protein ExoU, a potent cytotoxic lipase (43), in exoU-positive strains (24). The sequence analysis suggests that the exoU-containing genomic islands probably evolved from an ancestral plasmid similar to pKLC102. Subsequent integrations of insertion elements, deletions, and rearrangements may then have led to the contemporary diversity of the islands (24).
The integration sites for all of these large genomic islands are located in the three hypervariable regions of the P. aeruginosa chromosome (17, 39). Since the PAO gene contig of these regions spans genomic segments of various sizes in other clones (17), we hypothesized that genomic islands account for their pronounced plasticity. We were curious to know whether and to what extent the sequenced pKLC102 and PAGI-2 are prototypes for these suspected genomic islands. PAGI-2 and pKLC102 share a set of 36 homologous genes, 15 of which have been identified in numerous genomic islands of other proteobacteria (29). In this study, the presence of homologs of all ORFs of pKLC102 and PAGI-2 was investigated in a panel of 71 genetically unrelated strains from diverse habitats and geographic origins (30) to assess the abundance and conservation of these types of genomic islands in P. aeruginosa.
Genomic islands are typically stably integrated into the host chromosome. The reversible integration and excision of genomic islands has so far been documented for only a few cases, such as the clc element of Pseudomonas putida strain RR21 (14); pathogenicity islands of Vibrio cholerae (32), Shigella flexneri (42), and Yersinia pseudotuberculosis (27, 33); integrative and conjugative elements (ICEs) of Escherichia coli strain ECOR31 (44) and of Vibrio cholerae (7, 8); and the SaPIbov2 pathogenicity island of Staphylococcus aureus (51); the last two are not integrated into a tRNA gene. Among the P. aeruginosa islands, pKLC102 is known to coexist in episomal and chromosome-integrated forms in clone C strains (21, 41), but no information was available about the chromosomal stability of the other three sequenced large genomic islands. Hence, the relative numbers of integrated and episomal forms were determined for PAGI-2, PAGI-3, pKLC102, and PAPI-1 during growth in vitro. In parallel, the oligonucleotide usage (OU) patterns of the four genomic islands were analyzed to unravel their genomic signatures and any commonalities with each other and their P. aeruginosa host chromosome. In particular, pKLC102 proved to behave like a foreign selfish element, consistent with its exceptionally high mobility.
|
|
|---|
Lseq in all cases. For a given word length lw, an Nw of
different words is possible for a sequence of four letters, A, T, G, and C. The observed counts of words (Co) were compared with the expected counts of words (Ce). Assuming the same distribution frequency for all words of a common length lw irrespective of their compositions and sequences, Ce matches the standard count number Cn0:
![]() | (1) |
![]() |
w of observed from expected counts is given by
![]() | (2) |
Variances OUV of word deviations were determined as follows:
![]() | (3) |
w values according to equation 2. Rank numbers instead of word counts were used to simplify pattern comparison.
The distance D between two patterns was calculated as the sum of absolute distances between ranks of identical words in patterns i and j as follows:
![]() | (4) |
![]() | (5) |
![]() | (6a) |
![]() | (6b) |
. The computational program for determining OU patterns and their comparative analysis and storage in a database was written in Python 2.2 [http://www.python.org/] (38).
Strains. Seventy-one P. aeruginosa strains from diverse origins and unrelated SpeI genotypes (Table 1) (30) were selected from an in-house strain collection. Moreover, the sequenced reference strains PAO1 and PA14 were included. Multilocus genotyping was performed in informative single-nucleotide polymorphisms (SNPs) of the loci oriC, oprL, alkB2, gltA, oprI, ampC, fliC, exoS, and exoU as described previously (5, 30). The 16 binary SNP genotypes of the 71 strains (Table 1) were represented by a four-digit hexadecimal code (see Table S1 in the supplemental material). The 16 SNPs were divided into four groups of 4 SNPs each, and the 16 possible combinations in each group were differentiated by 16 characters (0 to 9 and A to F). Sequential P. aeruginosa isolates were collected from the airways of 36 individuals with cystic fibrosis in half-year intervals after the onset of airway colonization over a period of up to 21 years. Strains were screened for the presence of PAGI-2 by PCR with specific primers for the gene C10 (25). Cupriavidus strains were supplied by M. Mergeay, Mol, Belgium (Table 1). Unless otherwise stated, strains were grown in liquid LB medium or on LB agar plates.
|
View this table: [in a new window] |
TABLE 1. Description of strains
|
Combinatorial PCR.
PCR was performed with PA14-, PAPI-1-, SG17M-, pKLC102-, C-, PAGI-2-, or PAGI-3-derived target-specific primer sequences (see Table S2 in the supplemental material) and 50 ng P. aeruginosa DNA in a 50-µl reaction mixture (5 µl 10x reaction buffer [Eurogentec], 3.3 µl 25 mM MgCl2, 1 µl dimethyl sulfoxide, 10 µl primer solution [5 µM each], 3 µl deoxynucleoside triphosphates [2 mM each], 1 U Goldstar DNA Polymerase [Eurogentec]). For PCR kinetics, aliquots of 5 µl were withdrawn at the indicated cycles, separated by electrophoresis, and stained with ethidium bromide. The relative amounts Ni and Nj of the template DNA sequences i and j in the reaction mixture were determined from the titration for the first reaction cycle n when the PCR products became visible by ethidium bromide fluorescence during the late exponential phase of PCR according to the following equation:
![]() | (7) |
Southern hybridization analysis. To visualize the copy numbers of PAGI-2- and pKLC102-type islands in P. aeruginosa strains, XhoI- or NcoI-restricted genomic DNA was separated by agarose gel electrophoresis, blotted onto Hybond N+ membranes (Amersham), hybridized with digoxigenin (DIG)-labeled PCR-generated probes, and detected by chemiluminescent immunoreactive signals by applying standard procedures (40). According to BlastN analysis, the primer sequences were specific for PAGI-2 or pKLC102 and showed no homology to the PAO1 genomic sequence (49).
Macroarrays. (i) Design. PCR products generated with PAGI-2- or pKLC102-derived primer sequences were spotted onto nylon membranes. The scheme is shown in Fig. 1. For the PAGI-2 macroarrays, 91 PCR products were distributed onto the membrane representing 93 of the 111 predicted ORFs (Fig. 1A). ORF C47 was represented by two different products ("C47a" and "C47b"); the adjacent genes C54 and C55, C76 and C77, and C82 and C83 were each represented by a single ORF-spanning PCR product.
|
View larger version (20K): [in a new window] |
FIG. 1. Schematic diagram of the positions of ORF-derived PCR products on the PAGI-2 (A) and pKLC102 (B) macroarrays. (A) PAGI-2. ORF C47 is represented twice (C47a and C47b) by different PCR products. (B) pKLC102. ORF CP103 is represented twice (CP103a and CP103b) and CP94 three times (CP94a, CP94b, and CP94c) by different PCR products. Five (A) or 10 (B) positive or negative control dots were spotted in the lower left corner.
|
(ii) Production of macroarrays. Probe sequences of 208 bp to 805 bp were generated by four PCRs with cosmids encoding pKLC102 (21) or PAGI-2 (25) sequences as templates. The contig CP39 to CP41 was amplified from P. aeruginosa C genomic DNA. The primer sequences are listed in Table S2 in the supplemental material. All PCRs were performed with 40 to 200 ng cosmid DNA or 100 to 200 ng genomic DNA in a final volume of 100 µl (10 µl 10x buffer [500 mM Tris-HCl, 160 mM NaNH4SO4, 0.1% {vol/vol} Tween 20, pH 8.8], 2 µl 50 mM MgCl2, 6 µl each of 5 µM primer A and B stock solutions, 2 µl dimethyl sulfoxide, 6 µl 8 mM deoxynucleoside triphosphates [2 mM each nucleotide], 2 U Taq DNA polymerase [InViTek]). After denaturation for 300 s at 96°C, 35 cycles were run (annealing for 45 s at 60°C or 58°C, elongation for 45 to 90 s at 72°C, and denaturation for 120 s at 94°C). According to agarose gel electrophoresis and subsequent ethidium bromide staining, more than 80% of all PCR products were at least 99.9% pure and all other PCR products were at least 98% pure. Macroarray copies were produced in parallel from the same stock of pooled PCR products to ensure that the corresponding ORFs were represented by identical amounts of DNA on each membrane. Hence, for each of the 96 PCR products, an aliquot of 50 µl of pooled PCR product, 85 µl Tris-EDTA buffer, and 15 µl 3 M NaOH was dispensed in a well of a 96-well plate, denatured for 30 min at 65°C, and chilled on ice. After the addition of 100 µl 3 M ammonium acetate, aliquots of 100 µl each were transferred by a minifold-dot-vacuum-blot apparatus (Schleicher & Schüll) onto a Hybond N+ nylon membrane soaked in 1 M ammonium acetate. The membrane was dried, and the DNA was immobilized by irradiation with UV light.
(iii) Hybridization of macroarrays. Membranes were incubated for 2 to 16 h at 68°C with hybridization buffer (0.5 M sodium phosphate, 7% sodium dodecyl sulfate, 1 mM EDTA, 0.5% blocking reagent [Roche], pH 7.2), hybridized for 16 to 24 h at 68°C in the same buffer with DIG-labeled genomic DNA, and then washed twice for 30 to 45 min each at 68°C in washing buffer (40 mM sodium phosphate, 1% sodium dodecyl sulfate, 1 mM EDTA, pH 7.2). Detection of DIG-labeled fragments by anti-DIG conjugate antibody, enzymatic cleavage of CDP-Star, and exposure to X-ray films were performed as described previously (40).
(iv) Evaluation of macroarray hybridization signals. Signals were classified as strong, weak, or negative according to the signal intensity of the hybridization of labeled PCR products of known sequence onto restricted cosmid DNA. Strong hybridization signals were obtained for homologs with 85% sequence identity or more. Control hybridizations of PAGI-2 onto the pKLC102 macroarray gave negative signals for all pKLC102-derived gene fragments of the array, whereas the reciprocal hybridization of pKLC102 onto the PAGI-2 array revealed weak signals for four of the 34 homologs. The nucleotide sequence identities of the PCR-amplified fragments with their homologous genes were 72%, 76%, 74%, and 63% for C49, C65, C71, and C108, respectively. The E values of the corresponding BlastN comparisons were 1E-80, 2E-115, 2E-127, and 2E-55. Importantly, the weak homolog of C108 in pKLC102 carried a 28-bp stretch of identical sequence, which may explain the occurrence of the weak cross-hybridization signal despite the lower overall homology. In general, however, applying the stringent hybridization conditions, a minimal sequence identity of 75% between the membrane-bound PCR product and the DIG-labeled genomic sample was estimated to be the threshold for generating hybridization signals.
(v) Parsimony analysis. Parsimony analysis was performed with the program "PARS" from the software package "PHYLIP 3.66" (http://evolution.genetics.washington.edu/phylip.html). Signals obtained with the positive controls PAGI-2 and pKLC102 were defined as the standard normalized to "1" for all island ORFs on the macroarrays. In the cases of PAGI-2 subtypes, signals of C1 (integrase gene), C84 and C85 (transposon genes), and C68/C69 were excluded from the analysis because of possible cross-hybridization of homologs or occasional false-negative signals (C68/C69). Similarly, the ORFs CP84, CP85, CP86, and CP103 of pKLC102 were excluded because homologs are found elsewhere in the genome. The purified datasets of all strains were then either combined or separately evaluated by parsimony analysis with PAGI-2 and/or pKLC102 as a reference, respectively.
|
|
|---|
![]() View larger version (42K): [in a new window] |
FIG. 2. Tetranucleotide usage of the four P. aeruginosa genomic islands pKLC102, PAPI-1, PAGI-2, and PAGI-3. Local OU patterns were analyzed in 5-kb sliding windows with steps of 0.5 kb. Curves of the distance D:n0_4mer, pattern skew PS:n0_4mer, and oligonucleotide variance OUV:n1_4mer are specified by color code: blue for D, green for PS and brown for OUV. Protein-coding genes are shown by red bars. The abscissa separates genes by their direction of transcription. The tetranucleotide usage of the genomic islands was significantly different from that of the whole chromosome. The median (inner quartile) values of local tetranucleotide patterns in the whole P. aeruginosa PAO1 chromosome were 13.9 (12.3 to 16.0) for D:n0_4mer, 21.4 (17.9 to 25.6) for PS:n0_4mer, and 0.37 (0.32 to 0.43) for OUV:n1_4mer.
|
|
View this table: [in a new window] |
TABLE 2. ORFs of the phage and plasmid modules of the genomic island pKLC102
|
In summary, the local tetranucleotide signatures of all four islands were distinct from that of the P. aeruginosa chromosome. PAGI-2 is homogeneous in its tetranucleotide composition throughout the island, but pKLC102, PAPI-1, and PAGI-3 each contain regions of highly atypical tetranucleotide composition.
Chromosomal stability of island integration. The atypical oligonucleotide signature, particularly the pronounced strand asymmetry, prompted us to investigate whether the islands could be spontaneously excised from their host chromosomes. All four islands are endowed with genetic elements of mobility. They harbor phage modules (Table 2) that encode chromosome-partitioning proteins (soj) at one terminus and integrases of the bacteriophage P4 subfamily (PAGI-2, PAGI-3 [25]) or a phage tyrosine integrase (pKLC102 [21] or PAPI-1 [16]) at the other end. PAPI-1 and pKLC102, moreover, include numerous ORFs that are related to plasmid-encoded replication and recombination functions.
Combinatorial PCR that spans the integration sites of the islands was applied to detect excised circularized islands and island-free chromosomes compared to integrated genomic islands. Overnight growing cultures were diluted with fresh liquid LB medium, and samples were then taken from the early-exponential to the late-stationary phase of growth. The relative copy number of circularized PAPI-1 was estimated to be 2% of that of PA14 chromosomes. About 0.3 to 1% of PA14 chromosomes did not carry an integrated PAPI-1 island. A copy number of 30 circular pKLC102 molecules per SG17M host chromosome was estimated from semiquantitative PCR kinetics (Fig. 3). During growth, the percentage of pKLC102-free chromosomes increased from about 2 to 3% in early exponential phase to approximately 10% in stationary phase (Fig. 3). In contrast, no circular forms of PAGI-2 or PAGI-3 were detected by combinatorial PCR. Hence, the spontaneous excision rates, if they occur, are below the sensitivity threshold (1 x 107) of the assay. Consistent with this finding, no strain C or strain SG17M chromosomes were identified by PCR that had cured PAGI-2 or PAGI-3, respectively.
![]() View larger version (58K): [in a new window] |
FIG. 3. Combinatorial PCR analysis of integrated and episomal versions of genomic islands PAPI-1 in strain P. aeruginosa PA14 and pKLC102 in P. aeruginosa SG17M. An aliquot from an exponentially growing culture was inoculated into 100 ml fresh medium adjusted to an optical density at 578 nm (OD578) of 0.2. Samples were then taken from the growing culture (from left to right) at OD578s of 0.9, 1.3, 2.0, 2.9, and 4.0 and after 24 h (left) or at OD578s of 0.9, 1.3, 2.0, and 4.0 and after 24 h (right). Bacteria were growing aerobically in 250-ml flasks in liquid LB medium at 37°C at a mixing frequency of 250 rpm. Chromosome-integrated islands were detected by PCR products spanning the 5' tRNA (il) or the 3' tRNA (ir) integration sites by utilizing PA14- and PAPI-1- or SG17M- and pKLC102-derived primer sequences. Circularized episomal forms (ce) were identified by PCR products spanning the breakpoints in PAPI-1 or pKLC102. PA14 or SG17M chromosomes (fa) devoid of PAPI-1 or pKLC102 were detected by PCR products spanning the tRNALys gene adjacent to the PAO1 homolog PA4541. PCR kinetics were performed with 50 ng P. aeruginosa DNA in a 50-µl reaction mixture. Aliquots of 5 µl were withdrawn at the indicated cycles, separated by electrophoresis, and stained with ethidium bromide.
|
The precise excision of enterobacterial pathogenicity islands has been reported to occur spontaneously at a frequency of 105 to 104 (27, 32, 42, 44), although mutations, deletions, and genome rearrangements are likely to be responsible for the inability of most genomic islands to achieve precise excision and mobilization. In the cases of pKLC102 and PAPI-1, the frequencies of spontaneous excision from the host chromosome are 1 or even 3 orders of magnitude higher. pKLC102 and PAPI-1 harbor the phage module with the xerC integrase gene, some plasmid-related genes, a type IV pilus biogenesis gene cassette, and a syntenic set of conserved ORFs, similar to those detected in PAGI-2 and PAGI-3 (Table 2). These features probably allow the islands to be excised exactly from the chromosome and to form a circular extrachromosomal intermediate of sufficient stability. The lower copy number of PAPI-1 indicates that circular forms were only present in a few percent of cells and probably modulate the phenotype of the PA14 community only to a minor extent. The opposite conclusion applies to pKLC102. Circular forms were in 10-fold excess of chromosomal forms, demonstrating that circular pKLC102 replicates in its host cell. Moreover, a substantial number of SG17M chromosomes became devoid of pKLC102 during growth to higher cell densities. These data verify the previous assignment of pKLC102 as a plasmid (20, 21, 41). The functional plasmid module of pKLC102 is apparently responsible for the highest mobility of a genomic island that, to our knowledge, has ever been reported. As a hybrid of phage and plasmid origin (Table 2), pKLC102 may be considered an intermediate between a mobile genetic element and a genomic island.
Epidemiology of PAGI-2- and pKLC102-like genomic islands in P. aeruginosa. PAGI-2 and pKLC102 share a syntenic set of ORFs (21), homologs of which have been detected in more than 30 genomic islands of other beta- and gammaproteobacteria (29). The presence of these island types in numerous taxa suggests that they form a family with a deep evolutionary origin (29). However, since no epidemiological data have yet been reported, the roles of PAGI-2 and pKLC102 in the contemporary P. aeruginosa population are unknown. Therefore, we investigated the abundance and diversity of PAGI-2- and pKLC102-like genomic islands in 71 strains of unrelated SpeI genotypes (Table 1) (30). The panel included isolates from diverse habitats and geographic origins and was a representative sample of present-day P. aeruginosa clones. Note that 36 of the 71 strains share their SNP genotypes with at least one other strain in the panel (seven pairs, six trios, and one quadruple; the hexadecimal genotypes of strains are listed in Table 1). This finding implies that differences in the accessory genome frequently give rise to macrorestriction fragment patterns that are classified as distinct P. aeruginosa genotypes by accepted criteria (40), although the SNP genotypes of the core genomes are identical.
Macroarrays of PAGI-2 and pKLC102 ORFs (Fig. 1) were hybridized with the strains' DNAs under high stringency to suppress equivocal cross-hybridization signals of homologous genes (see Materials and Methods). The hybridization analyses were calibrated with samples and probes of known sequence so that a sequence identity of at least 75% was required for a positive signal. An identity of 85% or more between the two sequences yielded strong hybridization signals. Tables 3 and 4 show the results of macroarray hybridizations of strains with positive hybridization signals.
|
View this table: [in a new window] |
TABLE 3. PAGI-2 macroarray hybridization patterns of island-positive strains
|
|
View this table: [in a new window] |
TABLE 4. pKLC102 macroarray hybridization patterns of island-positive strains
|
![]() View larger version (117K): [in a new window] |
FIG. 4. Examples of PAGI-2 (upper two rows) and pKLC102 (lower two rows) subtype macroarray hybridization patterns. The PAGI-2 macroarrays show (A) strain PAO (DSM1707) (negative control), (B) strain C (positive control), (C) strain 7 (subtype G1b), (D) strain 3 (subtype G2a), (E) strain 54 (subtype G2c), and (F) strain 63 (subtype G4). The pKLC102 macroarrays show (G) strain PAO (DSM1707) (negative control), (H) strain SG17M (positive control), (I) strain 6 (subtype K1c), (J) strain 10 (subtype K3c), (K) strain 36 (subtype K3d), and (L) strain 53 (subtype K4).
|
Figure 5 summarizes the hybridization results. The signal patterns of the PAGI-2 macroarray were in accordance with the known bipartite structure of individual cargo and syntenic homologs in the sequenced islands PAGI-2 and PAGI-3 (25). The "cargo" genes C2 to C35, which have homologs with known functions in other eubacteria (see Table S3 in the supplemental material), were detected only in PAGI-2 and a close derivative thereof (subtype G2c). PAGI-2 subtypes vary in their attributes encoded by the accessory clusters of "cargo" genes. The commonalities of PAGI-2-type islands are 68 to 77 homologs that include genes related to replication or genetic mobility or that are conserved hypotheticals with unknown functions. Thirty-six of these ORFs have homologs in pKLC102. pKLC102-type islands were more diverse than PAGI-2 types in their combinations of gene cassettes, in accordance with their nested arrangements of island- and subtype-specific ORFs (21), but they apparently carried fewer strain-specific cargo genes. The backbone of more than 50% of the ORFs, including the 36 PAGI-2 homologs, was found to be highly conserved among all pKLC102-type islands. For 90% of the pKLC102 ORFs, homologous sequences were identified in the majority of islands. Only the contig CP94 to CP101 and ORF CP32 were missing in most strains (Table 4). The least abundant ORF, CP32, had the lowest G+C content (41.6%) of all ORFs in pKLC102 and served as the integration site for an integron in strain C (21), which led to large genome rearrangements in sequential isolates from individuals with cystic fibrosis (22). In summary, the diversity of PAGI-2 islands is mainly caused by the insertion of one large block of strain-specific cargo genes, whereas the diversity of pKLC102 islands is primarily generated by subtype-specific combinations of gene cassettes.
|
View larger version (23K): [in a new window] |
FIG. 5. Summary of macroarray hybridization data for 31 PAGI-2-type-positive (A) and 50 pKLC102-type-positive (B) P. aeruginosa strains. The shading indicates the percentages of island-positive strains with a hybridization signal for the respective ORF. Black, 96% of strains positive; dark gray, 90 to 95% positive; light gray, 50 to 89% positive; white, <50% positive.
|
![]() View larger version (26K): [in a new window] |
FIG. 6. Relatedness of macroarray hybridization patterns of 55 PAGI-2- and/or pKLC102-positive P. aeruginosa strains. The unrooted tree is based on the parsimony analysis ("PHYLIP 3.66") of the hybridization data.
|
![]() View larger version (92K): [in a new window] |
FIG. 7. Loss of PAGI-2-type islands in sequential P. aeruginosa airway isolates from patients with cystic fibrosis. (Upper row) PAGI-2 macroarray hybridization patterns of clone C strains SG1 (A) and SG3 (B), indicating the loss of PAGI-2 in the later isolate SG3 while another PAGI-2 subtype was retained. SG1 (strain C) was isolated from the patient's first P. aeruginosa-positive sputum specimen; SG3 is the sixth isolate, collected 2 years later. (Lower row) PAGI-2 macroarray hybridization patterns of clone C strains NN18 (C) and NN86 (D), indicating the loss of a PAGI-2-type island(s) in strain NN86, which was isolated from the patient's last clone C-positive culture 17 years after the acquisition of clone C.
|
![]() View larger version (49K): [in a new window] |
FIG. 8. PAGI-2 macroarray hybridization patterns of Cupriavidus strains C. campinensis AE2701 (A) and C. metallidurans CH79 (B). The boxes highlight absent hybridization signals.
|
![]() View larger version (22K): [in a new window] |
FIG. 9. Similarity of pKLC102-type genomic islands in proteobacteria based on the distance of oligonucleotide usage. The distance D:n0_4mer of tetranucleotides was calculated for each genomic island. The matrix of D values obtained was sorted for the degree of evolutionary relationship between the genomic islands by the Fitch-Margoliash criterion, assuming a constant molecular clock, and by the least-squares methods using the KITSCH program of the PHYLIP library (11). The bar indicates branch length 5. Branch lengths are not drawn exactly to scale. Short branches are exaggerated in length so that they are more visible.
|
50%). The 22 studied genomic islands of the pKLC102 family were intermediate between chromosome and random sequences. The n0_4mer PS values of 18 islands were above the 95% confidence intervals of the n0_4mer PS values of 155 completely sequenced bacterial chromosomes and 316 plasmids (37) (Fig. 10). PAGI-2 belonged to the four islands with PS values within the confidence interval. The PS of PAGI-3 was within the inner quartiles of the PS of the pKLC102 family. PAPI-1 and pKLC102, together with four other islands from Erwinia carotovora SCRI1043, Photorhabdus luminescens TT01, Pseudomonas fluorescens Pf-5, and Yersinia enterocolitica 8081, exhibited the highest PSs of
26%. Thus, the tetranucleotide frequency of the complementary strands was least correlated in these six members of the pKLC102 family. Stably integrated genomic islands have an atypical oligonucleotide composition compared to that of the core genome, but strand symmetry is locally maintained (38). The conjugative islands of the pKLC102/PAGI-2 family, particularly the six islands with the highest PSs, however, do not adhere to this rule and not only locally (Fig. 2), but also globally, perturb strand symmetry. Typically, the pattern skew values of the two replisomes that replicate bidirectionally from the origin and meet in the terminal region are approximately the same (37) and hence compensate for each other to values close to zero for the whole chromosome (Fig. 10). These data mean that both replisomes carry the same burden of physicochemical constraints exerted by strand asymmetry (37). The integration of a pKLC102 element perturbs this subtle balance. As long as the PS of the foreign genetic element remains high, spontaneous excision from the chromosome will occur. In summary, the high pattern skew, together with functional phage and/or plasmid modules, may account for the high mobility of the islands of the pKLC102/PAGI-2 family, suggesting that they behave like selfish parasitic DNA that is prone to horizontal spread within and among taxa.
![]() View larger version (18K): [in a new window] |
FIG. 10. Pattern skew of pKLC102-type genomic islands (squares) and their corresponding chromosomes (triangles). Pattern skew values (n0_4mer PS) are plotted against the logarithmic scale of sequence lengths. The gray-shaded area depicts the 95% confidence intervals of variation of n0_4mer PS values in 155 completely sequenced bacterial chromosomes and 316 plasmids (37). The PS values of n0_4mer patterns of bacterial chromosomes are typically in the range of 1 to 8%. Outliers in the investigated panel are Haemophilus ducreyi 35000HP, with a PS value of about 9%, and Xylella fastidiosa 9a5c, with an extreme value of 24.3%. The n0_4mer PS values of pKLC102-type genomic islands exceed the 95% confidence interval. The genomic islands, identified by their host strains, with the name of the island given in brackets if available, were as follows: 1, Azoarcus sp. strain EbN1; 2, Erwinia carotovora subsp. atroseptica SCRI1043; 3, Haemophilus ducreyi 3500HP; 4, Haemophilus influenzae 86-028NP (ICEHin-like); 5, Haemophilus somnus 129PT; 6, Methylobium petroleophilum PM1; 7, Photorhabdus luminescens TT01; 8, P. aeruginosa C (pKLC102); 9, P. aeruginosa C (PAGI-2); 10, P. aeruginosa PA14 (PAPI-1); 11, P. aeruginosa SG17M (PAGI-3); 12, Pseudomonas fluorescens Pf-5; 13, Pseudomonas syringae pv. syringae B728a; 14, Salmonella enterica subsp. enterica serovar Typhi CT18 (SPI-7); 15, Xylella fastidiosa 9a5c; 16, Yersinia enterocolitica 8081; 17, H. influenzae 1056.b (ICEHin 1056); 18, Neisseria gonorrhoeae MS11 (GGI); 19, Nitrosomonas eutropha C71; 20, Pseudomonas putida RR21 (clc-transposon); 21, P. syringae pv. phaseolicola 1302A (PPHG-1); 22, Yersinia pseudotuberculosis 32777 (YAPI).
|
The genomic islands of the pKLC102/PAGI-2 family are a major exception to this rule. The islands spread across barriers of taxa while retaining the identity of their oligonucleotide signature. Family members were identified at high frequency in the global P. aeruginosa population but were also widespread among other beta- and gammaproteobacteria. Identical islands were detected in phylogenetically distinct clades and isolates from diverse habitats and geographic origins. High strand asymmetry and phage and/or plasmid modules make up the signature of this evolutionarily ancient island family.
pKLC102/PAGI-2 family members share a syntenic set of homologs. Accessory gene clusters are nestled in this core and encode island-specific features. The sequence diversity of these family-typical core genes is higher than that of vertically transmitted orthologs, probably because divergent evolutionary forces act on sequences during horizontal and vertical transmission. Genes that are irreversibly captured by the host chromosome minimize strand asymmetry and become subject to purifying selection, like the genes of the core genome (13, 19), whereas the self-transfer of genomic islands into phylogenetically distinct host chromosomes counterselects strand symmetry, loss of the island-typical oligonucleotide signature, and loss of sequence diversity. Thus, ongoing horizontal transfer maintains a higher sequence diversity of a genetic element than its irreversible incorporation into a host genome.
The functions of most genes of the conserved module of the pKLC102/PAGI-2 family are still unknown, although at least a subset should be involved in the excision, transfer, integration, or stabilization of the island (45, 46). Moreover, mutagenesis studies in PAPI-1 demonstrated that genes of the conserved core are involved in the animal and plant virulence of strain PA14 (16). Future research should unravel in more detail to what extent the syntenic gene set is not only essential for the maintenance of the genomic islands of the pKLC102/PAGI-2 family, but also affects the phenotype of its host strain, as has been demonstrated for individual cargo genes (12, 14, 31).
|
View this table: [in a new window] |
J.K., D.W., and O.R. were members of the International Research Training Group Pseudomonas: Pathogenicity and Biotechnology (IRTG 653 of the Deutsche Forschungsgemeinschaft). Financial support by the priority program Ecology of Bacterial Pathogens: Molecular and Evolutionary Aspects and by the Collaborative Research Program SFB 587 (project A9) of the Deutsche Forschungsgemeinschaft is gratefully acknowledged.
Published ahead of print on 28 December 2006. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
Present address: Bioinformatics and Computational Biology Unit, University of Pretoria, Pretoria 0002, South Africa. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»