Department of Fundamental Microbiology, University of Lausanne, CH-1015 Lausanne, Switzerland,1 INRA-INAPG, 78850 Thiverval Grignon, France,2 Department of Genetics, Faculty of Biology, Bielefeld University, D-33594 Bielefeld, Germany,3 Swiss Federal Institute of Aquatic Science and Technology (Eawag), CH-8600 Dübendorf, Switzerland4
Received 30 November 2005/ Accepted 19 December 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The clc element was originally discovered in Pseudomonas sp. strain B13, the first described Pseudomonas able to metabolize 3-chlorobenzoate (3-CBA) (17). Essential for the metabolism of 3-CBA are the clcRABDE genes that encode the enzymes for 3- and 4-chlorocatechol degradation, which are two metabolic intermediates of 3-CBA (20). It had been known for a long time that the clc element is capable of self-transfer to other Beta- and Gammaproteobacteria (44, 51, 71). However, the self-transfer process could not be attributed to a conjugative plasmid (70). Instead, it was discovered that the clc element is normally integrated into the chromosome of its hosts but can excise at a low frequency and self-transfer to a new host in which it reintegrates (49). Southern hybridization analysis suggested that the clc element is present in two copies in the chromosome of Pseudomonas sp. strain B13 (49). Critical to the integration process is the integrase gene (intB13), which is situated near the right end of the clc element and near the integration site (50). Apart from the intB13 integrase gene and the clc genes, only one other region of the clc element had been characterized previously. This region at the outer left end was postulated to contain regulatory factors that are possibly involved in integrase expression control (56). This region has later been recognized as being part of a larger well-conserved core in other syntenic GEI sequences (38).
Even though ICE and GEIs share common features, such as the presence of integrase or conserved gene regions, many unanswered questions remain, particularly on their evolution and on their mode of mobilization. A GEI "core" structure was proposed on the basis of comparisons among a set of GEIs (38). In different GEIs, this core was interrupted by more variable regions, although the mechanisms governing such variability are not clear. As far as mobilization modes, clear hints exist on the nature of the transfer system of some GEIs and ICE. For example, the symbiosis island of Mesorhizobium loti strain R7A carries a trb operon, potential tra genes, and a type IV secretion system (60). Potential type IV secretion systems are carried on pKLC102 of Pseudomonas aeruginosa C (33), on ICEEc1 of E. coli strain ECOR31 (54), and on the biphenyl catabolic transposon Tn4371 (65), although the latter element is not self-transferable. Also, the SXT element carries tra-related genes and is an active mobile element (3, 7). This led to the hypothesis that ICE and GEI arose from merges between phages and plasmids during their transition to a chromosomal integrative element.
Because the clc element is one of the few genomic islands which is completely self-transferable and one of the only three ICEs currently known to contain catabolic gene functions, we decided to determine its complete nucleotide sequence in order to find out other specific catabolic properties of the clc element, to discover the possible nature of the self-transfer system, and to derive evolutionary relationships between the clc element and other GEIs or plasmids. Two other new catabolic gene clusters were detected on the element, reinforcing the general idea of the clc element being a specialized catabolic genomic island. By way of growth studies and gene expression analysis, we analyzed the newly discovered catabolic properties carried by the clc element. On the other hand, the origin of the presumed conjugative transfer system remained unclear.
| MATERIALS AND METHODS |
|---|
|
|
|---|
(Gibco Life Technologies, Gaithersburg, Md.) was routinely used for plasmid propagation and cloning experiments. Pseudomonas sp. strain B13 (17) is the original host of the clc element. The clc element was cloned from the chromosome of Pseudomonas putida strain RR221, a transconjugant that was previously obtained from mating between P. putida F1 and Pseudomonas sp. strain B13 (49). This transconjugant contains two copies of the clc element, which are integrated at two separate locations in the chromosome of P. putida F1. Pseudomonas aeruginosa strain 1999 is a transconjugant with a single integrated copy of the clc element and was obtained by conjugation between Pseudomonas strain B13 and P. aeruginosa PAO1-rif, a spontaneous rifampin-resistant mutant of the type strain PAO1 (obtained via Dieter Haas). Bacterial strains were stored at 80°C in spent Luria-Bertani (LB) medium containing 15% (vol/vol) glycerol. E. coli, P. aeruginosa, and Pseudomonas sp. strain B13 were routinely grown in LB medium (52). The type 21C mineral medium (MM) (21) was used to grow Pseudomonas strain B13, P. aeruginosa PAO1-rif, and P. aeruginosa 1999 under defined conditions. This medium was supplemented with 10 mM 3-CBA, 1 mM 2-aminophenol (2-AP), or 10 mM glucose as a carbon source. When necessary, the following antibiotics were used at the indicated concentrations: ampicillin, 100 µg/ml; rifampin, 50 µg/ml; and kanamycin, 50 µg/ml. Pseudomonas sp. strain B13 was grown at 30°C; P. aeruginosa and E. coli were grown at 37°C. Molecular techniques. The PCR, DNA cloning, plasmid or cosmid DNA isolations, DNA fragment recovery, DNA ligations, and restriction enzyme digestions were all carried out according to standard procedures (52) or according to specific recommendations by the suppliers.
Cosmid, plasmid, and transposon insertion libraries. From the total DNA of P. putida RR221, a SuperCos 1 cosmid library with inserts ranging in size from 33 to 42 kb had been constructed previously (49). Cosmids 2B1, 1G3, 4H12, and 3G3 (49) were overlapping and covered the major part of the clc element. The complete sequence was then derived by the partial shotgun sequencing of plasmid sublibraries of cosmids 3G3 and 4H12, according to established procedures (4), and by sequencing a random transposon insertion library of cosmid 1G3, which was constructed with the help of the EZ::TN <oriV/KAN-2> insertion kit (Epicentre, Madison, Wis.). The outermost right end of the clc element was recovered as a 10-kb NotI fragment from cosmid 2B1 (resulting in plasmid pTCB172) and subcloned in different overlapping fragments in pUC28.
DNA sequencing on cosmids with transposon insertions was performed bidirectionally on double-stranded DNA templates with a Thermo Sequenase cycle sequencing kit with 7-deaza-dGTP (Amersham Biosciences AB) by using the suggested primers facing outwards from the kanamycin gene (Epicentre). Those sequencing reactions were analyzed on an automated DNA sequencer model 4200 IR2 (LI-COR, Lincoln, NE) as described previously (50). Plasmid DNA from sublibraries of the cosmids 3G3 and 4H12 was prepared by either alkaline lysis by applying the Montage Plasmid Miniprep96 kit (Millipore, Schwalbach, Germany) or in vitro amplification by applying the TempliPhi DNA sequencing template amplification kit (Amersham, Freiburg, Germany). Subsequently, the inserts of these plasmids were end sequenced by using M13 universal and reverse standard primers and the DYEnamic ET terminator cycle sequencing kit (Amersham, Freiburg, Germany) and were separated on MegaBACE 1000 (Amersham) or ABI PRISM 3700 capillary sequencers (PE Applied Biosystems). All plasmid DNA preparation and sequencing reactions were performed as recommended by the manufacturers. Multiple single sequences read in both directions covered the complete clc element region. After a first assembly and alignment, two gaps of 200 bp and 2.5 kb remained within cosmids 3G3 and 4H12, respectively. These two regions were amplified by PCR, cloned into pGEM-T Easy, and directly sequenced by primer walking using custom-made oligonucleotides as primers. Low-quality regions were resequenced with custom-made infrared-labeled oligonucleotides as primers and cosmid DNA as the template on the LI-COR system.
Assembly, correction, and annotation. For each cosmid, the individual sequences were assembled into contigs by using the Staden software package (59), including the base-calling program phred (18, 19) and the assembly program phrap. For the finishing phase, the graphical tool Consed (22) was used to correct assembly errors and resolve low-quality regions. The Autofinish program (23) suggested primers to close gaps, improve sequence quality in regions of high error rates, and eliminate any single subclone regions. Two sequences that were submitted previously, AJ004950 and AJ536665, corresponding to the right and left extremity of the clc element, respectively, were included in the final assembly. Open reading frames (ORF) were identified with the application MapDraw of the DNAStar software (DNAStar, Madison, Wis.) and compared with sequences in GenBank using the BLASTP search tools (1). Assignment of ORFs was based on additional contextual information, such as the proximity of ribosome binding sequence motifs. The BLASTN algorithm was used to determine pairwise GEI homologies. Graphical comparison between GEIs was generated with the Artemis comparison tool (ACT) (9), which was downloaded from the website http://www.sanger.ac.uk/Software/ACT. Cumulative TA skew (i.e., the number of T residues minus the number of A residues per 100 bp) and G+C content were analyzed by previously described methods (25, 26).
Growth and induction experiments and RNA isolation. Pseudomonas sp. strain B13 was grown at 30°C in 30 ml MM supplemented with 10 mM glucose to an optical density at 600 nm of 0.5. Cells were then washed and resuspended into 30 ml fresh preheated (30°C) MM. To measure catabolic gene expression, various compounds were added (a single compound per assay): 10 mM glucose, 10 or 1 mM 3-CBA, 10, 1 or 0.1 mM 2-AP, 10 or 1 mM anthranilate, 10 or 1 mM salicylate, 10 or 1 mM benzoate, 10 or 1 mM nitrobenzene, 10 or 1 mM 4-hydroxybenzoate, or 10 or 1 mM 4-CBA. Cultures were incubated for an additional 1 h at 30°C with rotary shaking before isolating total RNA. For RNA isolation, 30 ml of culture was immediately harvested by centrifugation (13,000 x g, 1 min) in 1.5-ml tubes and the supernatant was decanted. Cell pellets were resuspended in 50 µl of RNAprotect bacteria reagent (QIAGEN GmbH) in order to stabilize the RNA. Suspensions from 6 or 7 pellets were pooled together and centrifuged again, the supernatant was discarded, and the pellet was stored at 80°C for a maximum of 1 month. Prior to RNA isolation, pellets were thawed and resuspended in 0.5 ml of TES buffer (10 mM Tris-HCl [pH 8.0], 1 mM EDTA, 100 mM NaCl), after which the total RNA was extracted and purified according to a protocol described previously (2).
Growth on 2-AP was tested on Pseudomonas sp. strain B13, P. aeruginosa strain 1999 (containing one integrated copy of clc), and P. aeruginosa PAO1. All strains were grown during 16 h in 5 ml LB and subsequently diluted 1:500 (vol/vol) into fresh minimal medium with 1 mM 2-AP. Higher 2-AP concentrations resulted in the toxicity and retardation of cell growth (not shown). Flasks were completely covered to avoid exposure to light. At regular time intervals, samples were taken from the culture, serially diluted in sterile physiological salt solution (i.e., 0.9% NaCl, 1 mM MgCl2), and plated on LB medium to count the number of CFU. Regular measurements of culture turbidity were unreliable due to the formation of dark photooxidation products from 2-AP.
Probe preparation. Probes for mRNA analysis were generated by digoxigenin-labeling of DNA templates in the PCR by using a mixture of deoxyribonucleotides containing DIG-11-dUTP (Roche Diagnostics AG). The following probes were prepared: (i) a 425-bp fragment of the clcA gene, amplified with primers 5'-AGGTGGCCCAGCAGACGCC-3' at position 13304 and 5'-CACCGAGGTCGCACAGACGC-3' at position 13729; (ii) an 808-bp amnB fragment, amplified with primers 5'-CGCATCTGGTCTATGGGGA-3' at position 23997 and 5'-GTTGCCAGTGCCGATGAC-3' at position 24805 with plasmid pCBA253 as a template; and (iii) a 1-kb fragment coding for the large subunit of the putative aromatic dioxygenase (ORF5994), amplified with primers 5'-TCGCAGGAAGTGTATGACC-3' at position 7142 and 5'-GCGATGTACGAATGGCTGT-3' at position 6109 of the total sequence. The labeling quality as well as the specificity of probes was judged satisfactory after Southern hybridization against Pseudomonas sp. strain B13 genomic DNA digested with different restriction enzymes (data not shown).
Dot blot hybridization and relative mRNA quantification.
For gene expression analysis, the purified total RNAs were centrifuged, dried, and dissolved in diethyl pyrocarbonate-treated water to achieve a concentration of
1 µg per µl. Volumes of 15 µl of each RNA sample and dilutions thereof containing 1, 0.3, and 0.1 µg RNA were dot blotted onto positively charged nylon transfer membranes (Hybond-N+, Amersham Biosciences AG) in a 96-well dot blot manifold (Gibco Life Technologies). Dilutions of denatured plasmid DNAs containing the targeted catabolic gene inserts were included on the same blot. Membranes were hybridized, and the digoxigenin-label was detected by an antidigoxigenin alkaline phosphatase-conjugated antibody and chemiluminescence according to the supplier's instructions (Roche Diagnostics AG). Signal intensities of hybridization spots on film were measured densitometrically and calculated as intensity volumes (i.e., mean pixel intensity times the total number of pixels per spot area) and then standardized for the area of an ideal spot (i.e., with a diameter of 6 mm) similar to that described by Leveau et al. (35). Signal intensities of RNA samples were then interpolated on the standard curve, expressed as "equivalent number of DNA copies," and divided by the amount of total RNA that was blotted on that spot. RNA copy numbers were then divided by the value that was derived under incubation conditions with glucose to obtain an induction factor. All experiments were performed with independent biological triplicates.
Nucleotide sequence accession number. The DNA sequence of the clc element of strain P. putida RR221 is deposited in the NCBI/EMBL database under accession number AJ617740.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
The first gene next to the attR sequence was coding for the IntB13 integrase. This enzyme belongs to the bacteriophage-type integrases of the phage P4 subfamily and has been shown to be implicated in the site-specific chromosomal integration of the clc element (50). Downstream of the intB13 gene in the direction of attL, a gene coding for a putative permease of the major facilitator superfamily (ORF2848) is situated. This was followed by a cluster of genes (ORFs 4438, 5512, and 5994) putatively encoding an aromatic ring dioxygenase enzyme complex (Table 1, Fig. 1D). Products of these genes had strong homologies to the anthranilate 1,2-dioxygenase from Burkholderia cepacia DB01 (57% amino acid [aa] identity for the small subunit and 67% for the large subunit, AY223539) (10) as well as to the PhnA1A2 salicylate dioxygenase present in Sphingomonas (50% amino acid identity to PhnA1a and 34% to PhnA2a, AJ633532) (15). At positions 9151 to 14893, the previously identified clcRABDE genes were found (20, 30, 32). The next easily recognizable region (positions 22813 to 31341, Fig. 1E) consisted of a complete catabolic operon for 2-AP degradation via meta cleavage (31) that was similar to the amn gene cluster of Pseudomonas sp. strain AP-3 (61, 62) and to parts of the nitrobenzene pathway genes of Pseudomonas fluorescens strain KU-7 (41) and P. putida HS12 (45). The amn operon on the clc element is formed by a putative transcriptional repressor (amnR, ORF22813), a ferredoxin-like protein (ORF23526), and eight amn genes, amnBACDFEHG (Fig. 1E). The amn genes of the clc element were in slightly different order than in the amn operon of Pseudomonas sp. strain AP-3 (AB020521), in which the amnD gene is located downstream of amnE. The amino acid sequence identity levels between orthologous amn partners of the clc element and those of strain AP-3 were as follows: AmnB, 76%; AmnA, 54%; AmnC, 66%; AmnD, 66%; AmnF, 56%; AmnE, 49%; AmnH, 68%; and AmnG, 77%. Next, we found three components of a putative multidrug efflux pump (ORF32963, ORF34495, and ORF36077) that were closely related to those present in the plant pathogen Ralstonia solanacearum (Table 1). Among other putative genes with a distinguishable function, we found a nitrilase (ORF41917).
Functionality of the 2-AP pathway. Growth experiments and gene expression analysis were used to reveal whether the newly discovered catabolic genes on the clc element were functional. Since the clc element is present in two copies in Pseudomonas sp. strain B13, a functional analysis was more easily performed in a transconjugant carrying only one copy. The clc element was transferred by conjugation from Pseudomonas sp. strain B13 into P. aeruginosa PAO1-rif. As expected, PAO1 transconjugants could grow on 3-CBA, whereas PAO1 itself could not. This confirmed functionality of the clc genes for chlorocatechol degradation in PAO1 as well. Some PAO1 transconjugants (e.g., strain 1999) carried only a single clc insertion as determined by Southern hybridization and PCR analysis (not shown). In contrast to PAO1-rif, strain 1999 formed a 10-fold higher biomass on 2-AP (Fig. 2A). Due to the toxicity and rapid photooxidation of 2-AP, growth experiments could be conducted with only 1 mM. Strangely enough, Pseudomonas sp. strain B13 itself could not grow on 1 mM 2-AP, which might be the result of a metabolic misrouting or production of a toxic intermediate from 2-AP (Fig. 2B). Since PAO1 does not carry the clc element, whereas strain 1999 does, we can conclude that the clc element indeed confers the ability to grow on 2-AP (in some strains). Most likely, this is the result of the presence of the amn genes.
|
|
Expression of the putative large subunit dioxygenase (ORF5994) increased 26-fold in the presence of 1 mM anthranilate and 18-fold with 1 mM 3-CBA compared to that in the presence of glucose alone. In contrast to amnB, a concentration of 10 mM anthranilate or 10 mM 3-CBA provoked no induction of ORF5994. Strong (19-fold) induction of ORF5994 was also observed with 1 mM 2-AP, and slightly weaker induction was observed with 1 mM but not with 10 mM of benzoate, 4-chlorobenzoate, and salicylate (Table 2).
Exposure of Pseudomonas sp. strain B13 to 1 mM 3-CBA provoked a fivefold induction of clcA compared to that for the culture grown in glucose only. Similar amounts of clcA mRNA were observed with 1 mM anthranilate, and twofold less was observed after exposure to 1 mM salicylate or 4-chlorobenzoate. As for ORF5994 expression, less clcA mRNA was produced after exposure to 10 mM 3-CBA and 10 mM anthranilate. The strongest clcA induction (10-fold) was detected in bacteria exposed to 1 mM benzoate. In contrast to amnB and ORF5994, 1 mM 2-AP and 4-hydroxybenzoate did not induce clcA expression relative to glucose. The clear expression observed for all three gene clusters confirms (clc or amn) or suggests (orf5994) that the genes are functional and implicated in aromatic compound metabolism. However, as mentioned above, Pseudomonas sp. strain B13 could not grow on 2-AP despite amn gene expression.
Expression patterns were much more complex than expected from the nature of the metabolic pathways themselves and suggested cross-activation by nonnative regulators. Only two specific regulatory proteins for regulating catabolic gene expression can be deduced from the clc element's sequence: ClcR, a Lys-type transcriptional regulator activating the clc genes in response to 2-chloro-cis,cis-muconate (arising from 3-chlorocatechol metabolism) (Fig. 1D) (12), and AmnR, a repressor for the 2-AP pathway (in analogy to NbzR, a transcriptional repressor for the nbz genes of which the effector compound is unknown) (Fig. 1E) (45). In contrast to what we expected, we observed that the clc genes were induced in the presence of not only 3-CBA and 4-chlorobenzoate but also anthranilate, benzoate, and salicylate. Anthranilate, benzoate, and salicylate can typically be converted by pseudomonads to catechol and cis,cis-muconate, an analog to 2-chloro-cis,cis-muconate. However, ClcR is not effectively induced by cis,cis-muconate (45). Therefore, activation of the clcA promoter by anthranilate, benzoate, and salicylate seems to take place via cross-activation by a non-clc-element-encoded activator such as CatR (46). ORF5994 (taken as representative for the four-gene cluster in this region) showed a similar expression profile as clcA (i.e., induction with 3-CBA, 4-chlorobenzoate, anthranilate, salicylate, and benzoate), but this was in addition to 2-AP as well. This suggests that there is a separate regulatable promoter in front of ORF5994 which is different from the clcA promoter. Several potential LysR-type binding regions (i.e., ATAC-N7-GTAT) are located upstream of ORF5994, but it is presently unclear whether these are involved in binding ClcR, CatR, or even another transcription regulator. To complicate matters further, the amn genes were not only induced with 2-AP (as expected for a functional 2-AP pathway) but also with 3-CBA (at 10 mM), anthranilate, and 4-hydroxybenzoate. Of these, only anthranilate might theoretically be converted into 2-AP, thus leading to the effector needed for derepression of AmnR. The experiments showed that, whereas we usually think of metabolic pathways as "linear" (i.e., induced in the presence of only the true pathway substrate or intermediate), many of them are actually part of a cross-induced network. This, in turn, may be the cause for the misrouting of certain metabolites and formation of toxic intermediates and may be one reason for the absence of growth on 2-AP by Pseudomonas sp. strain B13.
A highly conserved, yet unknown, left end.
The other half of the clc element, from ORF50240 to the 18-bp, left-end repeat, encoded mostly proteins of unknown function, some of which have been recognized previously to be conserved in various different bacterial strains (38) (Table 1). Due to the absence of recognizable transfer or mobilization functions in the integrase-containing first half of the clc element, we hypothesize that the second half must encode such functions, given the self-transferable nature of the clc element (49, 51). The size of the region (
50 kb) would be sufficiently large to harbor a complete set of genes for plasmid relaxosome and mating-pair formation (
20 kb) (11). However, no overall DNA or protein homologies to known plasmid conjugative systems or the type IV secretion systems were detected in this area. Hence, we can conclude that the clc element is not a hybrid between a phage and a known tra-like conjugative plasmid like the SXT-element (8) or Tn4371 (65). Only a few putative ORF products encoded in this region showed significant similarity to gene products involved in conjugative DNA transfer. For example, ORF59888 encoded a 956-aa peptide with a 43.7% alignment to the COG3451 VirB4 domain between amino acids 400 and 800 (including 15 gaps). The overall similarity of the predicted ORF59888 peptide to VirB4 of Agrobacterium tumefaciens, however, was less than 20%. At the same time, orthologs of ORF59888 with between 47% and 99% amino acid sequence identity over the full length of the peptide were detected in (currently) 28 complete bacterial genomes, including Xanthomonas campestris, P. aeruginosa, Rubrivivax gelatinosus, Azotobacter vinelandii, and Photorabdus luminescens. This strongly suggests an important functional conservation of this protein. Next, ORF68987 weakly resembled another component of DNA transfer/type IV secretion system in gram-negative bacteria. The region of aa 191 to 662 aligned 86.5% to the pfam02534 TraG/TraD family (E value 8 · 107) and 64.4% among 596 residues to the COG3505 VirD4 domain (E-value 6 · 105). The hypothetical protein encoded by ORF75419 (760 aa) contained two domains with putative DNA helicase function, between amino acid positions 112 and 186 (DExH box, 53.8% alignment without gaps) and 599 and 721 (HELICc motif, 90.8% alignment without gaps). Helicases are implicated in the unwinding of the nicked plasmid in relaxosome formation during conjugation (13, 37). Thirteen highly identical orthologs (49% to 99%) were currently detected in the NCBI database for the ORF75419 peptide by BLAST searches.
ORF91884 putatively encodes a type IA DNA topoisomerase (671 aa, 99.8% alignment to the COG0550 TOP1Ac domain between positions 150 and 600), and it matched more than 30 entries in the NCBI database with as much as 50% amino acid identity. Topoisomerases of this type are capable of reversible cleavage of double- and single-stranded DNA and function in DNA replication, conjugation, and cointegrate resolution. Finally, a putative single-stranded DNA binding protein (SSB) was predicted from ORF94175 (79.6% alignment with COG0629, SSB), which may be implicated in the primase complex during the nicking of a double-stranded circular intermediate and its transport into the new cell. No regions with significant similarities to the oriT sequence of IncP1-ß plasmids were found on the clc element. Two genes which previously had been implicated in the regulation of integrase expression, inrR (ORF94689) and ORF98147 (previously named ORF3) (56), were situated at the left extremity of the clc element. ORF100033 (previously named ORF1) and ORF98147 carry 70-aa domains with significant similarity to ParA and ParBc, respectively (56).
If we thus assume an analogy in the clc element's transfer process to known plasmid conjugative systems, clc transfer might possibly look like the following. At a certain low frequency under stationary-phase conditions, the clc element excises and forms a circular double-stranded intermediate, which has been detected by PCR and Southern hybridization (55). In some cells, the transfer of this circular intermediate may be initiated analogously to conjugative plasmids by the nicking, unwinding, and presentation of a single-stranded DNA at a mating-pair complex in the membrane, which is exemplified by such proteins as the ORF59888-encoded VirB4-domain containing peptide, the ORF75419 DNA helicase, the DNA topoisomerase Ia (ORF91884 peptide), and the ORF94175 SSB-like peptide. Interestingly, similarly to the expression of IntB13, the expression of ORF75419 (putative helicase) but not ORF59888 (putative VirB4) was induced under stationary-phase conditions in Pseudomonas sp. strain B13 (M. Gaillard, unpublished data), suggesting that excision and unwinding are regulated independently from the formation of a mating-pair complex. Further confirmation for the conjugative transfer hypothesis, however, has to await functional analysis of the many conserved hypothetical genes in this region.
Comparison and possible evolution of the clc element. The clc element was related to a number of GEIs found in P. aeruginosa (e.g., pathogenicity islands PAGI-2 and PAGI-3) (34), suspected GEIs (e.g., in Burkholderia xenovorans LB400, Ralstonia metallidurans, and Xylella fastidiosa), and nonclassified chromosomal regions (e.g., in Xanthomonas axonopodis pv. citri strain 306, R. gelatinosus PM1, and Azoarcus sp. strain EbN1). Various "core" functions shared between the clc element and an even larger set of syntenic genomic islands have been recognized before (38). Two more or less strongly conserved regions between GEIs related to the clc element are interspersed with a more variable region (Fig. 3). The first highly conserved region solely encompassed attR and the integrase gene (85% nucleotide identity with PAGI-2, 82% with PAGI-3, 88% to 92% with X. fastidiosa, 88% with Azoarcus sp., 85% with R. gelatinosus, and 99% with B. xenovorans). The second highly conserved region comprised the circa 50 kb at the left end of the clc element (Fig. 3). In between those regions, the GEIs differed, and in the case of the clc element, the variable part contained the catabolic genes. The most highly related partner to the clc element was a 125-kb chromosomal region detected in the unpublished genome sequence of B. xenovorans LB400, previously B. fungorum LB400, a microorganism known for its capabilities to degrade polychlorobiphenyls (39). The boundaries of the GEI present in B. xenovorans LB400 (AAAJ00000000) may be formed by two direct repeats of 79 bp encompassing two tRNAGly genes. The clc element and the chromosomal region on B. xenovorans shared an overall nucleotide identity of 99% to 100%, except for two distinct regions which were absent in the clc element (Fig. 3A). The first one is a 20-kb fragment occupying the area between the ORFs 5994 and 8052 on the clc element, containing genes for two subunits of an ortho-halobenzoate 1,2-dioxygenase, for two transmembrane proteins, and for a general secretion pathway protein A. The 20-kb region is flanked by two insertion element copies (3.3 kb, with 99% nucleotide sequence identity to ISPpu12, one of which was subject to a secondary 1.5-kb insertion), suggesting that it might have become inserted into an ancestor clc element present in or transferred to B. xenovorans. The second 2-kb region contains a gene coding for a putative reverse transcriptase maturase and is present between homologs of ORFs 83350 and 84338 (clc element annotation). The clc element is also very similar to a GEI present in Ralstonia sp. strain JS705 (AJ006307) (40) which has been only partly sequenced. The GEI of strain JS705 has acquired an additional 10-kb region (40) containing the mcb genes for chlorobenzene dioxygenase and dihydrodiol dehydrogenase at a site between ORF15405 and ORF15962 of the clc element (Fig. 3A). The cumulative TA skew distribution along the clc element (Fig. 1C) showed a region of 9.5 kb with a distinctly lower value compared to that for the rest of the clc element. This region corresponded to the location of the amn genes and suggests that an ancestor of the clc element had acquired the amn gene fragment in a distinctive single insertion. This suggestion is further supported by the ORF transcription direction in this area (Table 1). Although the majority of ORFs on the clc element are transcribed in a direction opposite to that of the integrase gene, the amn genes and a few other downstream-located genes are oriented like the integrase gene. These data demonstrate that the evolution of the clc element and its close relatives (as in B. xenovorans and Ralstonia) primarily takes place by acquisition of new gene fragments into the variable region.
|
Newer members of this family of GEI may include a chromosomal region of 73 kb in the R. gelatinosus PM1 genome (NZ_AAEM00000000), although no repeat sequences and no nearby tRNA gene were detected (Fig. 3C). This region almost completely lacked a variable part and consisted basically of only the genes present in the conserved region of the clc element. Ten ORFs from the clc element were absent in the R. gelatinosus region, including the putative DNA methyltransferase (ORF89746). On the other hand, the R. gelatinosus region had about 18 kb of DNA different from the clc element (between ORF81655 and ORF91884), coding among others for a cation efflux system, for a mercury resistance regulatory protein (MerR), and for an arsenate reductase. In contrast to those mentioned above, two chromosomal regions in X. axonopodis pv. citri strain 306 (14) and in Azoarcus sp. strain EbN1 (47) of 134 kb and 141 kb, respectively, showed strong similarity (79 to 89% nucleotide sequence identity) to the clc element but differed in gene organization (Fig. 3D). Both regions were not flanked by repeated sequences, and no gene for tRNA was found nearby. In addition, the X. axonopodis region did not contain an integrase gene. On the contrary, the putative GEI in Azoarcus sp. strain EbN1 carried a duplication of the integrase and of a 16-kb region corresponding to the segment extending from ORF50240 to ORF68987 on the clc element. The Azoarcus region contained several catabolic gene functions, such as an ethylbenzene dehydrogenase and an acetophenone carboxylase.
Thirteen ORFs were uniformly present in the conserved regions between the GEIs presented here, including the putative DNA topoisomerase III/Ia (ORF91884), the single-stranded-DNA binding protein (ORF94175), the probable transcriptional regulator inrR (ORF94689), and the chromosome partitioning-related protein (ORF100033). Parts of those functions have been recognized previously as core among an even larger set of GEIs (38, 56). This conservation may indicate that potentially more of the suspected GEIs than just the clc element of Pseudomonas sp. strain B13 and Ralstonia sp. strain JS705 are functionally self-transferable entities. However, although it is presumed that GEIs have arisen from a merge of plasmid and phage functions (8, 33), it is curious that no fully replicative plasmid counterparts have so far been detected for the clc and ICEHin1056 type of elements.
In summary, our results present the first complete sequence of a self-transmissible GEI from Pseudomonas implicated in aromatic compound degradation. Herewith, therefore, an important paradigm in aromatic compound degradation, namely, that of pertinent association of self-transmissibility, aromatic compound degradation, and plasmid conjugation, has to be changed. Although the recent past has seen various reports of complete catabolic plasmid sequences, including pWW0 (xylene degradation in P. putida) (24), pEST4011 (69), and pJP4 (2,4-dichlorophenoxyacetic acid degradation in Achromobacter xylosoxidans and Cupriviadus necator JMP134) (66), or pADP-1 from Pseudomonas sp. strain ADP for atrazine degradation (36), the clc element sequence demonstrates that genes for aromatic degradation may become part of a different class of mobile elements, namely that of ICE. In this respect, the clc element is similar to the biphenyl transposon Tn4371 of Ralstonia oxalatica A5 (65), which, however, is defective for self-transfer but carries recognizable tra and trb genes similar to those of IncP1-ß plasmids. A second important realization from the work presented here, which has been stated before in a slightly different form (38), is that GEIs with a similar core structure can display various more or less pronounced functions, e.g., antibiotic resistance, aromatic compound metabolism, or toxin production. However, there is a certain danger when subsequently classifying GEI on the basis of these "pronounced" functions (16). For example, although the clc element may seem "specialized" for aromatic compound metabolism (and, thus, would represent an "ecological" GEI), it is not exempt of putative pathogenicity functions, such as a potential multidrug or solvent efflux system (ORFs 32963, 34495, and 36077) (48). Hence, rather than maintaining absolute categories, it has to be stressed that there is a continuum of different functional characteristics among GEIs (such as from more pathogenic to more pronounced catabolic character) and a further evolution by gene acquisition and rearrangement by which GEIs contribute to adaptation and selection of bacteria in changing environments.
| ACKNOWLEDGMENTS |
|---|
The work of M.G. was supported by the Swiss National Science Foundation, grant no. 3100-67229.
| FOOTNOTES |
|---|
timent Biophore, University of Lausanne, CH-1015 Lausanne, Switzerland. Phone: 41 21 692 5630. Fax: 41 21 692 5605. E-mail: Janroelof.vandermeer{at}unil.ch. | REFERENCES |
|---|
|
|
|---|