Previous Article | Next Article ![]()
Journal of Bacteriology, June 2002, p. 3194-3202, Vol. 184, No. 12
0021-9193/02/$04.00+0 DOI: 10.1128/JB.184.12.3194-3202.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Kayoko Takashima,6,
Hideaki Nojima,3,
Keisuke Nakayama,2 Akira Kaji,7 Yoshiyuki Kamio,4 Takeyoshi Miki,8 Hirotada Mori,3,6 Eiichi Ohtsubo,5 Yoshiro Terawaki,1 and Tetsuya Hayashi2*
Department of Bacteriology, Shinshu University School of Medicine, 3-1-1 Asahi, Matsumoto, Nagano 390-8621,1 Department of Microbiology, Miyazaki Medical College, 5200 Kihara, Kiyotake, Miyazaki 889-1692,2 Institute of Advanced Biosciences, Keio University, 14-1 Baba, Tsuruoka, Yamagata 997-0035,3 Laboratory of Applied Microbiology, Department of Molecular and Cell Biology, Graduate School of Agricultural Science, Tohoku University, Tsutsumi-dori, Amamiya-machi, Aoba-ku, Sendai, Miyagi 981-8555,4 Institute of Molecular and Cellular Biosciences, University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113,5 Research and Education Center for Genetic Information, Nara Institute of Science and Technology, 8916-5 TakayamaIkoma, Nara 630-01,6 Department of Molecular Microbiology, Kyushu University Graduate School of Pharmaceutical Sciences, 3-1-1 Maidashi, Higashi-ku, Fukuoka 812-8582, Japan,8 Department of Microbiology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104-60767
Received 23 October 2001/ Accepted 26 February 2002
|
|
|---|
|
|
|---|
As in many large plasmids of gram-negative bacteria, Rts1 requires two elements for its autonomous replication, a replication initiation protein encoded by the plasmid (repA) and a short segment containing the replication origin, ori (30, 34, 59). Because the replication of this mini-Rts1 plasmid is stable at 37°C but is inhibited at 42°C in Escherichia coli (30), as observed for Rts1 (56, 58), the replication machinery of Rts1 itself is thermosensitive. A locus termed tdi (temperature-dependent instability) has also been identified as another locus responsible for the temperature sensitivity (48, 55), but its real physiological role in the replication (or maintenance) of Rts1 is unknown.
As for the temperature-sensitive effect on host cell growth, Terawaki et al. (57) first reported that Rts1 inhibits its host cell growth at 42°C but not at 37°C. Since then, two loci responsible for this phenotype have been identified, tsg (43, 47) and hig (60-62). Since the tsg locus contains no open reading frame (ORF), the AT-rich DNA segment itself is thought to be responsible for the thermosensitivity of host cell growth. On the other hand, the hig locus encodes a system belonging to the proteic killer family, where the HigB and HigA proteins function as a toxin and an antitoxin, respectively. The temperature-sensitive host cell growth conferred by the hig system is mediated by selectively killing the host that has lost all copies of Rts1 at a nonpermissive temperature (postsegregation killing).
Conjugation of Rts1 is also thermosensitive, but the temperature range for conjugation differs from that for replication, efficient at 25°C but not at 37°C (56). Thus, the thermosensitivity of conjugation is a phenomenon unrelated to that of replication. The molecular mechanism underlying the thermosensitive conjugation is unknown except that the lengths of Rts1 sex pili were reported to be shorter at 37°C than at 30°C (8). No Rts1 gene for conjugation has been identified.
Another temperature-sensitive phenotype shown by Rts1 is the restriction of T-even phages: Rts1 inhibits the multiplication of T-even phages at 30°C but not at 43°C (66). A locus encoding a restriction enzyme, PvuRts1I, is responsible for this phenomenon (28, 32).
Except for above-mentioned thermosensitive phenotypes, only a few Rts1-related phenomena have been documented so far, and only nine genes have actually been cloned and analyzed. They indeed represent a very small portion of the large gene repertoire encoded on the Rts1 genome. In this study, we determined the entire nucleotide sequence of Rts1, comprising 217,182 nucleotides, and identified all the potential genes encoded by the plasmid. They include the set of genes for the conjugation function and a large number of genes involved in various processes of DNA metabolism. Furthermore, by inspecting the overall genome organization, we identified four modules constituting the Rts1 genome. Among them, two were generated by a large segment duplication and subsequently by structural and sequence diversification. These findings provide new insights into the mechanisms of the evolution and gene repertoire expansion of large plasmids in general.
|
|
|---|
(lacZ)r1 supE44 trp31 mcrA1272::Tn10(Tetr) his1 rpsL104(Strr) xyl-7 mtl-2 metB1
(mcrBC-hsdSMR-mrr)102::Tn10(Tetr)] (32). Plasmid pUC18 (65) and E. coli strains DH5
MCR and DH5
(both from Gibco-BRL) were used to construct random shotgun libraries of the Rts1 genome. E. coli HB101 (leuB6 supE44 thi-1 hsdS20 recA13 ara-14 proAB lacY1 galK2 rpsL20 xyl-5 ntl-1) (7) was used as a recipient strain in the mobilization assay. All strains were cultivated with Luria-Bertani (LB) or 2xYT medium supplemented with appropriate concentrations of antibiotics (ampicillin, 100 to 200 µg ml-1; kanamycin, 30 µg ml-1; and streptomycin, 500 µg ml-1).
DNA manipulation and sequencing determination.
Routine DNA manipulations were carried out by the standard methods (53). To purify the intact Rts1 plasmid, E. coli strain ER1648 containing Rts1 was cultivated overnight in 2xYT medium containing kanamycin, and the plasmid DNA was extracted with a plasmid kit (Qiagen). The DNA was further purified by cesium chloride density gradient centrifugation. The whole genome sequence was determined by the random shotgun method as described previously (45, 46). Collected sequences were assembled by the Sequencher sequencing software (version 3.0, gene codes). After the initial assembling of 4,750 sequences obtained by the forward primer (
500 bases in average), a short region remained to be determined. To fill the sequence gap, 15 clones that covered the gap were selected and sequenced by the reverse primer and custom primers. In addition, 178 clones, the opposite ends of whose inserts were expected to cover the regions with any sequence ambiguity, were selected and sequenced by the reverse primer. The final redundancy of sequencing was about 11-fold, and both strands were read at least once throughout the entire genome.
ORF prediction and computer analysis. ORFs were predicted with the software GeneMark (6) trained with the matrix generated from a set of genes from various phages and plasmids. In principle, ORFs larger than 150 bp preceded by recognizable ribosome-binding sequences were searched. By this search, 223 ORFs were identified. In the regions where no ORF was identified by GeneMark, ORFs larger than 150 bp with typical ribosome-binding sequences were searched for manually. DNA and protein sequences were compared to the public sequence databases with the Blast program (4) through DDBJ. Multiple alignments of protein sequences were made with ClustalW through DDBJ.
Construction of the physical map. To construct the physical map of Rts1, the purified plasmid DNA was digested with XbaI, SpeI, SalI, or various combinations of these enzymes. Digested DNAs were analyzed by conventional agarose gel electrophoresis with 0.35 to 0.8% agarose gels prepared by AgaroseH (NipponGene). Pulsed-field gel electrophoresis (PFGE) was also employed to precisely determine the sizes of large DNA fragments. PFGE was performed with the CHEF Mapper apparatus (Bio-Rad) on 1% agarose gels (pulsed-field certified agarose, Bio-Rad) in 0.5x TBE (Tris-borate-EDTA) and regularly run at 6 V/cm at 14°C with a linear increase in pulse intervals. To resolve fragments of 20 to 120 kb in size, pulse times were ramped from 2.98 to 10.29 s for 26.56 h.
Identification of the oriT locus.
To identify the oriT locus of Rts1, we screened the random shotgun library of Rts1 by the mobilization assay. To construct a pUC18-based library, the purified plasmid DNA was fragmented by sonication, and fragments of 0.5 to 5.0 kb in length were cloned into the SmaI site of pUC18. Resulting plasmids were introduced into E. coli DH5
containing the intact Rts1 that served as a helper plasmid. After precultivation at 28°C, each transformant was incubated with HB101 overnight at 28°C in 96-well microtiter plates. Each coculture was then transferred to fresh medium containing streptomycin and incubated overnight to reduce the background in the screening of transconjugants. Selection of transconjugants was done on LB agar plates containing streptomycin and ampicillin. Sizes of pUC18 derivatives in each transconjugant were analyzed and compared with those in the donor clones to confirm the absence of rearrangement during the mobilization. Frequency of kanamycin-sensitive transconjugants was also examined for each clone to determine whether each clone was transferred by mobilization or cointegration into the helper Rts1 plasmid. Finally, both ends of the inserts were sequenced to map their positions on the Rts1 genome. To further localize the oriT locus, various lengths of DNA fragments covering the oriT-containing region identified by the random screening were generated by PCR, cloned into pUC18, and subjected to the mobilization assay. Nucleotide sequences of all the clones generated by PCR were confirmed by sequencing.
Nucleotide sequence accession number. The annotated sequence of Rts1 has been deposited in DDBJ/GenBank/EMBL under accession number AP004237.
|
|
|---|
Rts1 encodes 300 ORFs (001 to 165 and 167 to 301, Fig. 1). They indeed include the nine Rts1 genes that were identified previously. Among the 300 ORFs, 253 are oriented in the same direction, left to right in Fig. 1. This remarkable bias may indicate that Rts1 genome replication proceeds mostly in this orientation. In the homology search, the products of 141 ORFs showed significant sequence similarity to known proteins, and among these, 99 were homologous to proteins whose functions are known or predicted. No ORFs directly related to bacterial pathogenesis were identified.
![]() View larger version (40K): [in a new window] |
FIG. 1. Genome organization of Rts1. Boxes indicate ORFs identified on Rts1, and numbers above and below the boxes are ORF numbers. ORFs shown above and below the line are transcribed left to right and right to left, respectively. Predicted functions of ORFs were categorized into eight groups, and ORFs in the same group are indicated by the same color. Locations of ori, oriT, IS elements, and transposons are also indicated.
|
Transposon and IS elements. Rts1 contains six kinds of insertion sequence (IS) elements (nine copies in total) and three transposons (Table 1, Fig. 1). All together, they comprise 28,644 bp, 13% of the plasmid genome. IS610 and IS611 are the ones newly identified in this study, and both belong to the IS3 family. IS611B and IS611C are located very closely (444 bp apart), but they do not appear to form a composite transposon because no target sequence duplication was detected in the regions flanking the two IS elements.
|
View this table: [in a new window] |
TABLE 1. Transposon and IS elements on the Rts1 genome
|
Tn9199 and Tn6901 are newly identified class II transposons. Besides the genes for transposases and resolvases, each of these transposons carries four ORFs. ORF088 and ORF089 of Tn9199 are similar to a putative ATPase and a putative sulfate permease encoded by Tn2501 on Yersinia enterocolitica plasmid pGC1, respectively (25, 68). The N-terminally truncated arsC gene (ORF090) is present in the upstream region of the two ORFs. ORF087 is similar to BetT, a transporter of choline, which is a precursor of glycine betaine, serving as an osmoprotectant (38). ORF009, ORF010, and ORF012 on Tn6901 are highly homologous to YaiN (90% amino acid sequence identity over the entire length), alcohol dehydrogenase AdhC (94%), and YaiM (78%), respectively, of E. coli K-12. The gene organization is also similar to the yaiN-adhC-yaiM locus on the K-12 chromosome, though an ORF of unknown function is inserted between ORF010 and ORF012 in Rts1.
Conjugation system. Genes for the conjugation functions are clustered in two regions separated by a 16-kb segment. Most of the gene products show the highest similarities to those of the conjugation systems of plasmid F or plasmid R27, the prototype IncFI and IncHI1 plasmids, respectively (16, 54).
Genes required for the conjugal transfer of plasmid F have been most intensively analyzed, and they are categorized into five groups according to their functions: pilus biosynthesis, aggregate stabilization, conjugal DNA metabolism, surface exclusion, and regulation of gene expression. The genes for sex pilus formation are further divided into two subgroups, those for pilin synthesis and for pilus assembly (reference 16 and references therein). Homologues of all the F genes for pilus assembly except trbI were identified in Rts1. ORF216 shows homology to two pilus assembly proteins of F plasmid, the N-terminal half to TrbC and the C-terminal half to TraW. Although little is known about the exact functions of these F proteins, both are known to be located in the periplasmic space (16). Fusion of the two genes probably indicates that they participate in the same or closely related operational steps in pilus assembly. The same type of gene fusion is observed in plasmid R27, and most of the pilus assembly genes of Rts1 are also similar to the R27 genes (54).
Making a sharp contrast to the pilus assembly genes, no homologues of the F genes for pilin synthesis, traA, traQ, and traX, were identified in Rts1. Instead, Rts1 contains a gene (ORF215) homologous to traF of RP4, which encodes a propilin-specific peptidase (14, 20). R27 also contains a traF homologue, TrhF (54). It should be also noted that ORF185 encodes a type I leader peptidase, which could participate in the first processing step of pilin maturation, removal of the signal peptide.
The aggregate stabilization system of Rts1 is also similar to that of plasmid F, since ORF242 and ORF244 showed similarity to TraG and TraN of plasmid F, respectively. However, no genes homologous to the F genes related to DNA metabolism, surface exclusion, and regulation of gene expression were identified. Instead, ORF201 and ORF202 exhibit significant similarities to a putative nickase (TraI) and an anchoring/coupling protein (TraG) of plasmid R27, respectively (54). These proteins are required for DNA transfer in R27, and thus the DNA transfer system of Rts1 is similar to that of R27.
Among the F genes that are not essential for conjugation but are located in the transfer region, only trbB exhibits a weak and local homology to an Rts1 gene, ORF212. ORF212 is a homologue of DsbC, which catalyzes disulfide bond formation in the periplasmic space (50). A predicted active-site motif, Phe-(X)4-Cys-Pro-Tyr-Cys (42), is conserved in ORF212, and the homology between ORF212 and TrbB of F is limited to a segment containing the motif. DsbC homologues are also present in the transfer regions of R27 (54) and pNL1 (52), suggesting that they play some role as redox proteins in the conjugation process.
ORF197 encodes a protein similar to Nuc, the EDTA resistance endonuclease of pKM101 (49). Nuc homologues are encoded in close proximity to transfer regions in many plasmids belonging to several incompatibility groups (36, 49), though their roles in conjugation are unknown. ORF197 is also located just upstream of the tra gene clusters in Rts1. An extracellular nuclease activity was previously detected in E. coli harboring Rts1 (41), but the relationship between the nuclease activity and ORF197 remains to be elucidated.
Identification of the oriT region. In the conjugation process, transfer of DNA initiates at a specific site, termed oriT. oriT is usually located at or close to the end of the tra gene cluster, and the tra genes are organized so that they are transmitted late in the conjugation process. Since our first attempt to identify the oriT site of Rts1 by sequence similarity to known oriT sequences of other conjugative plasmids (39) was unsuccessful, we tried to identify the oriT region of Rts1 by screening the genomic DNA library by the mobilization assay as described in Materials and Methods. The principle of this approach is that when the oriT region is cloned into a nonmobilizable plasmid, pUC18, the recombinant plasmid can be mobilized by cohabitation of the parent plasmid (64).
By this screening, we obtained 20 ampicillin-resistant clones. Except for one clone that contained the entire IS611(B), 19 clones were mapped to one of four regions (Fig. 2): positions 2145 to 6073 (region I), 56019 to 57984 (region II), 110435 to 111997 (region III), and 186537 to 193835 (region IV). In the cases of clones derived from regions I, II, and III, kanamycin resistance, a marker of the helper Rts1 plasmid, was always cotransferred with ampicillin resistance (Fig. 2A), implying that they were probably transferred to the recipient cells by cointegration into the helper Rts1 plasmid. In contrast, transfer of the clones derived from region IV was accomplished independently with the helper plasmid, indicating that region IV contains the true oriT site of Rts1. The region was in fact located in close proximity to the transfer region.
![]() View larger version (20K): [in a new window] |
FIG. 2. Identification of the oriT locus of Rts1. (A) Summary of oriT screening. Clones obtained in the oriT screening of the random shotgun library of the Rts1 genome are listed, and their map positions and kanamycin sensitivities are presented. All the clones were derived from one of the four regions, I to IV, but the clones derived from regions I, II, and III were always cotransferred with kanamycin resistance, a marker of helper plasmid Rts1. (B) Map positions of the clones derived from region IV and those prepared by PCR. The T2 segment represents the minimum consensus of the eight random clones obtained in the oriT screening. The T2 segment was able to mobilized by Rts1, but the segments outside of T2 were not (O1, O2, and O5), indicating that the oriT locus of Rts1 is located in the T2 segment.
|
Replication and maintenance systems. Rts1 contains one replicon, consisting of the repA gene (ORF001), encoding a replication initiator protein, and the ori segment, containing several sequence elements characteristic of plasmid replicons that employ the iteron-based replication initiation and control mechanism (10, 31, 34, 67). The minimum segment for replication initiation defined by Itoh et al. (31), which contains duplicated dnaA boxes, four GATC sequences, and five 19-bp iterons, spans from position 51 to 238.
In Rts1, no partitioning system for stable maintenance was identified before, but a system highly similar to the parABS partitioning system of plasmid P1 was identified. ORF293 and ORF294 are highly homologous to ParA and ParB of P1, respectively, and are arranged in the same order as in P1 (1). Furthermore, a parS-like structure containing three copies of repeated sequence of 7 bp and an integration host factor binding motif were identified just downstream of ORF294 (11, 17). Although two additional ORFs, ORF190 and ORF273, also exhibited weak similarities to ParB of P1, they are not accompanied by parA-like genes. Rts1 contains a proteic killer system encoded by higBA (ORF261 and ORF262), which functions as a plasmid maintenance system (61, 62). The tdi locus could also be involved in the maintenance of Rts1 (48, 55). Thus, the stable maintenance of Rts1 is achieved by multiple systems.
Large segment duplication on the Rts1 genome. By examining the sequence similarity among all the gene products of Rts1, we identified 33 pairs of homologous genes which were located on the first 100-kb region of the Rts1 genome (Fig. 3). Genes comprising each pair are separately located in the first and the second halves of the 100-kb region, and the gene order is completely conserved. This finding indicates that the 100-kb region was created by a large segment duplication. Homology observed in each pair of genes is, however, not so remarkable (up to 65.6% amino acid sequence identity), implying that each subregion has undergone significant sequence diversification after duplication, and each pair of genes may have functionally diverged. In addition, the two subregions contain different sets of IS or transposon elements, and several genes or gene clusters seem to have been deleted or replaced after the duplication.
![]() View larger version (22K): [in a new window] |
FIG. 3. Large segment duplication in the Rts1 genome. The gene organization of the first and second halves of the 100-kb region of Rts1 (corresponding to modules M1a and M1b, respectively, in Fig. 4) is compared. Homologous genes are indicated by shading, and transposable genetic elements inserted in the regions are depicted by triangles. The homologous genes identified in each subregion are organized in the same order in both subregions. At the bottom, sequence similarities between each pair of homologous genes are presented (percent amino acid sequence identity). Levels of sequence conservation observed in each pair of homologous genes are not so high, implying that each gene has undergone significant sequence diversification after duplication.
|
ORF015 and ORF091 of Rts1 both exhibit significant sequence similarity to the enterobacterial Tus proteins throughout the entire molecules. The sequences known to be essential for the Tus functions are well conserved (33), implying that both or either ORF015 and ORF091 encode Rts1-specific Tus proteins involved in the replication termination of Rts1. Because the 11-bp core sequence of the ter consensus (TGTTGTAACTA) is absent on the Rts1 genome, Rts1 Tus homologues must recognize different sequences.
ORF031 and ORF116 are highly homologous to the E. coli SSB protein (70% amino acid sequence identity over 147 residues and 84% over 113 residues, respectively). The ability of Rts1 to rescue the replication defect of an E. coli ssb mutant (18) could be explained by the presence of these ssb homologues. Rts1 contains another ssb homologue (ORF222) in the region between the two transfer gene clusters, but it is very distantly related to E. coli SSB (32% identity over 95 residues) and has an extremely long C-terminal tail, suggesting that it has some distinct function. ORF007/ORF008 and ORF079/ORF080 both encode UmuDC homologues, which mediate error-prone repair in the SOS repair system (51, 63). Although ORF008 is C-terminally truncated by the insertion of Tn6901, the other set of genes (ORF079/ORF080) appears to be intact and is probably responsible for the recently identified umu-complementing activity of Rts1 (35).
ORF053 and ORF131 are homologous to Dam methylase, which plays important roles in the initiation of replication and the sequestration of newly replicated oriC segments to the cell membrane (40). Both ORFs have been confirmed to possess Dam methylase activity by the SeqA focus formation experiment with the dam null mutant KA468 (24; T. Onogi and S. Hiraga, personal communication). Homologues of the beta and theta subunits of DNA polymerase III holoenzyme are also encoded (ORF050 and ORF081, respectively).
The gene for PvuRts1I, a T-even phage restriction enzyme, also resided in this region (corresponding to ORF063). In a previous study, an ORF encoding 290 amino acids was identified upstream of the PvuRts1I gene, but its protection activity relative to PvuRts1I was not detected (32). In this study, however, this ORF (ORF062) was found to actually encode a larger protein of 593 amino acids homologous to TraG1, a putative restriction methylase of plasmid R478. The function of this ORF needs to be reexamined with the possibility that it may have protection activity against PvuRts1I.
Genome organization and evolution of Rts1. When the genome organization of Rts1 is overviewed, the genome can be considered to be composed of four modules (Fig. 4), two modules created by duplication (M1a and M1b), a transfer region-containing module (M2), and a module containing the genes required for replication and stable maintenance of Rts1 (M3). As for the large segment duplication, some site-specific recombination system might be involved in the event, because all the clones that formed cointegrates with Rts1 in the oriT screening were derived from three regions (regions I, II, and III), each corresponding to the predicted boundary of the duplicated segments. Since the oriT screening was performed in the recA mutant background, some site-specific recombination must mediate cointegrate formation. Rts1 encodes several genes related to the recombination function, and some of them might participate in cointegrate formation and segment duplication.
![]() View larger version (50K): [in a new window] |
FIG. 4. Circular representation of the Rts1 genome. Locations of modules M1a to M3, constituting the Rts1 genome, as well as regions I to IV that were identified in the oriT screening (see Fig. 2 and the text) are shown. Positions of transfer regions in module M2 and genes for replication and stable maintenance in module M3 are also indicated.
|
Concluding remarks. Determination of the complete sequence of the large conjugative plasmid Rts1 provides several lines of important information on plasmid biology. All the genes potentially related to the conjugation function have been identified, which will facilitate comparative and experimental approaches to the full understanding of bacterial conjugation systems. An unexpected finding is the presence of multiple genes related to various types of DNA metabolism. They include plasmid-encoded tus homologues. The analysis of the genome organization of Rts1 provided a prominent example of the modular evolution of a large plasmid genome. The exchange or acquisition of modules from other replicons and the capture of IS elements and transposons are deeply involved in the generation of Rts1, as has been repeatedly discussed for plasmid genome evolution. In Rts1, however, another mechanism, the duplication of a large module and subsequent structural and functional diversification of the duplicated modules, has also played an important role in genome evolution and gene repertoire expansion.
This work was supported by a Grant-in-aid for Scientific Research from the Ministry of Education, Science, and Culture of Japan and grants from the Yakult Foundation.
Present address: Gene Regulation and Chromosome Biology Laboratory, Division of Basic Science, National Cancer Institute-Frederick Cancer Research and Development Center, Frederick, MD 21702. ![]()
Present address: Specific Toxicology Research R&D, KISSEI Pharmaceutical Co., Ltd., 2320-1 Maki, Hotaka, Minamiazumi, Nagano 399-8305, Japan. ![]()
Present address: Division of Molecular Oncology, Osaka University Graduate School of Medicine (C7), 2-2, Yamada-oka, Suita, Osaka 565-0871, Japan. ![]()
|
|
|---|
CTX, a cytotoxin-converting phage of Pseudomonas aeruginosa: implications for phage evolution and horizontal gene transfer via bacteriophages. Mol. Microbiol. 31:399-419.[CrossRef][Medline]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»