Previous Article | Next Article ![]()
Journal of Bacteriology, March 2008, p. 2150-2160, Vol. 190, No. 6
0021-9193/08/$08.00+0 doi:10.1128/JB.01598-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom,1 Department of Biological Sciences and Pest Management, Colorado State University, Fort Collins, Colorado 80523,2 Department of Ecology, Evolution and Behavior, University of Minnesota, St. Paul, Minnesota 55108,3 Department of Horticulture and Crop Science Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, Ohio 44691,4 Department of Plant Pathology, University of Minnesota, St. Paul, Minnesota 551085
Received 2 October 2007/ Accepted 1 January 2008
|
|
|---|
|
|
|---|
The genus Clavibacter provides an excellent resource for obtaining a more comprehensive understanding of plant-microbe interactions. Clavibacter is a member of the family Microbacteriaceae in the Actinomycetales (60). The related plant pathogens include Leifsonia, Curtobacterium, and Rathayibacter and the more distantly related genera Rhodococcus and Streptomyces, (29, 47, 64, 78, 84). Clavibacter is generally considered a genus of plant pathogens, but recent ecological surveys suggested that environmental, nonpathogenic isolates occur more commonly than was previously thought (21, 25, 36, 85). Members of C. michiganensis can usually be further classified to the subspecies level. A cornerstone of subspecies identification in C. michiganensis is the striking host specificity of its plant-pathogenic members. Polyphasic schemes also support the current subspecies classification, but it is noteworthy that the genetic basis for subspecies identification remains unknown (3, 16, 21, 50).
To provide genetic resources that could lead to a better understanding of pathogenicity and host specificity for the genus Clavibacter specifically and for coryneform plant pathogens in general, our studies were focused on obtaining the complete genome sequence of Clavibacter michiganensis subsp. sepedonicus (Spieckermann and Kotthoff 1914) Davis et al. 1984 comb. nov (21). This international and national quarantine pest causes bacterial ring rot, a devastating disease of potato. C. michiganensis subsp. sepedonicus spreads easily within potato farms during seed cutting and can be readily disseminated in latently infected tubers, by infested farm equipment, in storage facilities, and in packing materials. Infections can result in crop losses in the fresh and processed potato industries, but the main economic losses occur in the seed industry, where there is strict zero tolerance for the disease (23). Bacterial ring rot is usually associated with the temperate climates of North America, Asia, Scandinavia, and Northern Europe (73). Crop losses are due to colonization of the tuber vascular system and surrounding tissues, which can lead to extensive secondary breakdown in storage (43, 70).
A specific objective of this work was to conduct whole-genome comparisons between C. michiganensis subsp. sepedonicus and its plant-pathogenic relatives C. michiganensis subsp. michiganensis and Leifsonia as a means of identifying putative pathogenicity-related genes in the ring rot pathogen. Because C. michiganensis subsp. sepedonicus thrives almost exclusively as a plant endophyte, while C. michiganensis subsp. michiganensis is both an endophyte and an epiphyte, the comparative genome studies should also provide insight into aspects of niche adaptation (17, 18, 51, 77). Whole-genome comparisons also were used to identify genomic events associated with the evolution of host specificity and therefore subspeciation within C. michiganensis.
|
|
|---|
Genome sequencing. An approximately 8x shotgun sequence was produced from a total of 49,536 end sequences from pUC clones with 2.0- to 2.8-kb inserts using a Big Dye terminator cycle sequencing kit from Applied Biosystems. Reactions were performed with Applied Biosystems 3700 sequencers. Approximately 0.1x sequence coverage (4.7x clone coverage) was obtained from 768 end sequences from 40-kb inserts cloned into fosmid pFOS1 and was used to scaffold contigs and bridge repeat sequences. The sequence was finished to standard criteria (61). Sequence assembly, visualization, and finishing were performed using PHRAP (www.phrap.org; P. Green, unpublished data) and Gap4 (10). All insertion element sequences were individually verified.
Annotation and genome comparison. Coding sequences were initially identified using a combination of Glimmer 2 (24) and Orpheus (31) and then manually curated using Artemis (61) and Frameplot (7). All genes were manually annotated in Artemis using standard criteria (6). Genome comparisons were visualized using the Artemis comparison tool (19). Putative orthologs were identified by reciprocal best-match FASTA searches of the C. michiganensis subsp. sepedonicus, C. michiganensis subsp. michiganensis, and Leifsonia xyli subsp. xyli protein sequences with the following cutoffs: 80% sequence length and 30% identity.
Time of divergence. Orthologous sequences were obtained for the following seven protein-encoding genes: atpG (921 nucleotides), dnaK (1,889 nucleotides), fadA (1,185 nucleotides), gcpE (1,107 nucleotides), purM (860 nucleotides), rpoA (944 nucleotides), and trmU (869 nucleotides). Sequences were aligned using MUSCLE v3.6 (28) and were edited using Se-Al v2.0a9 (http://tree.bio.ed.ac.uk/software/seal/), and maximum likelihood trees and branch lengths were obtained using GARLI v0.951 (http://www.molecularevolution.org/software/garli/). Lacking external calibration dates (i.e., a fossil record), we employed a relatively simple distance estimation method of Kumar et al. (44): t = d/2r. The divergence time (t) at nodes was estimated from the nucleotide distance (d) calculated by adding the branch lengths obtained in maximum likelihood trees, and a mutation rate (r) of 5 x 10–10 mutation/bp/generation (26) and a generation time of 1 h were assumed. To take into consideration the slow in vitro and in planta growth of C. michiganensis subsp. sepedonicus, calculations were also made with an assumed generation time of 0.5 generation per day (5, 22). Divergence times were estimated for each gene individually, and standard deviations were calculated using individual estimates.
Nucleotide sequence accession numbers. The sequence and annotation of strain ATCC 33113 have been deposited in the EMBL database under the following accession numbers: chromosome, AM849034; plasmid pCS1, AM849035; and plasmid pCSL1, AM849036.
|
|
|---|
![]() View larger version (63K): [in a new window] |
FIG. 1. Circular diagram of the chromosome of C. michiganensis subsp. sepedonicus. The circles provide the following information (from the outermost circle to the innermost circle): circles 1 and 2, all CDSs (transcribed clockwise and anticlockwise) (dark blue, pathogenicity and adaptation; black, energy metabolism; red, information transfer; dark green, surface associated; cyan, degradation of large molecules; magenta, degradation of small molecules; yellow, central and intermediary metabolism; pale green, unknown; pale blue, regulators; orange, conserved hypothetical; brown, pseudogenes; pink, phage and IS elements; gray, miscellaneous); circle 3, putative laterally acquired CDSs; circle 4, CDSs not present in C. michiganensis subsp. michiganensis or L. xyli; circle 5, pseudogenes; circle 6, IS element transposases; circle 7, G+C content (window size, 10,000 bp); and circle 8, GC deviation (G – C/G + C; window size, 10,000 bp).
|
|
View this table: [in a new window] |
TABLE 1. General characteristics of the C. michiganensis subsp. sepedonicus ATCC 33113 genome
|
Chromosomal rearrangements and IS elements. The recent evolutionary pathway followed by C. michiganensis subsp. sepedonicus (see above) appears to have led to the expansion of IS elements. The C. michiganensis subsp. sepedonicus genome contains 106 IS elements, which fall into three groups, 71 IS1121 elements (68 chromosomal, 2 on pCS1, and 1 on pCSL1), 25 ISCmi2 elements (24 chromosomal and 1 on pCSL1), and nine ISCmi3 elements, as well as one element that appears to be a chimera between IS1121 and ISCmi2. IS1121 and ISCmi2 are members of the IS481 family, and ISCmi3 is related to the IS30 family (Table 1). IS1121 is widespread among strains of C. michiganensis subsp. sepedonicus (54). ISCmi2 is related to IS1122, which is repeated many times in the genome of the alfalfa pathogen Clavibacter michiganensis subsp. insidiosus (67). In terms of chromosomal coordinates, the IS elements appear to be randomly distributed. However, the majority are located in non-protein-encoding DNA, and only five are inserted directly into CDSs where they are likely to have caused loss of function (see Table S1 in the supplemental material). Alignment with the chromosome of C. michiganensis subsp. michiganensis revealed high levels of sequence identity, typically 90 to 100% (median, 95%) DNA identity for orthologous genes, and extensive rearrangements in C. michiganensis subsp. sepedonicus, mostly associated with recombination between IS elements in the genome of C. michiganensis subsp. sepedonicus (Fig. 2). Fifty-nine IS elements in C. michiganensis subsp. sepedonicus lie at the boundary of a region of synteny between C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis and are likely to have been the foci of large-scale genomic recombination events. In three cases an IS element appears to have inserted within a CDS and a subsequent recombination has moved the two parts of the CDS to distant locations on the chromosome (see Table S1 in the supplemental material). The chromosome of the related actinomycete phytopathogen L. xyli subsp. xyli also contains large numbers of IS elements which appear to have generated extensive rearrangements (Fig. 2) (56). Although C. michiganensis subsp. sepedonicus IS1121 and ISCmi3 elements are related to IS elements found in L. xyli subsp. xyli, the levels of sequence identity are low and there are no syntenic/orthologous occurrences, indicating that all IS insertions occurred independently and after divergence of these species. Furthermore, the rarity of IS elements in the chromosome of C. michiganensis subsp. michiganensis suggests that the IS expansion in C. michiganensis subsp. sepedonicus was specific to this subspecies and may have coincided with or closely followed establishment of IS elements in the C. michiganensis subsp. sepedonicus genome. The high levels of sequence identity between C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis suggest that subspeciation was relatively recent. The average levels of sequence identity between members of the IS element families (IS1121, 99.8%; ISCmi2, 97.9%; ISCmi3, 99.9%), along with the intact inverted repeat sequences, support the hypothesis that the acquisition and expansion of IS elements followed subspeciation and that the IS elements may still be functional (79). The hypothesis that the expansion of IS elements was a relatively recent event is also supported by the estimated time of divergence between C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis, given the relatively long generation times and corresponding limited number of generations since divergence. Furthermore, the genomic variability among strains of C. michiganensis subsp. sepedonicus is low, as is the local variation associated with IS1121 (13, 30, 53).
![]() View larger version (93K): [in a new window] |
FIG. 2. Alignment of chromosomes of C. michiganensis subsp. sepedonicus (Cms), C. michiganensis subsp. michiganensis (Cmm), and L. xyli subsp. xyli (Lxx). The diagram shows forward and reverse DNA strands (gray bars), and base coordinates with the positions of pat-1 homologues are indicated by vertical black lines. Similar regions that are more than 1,000 bases long are indicated by red (colinear) and blue (inverted) lines.
|
The distribution of pseudogenes (and other possibly inactivated loci) across functional categories shows that genes for transport and degradation of carbohydrates, regulation, and specialized functions related to pathogenicity and adaptation are overrepresented (Fig. 3; see Table S1 in the supplemental material). Many pseudogenes encode enzymes likely to affect the ability of C. michiganensis subsp. sepedonicus to utilize carbohydrate nutrients, including cellulase CelB (encoded by CMS0045; cellulose utilization); glycerol kinase (encoded by CMS0701; glycerol utilization); N-acetylglucosamine-6- phosphate deacetylase (encoded by CMS0914; N-acetylglucosamine utilization, essential for growth of Mycobacterium tuberculosis) (68); three glycosyl hydrolases, one of which appears to be secreted (encoded by CMS0959, CMS1694, and CMS1700; carbohydrate degradation and utilization); glycogen-debranching enzyme TreX (encoded by CMS1527; glycogen utilization and trehalose synthesis); and tandem polysaccharide hydrolases (encoded by CMS2666 and CMS2667; carbohydrate degradation and utilization). Cellulase is an important determinant in C. michiganensis pathogenicity, so the disruption of celB is intriguing, although it should be noted that the plasmid-borne cellulase gene (celA), which is known to be involved in virulence in C. michiganensis subsp. michiganensis, appears to be intact (32, 45, 58). A gene encoding a 2,5-diketo-D-gluconic acid reductase is also disrupted. This enzyme is involved in the utilization of ketogluconates as a source of carbon and energy. It is also involved in the biosynthesis of ascorbate and has attracted much attention for its potential use in production of vitamin C.
![]() View larger version (28K): [in a new window] |
FIG. 3. Bar chart of functional classes of all CDSs (open bars) and pseudogenes (shaded bars). Note the overrepresentation of pseudogenes in CDSs encoding transport and binding proteins, macromolecule degradation, small-molecule degradation, pathogenicity, and regulation.
|
Genes for extracellular polysaccharide (EPS) biosynthesis have also been affected by genome decay in C. michiganensis subsp. sepedonicus. The C. michiganensis subsp. michiganensis genome has four gene clusters for production of EPS, and orthologous gene clusters are present in C. michiganensis subsp. sepedonicus. IS insertion in the gene for the polysaccharide polymerase Wzy (CMS2263) is likely to ablate the function of the EPS biosynthesis operon to which it belongs, and other mutations are likely to have inactivated at least one, and possibly two, of the remaining three clusters. The loss of the ability to produce an EPS coat suggests that C. michiganensis subsp. sepedonicus occupies a niche in which the production of such a coat is no longer advantageous or essential. It is tempting to speculate that IS insertions may play a role in generating naturally occurring mucoid and nonmucoid variants of C. michiganensis subsp. sepedonicus or the reported change from mucoid to nonmucoid morphology triggered by heat or nutrient stress (4, 41). Aromatic amino acid biosynthesis may be affected by an IS element insertion directly upstream of CMS2042 (aroQ; encoding 3-hydroquinate dehydrogenase [EC 4.2.1.10]).
The high proportion of pseudogenes in regulatory genes is expected to have had cascade effects on the global transcriptome and proteome and is likely to amplify the differences in the phenotypes of C. michiganensis subsp. michiganensis and C. michiganensis subsp. sepedonicus (52). Ten regulators, representing 10% of all pseudogenes and 5% of all regulators, are disrupted.
Agar plate-grown colonies of C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis can be distinguished by color; the colonies are white or faint yellow and yellow, respectively (21). The pigmentation is thought to be due to production of carotenoids. Both genomes have a complete carotenoid biosynthesis gene cluster (CMS2604 and CMS2609 and CMM2884 to CMM2889) with no apparent pseudogenes to account for the phenotypic difference. One possible explanation may be the presence of an extra pair of genes for carotenoid cyclases (CMS0965 and CMS0966), which may modulate the final product in C. michiganensis subsp. sepedonicus. Other unknown regulatory differences may also be important.
Three-way coding sequence comparison and laterally acquired DNA. The predicted proteomes of C. michiganensis subsp. sepedonicus, C. michiganensis subsp. michiganensis, and L. xyli subsp. xyli were compared by three-way reciprocal FASTA analysis to assess the numbers of orthologous and unique CDSs (Fig. 4; see Table S3 in the supplemental material). The genome sizes and numbers of CDSs of C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis are similar, so although the C. michiganensis subsp. sepedonicus genome appears to have undergone some decay, there does not appear to have been genome reduction in C. michiganensis subsp. sepedonicus compared to C. michiganensis subsp. michiganensis. However, given that these organisms are considered subspecies of the same species, they have surprisingly large numbers of subspecies-specific CDSs (12 to 16%), suggesting that there may have been significant differential gene acquisition or loss since divergence from the common ancestor. These proportions are equivalent to those seen in comparisons of Escherichia coli and Salmonella enterica genomes, where many of the unique genes are associated with horizontally acquired islands or prophages (81). Clearly, any subspecies-specific CDSs may be related to host-specific recognition, so it is notable that for both C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis these CDSs include several CDSs encoding surface-exposed or secreted proteins and proteins involved in production and modification of surface polysaccharides (see Tables S3 and S7 in the supplemental material).
![]() View larger version (12K): [in a new window] |
FIG. 4. Venn diagram showing numbers of shared and unique genes in the genomes of C. michiganensis subsp. sepedonicus (Cms), C. michiganensis subsp. michiganensis (Cmm), and L. xyli subsp. xyli (Lxx). The red numbers do not include the number of IS element transposases.
|
The C. michiganensis subsp. michiganensis genome contains a large island (130 kb) known as the chp/tom region, which encodes known and putative virulence determinants (see the accompanying paper). The C. michiganensis subsp. sepedonicus genome does not contain an equivalent single large island, although it does share much of the gene content (Fig. 2). One C. michiganensis subsp. sepedonicus island (CmsPI) has significant synteny with the tom region, suggesting that either a mobile element integrated into the common ancestral genome and has since diverged or related mobile genetic elements have been independently introduced since divergence of the two lineages. Other regions of the C. michiganensis subsp. sepedonicus genome have significant matches with the chp/tom region CDSs but no other part of the C. michiganensis subsp. michiganensis genome. These regions include the divergently transcribed gene pair CMS2233 and CMS2234, which encode a putative exported protein and putative secreted pectate lyase, respectively. It is also notable that C. michiganensis subsp. michiganensis chromosomal pat-1 homologous genes (considered to be potential virulence genes) are located exclusively in the chp/tom region, while in C. michiganensis subsp. sepedonicus they are scattered throughout the chromosome (Fig. 2). Although pat-1 genes have diverse sequences, making orthologue assignment impossible, it seems feasible that the C. michiganensis subsp. sepedonicus-C. michiganensis subsp. michiganensis common ancestral genome contained an island analogous to the chp/tom region which has remained largely intact in C. michiganensis subsp. michiganensis but has been dissipated throughout the C. michiganensis subsp. sepedonicus genome, possibly in association with IS-related recombination events. All but one of the C. michiganensis subsp. sepedonicus pat-1 genes is located within four CDSs of an IS element. An alternative explanation for the differential distribution of pat-1 homologues may be that pat-1 genes have been acquired on multiple occasions as discrete insertions. Indeed, of the eight pat-1 genes present on the C. michiganensis subsp. sepedonicus chromosome, six are present as pairs on three separate islands, one has inserted into and disrupted CDS CMS2908, and one (CMS2837) has inserted between two CDSs. There are no obvious repeat sequences flanking putative pat-1 insertions, and the mechanism of insertion is unclear.
Genes present in any of the three genomes may have been present in the common ancestor; therefore, genes present in C. michiganensis subsp. michiganensis but not present in C. michiganensis subsp. sepedonicus may have been lost from C. michiganensis subsp. sepedonicus, although clearly they could also have been acquired by C. michiganensis subsp. michiganensis. Accepting this caveat, it is interesting that the genes present in C. michiganensis subsp. michiganensis but not present in C. michiganensis subsp. sepedonicus have a distribution of functions similar to that seen for C. michiganensis subsp. sepedonicus pseudogenes, with a high frequency of catabolic functions such as degradation and transport of carbohydrates and peptides (see Table S5 in the supplemental material). Gene loss may therefore have been under the same selective influences as pseudogene formation.
Pathogenicity determinants and host adaptation. The relative genetic intractability of the Clavibacter species has meant that there has been little correlation of genes with pathogenicity. The major candidate functions so far are exopolysaccharides and secreted enzyme activities, such as endocellulase, xylanase, polygalacturonase, and serine protease. The clearest demonstrations of Clavibacter pathogenicity genes have been for the cellulase-encoding celA gene and the serine protease-encoding pat-1 gene, both carried on plasmids in C. michiganensis subsp. michiganensis (32). The C. michiganensis subsp. michiganensis celA gene is on plasmid pCM1, and an intact orthologue is present on C. michiganensis subsp. sepedonicus plasmid pCS1. However, a second cellulase gene, celB (CMS0045 and CMM2443), present on the chromosome of both subspecies, has been inactivated in C. michiganensis subsp. sepedonicus by a nonsense mutation at codon 192. The C. michiganensis subsp. michiganensis pat-1 gene is present on plasmid pCM2. Homologues of pat-1 are referred to as chp (for chromosomal homologue of pat-1) or php (for plasmid homologue of pat-1). For a phylogenetic analysis of pat-1 homologues in C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis, see the accompanying paper. C. michiganensis subsp. sepedonicus has 11 pat-1 homologues; 8 are chromosomal, 2 are on plasmid pCS1, and 1 is on pCSL1 (see Table S6 in the supplemental material). Of the eight pat-1 homologues on the C. michiganensis subsp. sepedonicus chromosome, six appear to be intact with N-terminal signal sequences, one lacks a signal sequence (CMS1260), and one has a frameshift mutation (CMS0980). Alignments of the C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis pat-1 homologues suggest that there are distinct lineages within pat-1 homologues, and the lineages are generally analogous in C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis, with C. michiganensis subsp. sepedonicus chp-3, chp-4, and chp-5 representing a lineage distinct from the C. michiganensis subsp. michiganensis lineage (see the accompanying paper). Only C. michiganensis subsp. sepedonicus chp-7 and php-2 contain the LPGSG sortase signal for cell wall anchoring of the protein. C. michiganensis subsp. sepedonicus chp-7 is most like the C. michiganensis subsp. michiganensis pat-1 gene, with 82% amino acid identity. In comparison, C. michiganensis subsp. michiganensis has three pat-1 homologues (including pat-1 itself) on pCM2, and seven homologues are clustered within the C. michiganensis subsp. michiganensis chromosomal chp/tom region. All 10 homologues have an N-terminal signal sequence, but only 7 appear to be intact and 3 of the chromosomal genes contain frameshift mutations.
The C. michiganensis subsp. michiganensis tomA subregion of the chp/tom region contains a gene, tomA (CMM0090), which encodes an exported endo-1,4-beta-glycosidase involved in the detoxification of the saponin
-tomatine, a plant defense and antimicrobial compound produced by tomato and other members of the Solanaceae (38). TomA is a member of a family of glycoside hydrolases which match the Pfam domain model PF00331. Bacterial proteins matching this domain model generally have a single domain and an N-terminal signal sequence, and characterized examples are involved with plant-specific glycans, primarily xylan (2, 33). They are relatively rare, and the encoding genes tend to occur only once in environmental bacteria likely to be associated with plants or algae. The C. michiganensis subsp. sepedonicus genome includes one CDS (CMS0087) with a match to PF00331. Although the similarity between CMS0087 and tomA from C. michiganensis subsp. michiganensis is low (23.9% identity and 41.8% similarity over 201 residues), they are both the only members of the PF00331 group in their respective genomes, and dot plot alignment showed that they are clearly related (Fig. 5). This suggests that they may have analogous, if not orthologous, functions, and CMS0087 may be involved in degradation of potato-produced glycoalkaloids present during infection with C. michiganensis subsp. sepedonicus (65). However, CMS0087 is likely to be disabled, and its genomic status is notable. It lies directly upstream of an IS element (CMS0086) which appears to have truncated the 3' end of the gene. The 5' region may also have been lost, as it does not encode the N-terminal signal sequence present in similar proteins (34). CMS0086 and CMS0087 lie within an exopolysaccharide biosynthesis gene cluster (EPS2), and alignment with the C. michiganensis subsp. michiganensis genome showed that the current genomic arrangement is likely to be the result of recombination between IS elements. The potential loss of the ability to degrade glycoalkaloids in C. michiganensis subsp. sepedonicus may indicate that either this organism does not encounter such growth inhibitors in its current niche or it has adapted to a slow-growth lifestyle to avoid such plant defense mechanisms.
![]() View larger version (15K): [in a new window] |
FIG. 5. Dot plot amino acid alignment of the CMS0087 with CMM0090 (TomA) proteins. The axes show the amino acid residue, and the dots and lines indicate amino acid identity between the protein sequences. The plot was generated using Dotter (71). The Karlin/Altschul statistics for the sequences and score matrix are as follows: K = 0.129; Lambda = 0.301; expected MSP score in a 100 x 100 matrix, 23.817; expected residue score in MSP, 1.186; expected MSP length, 20.
|
There are also several genes with the potential to encode resistance to antibiotics, such as CMS0149 (encoding a putative aminoglycoside phosphotransferase), CMS0172 (encoding a putative VanZ-like membrane protein) (3), CMS0216 (encoding a putative cytidine deaminase with 55% amino acid identity over the full length to blasticidin S deaminase from Aspergillus terreus) (39), CMS0694 (encoding a putative macrolide resistance protein), CMS0862 (encoding a putative multidrug efflux protein), CMS961 (encoding a putative drug efflux protein), CMS1440 (encoding a putative toxin resistance acetyltransferase), CMS1893 (encoding a putative macrolide phosphotransferase), CMS2286 (encoding a putative resistance protein), CMS2306 (encoding a putative dimethyladenosine transferase), CMS2483 (encoding a putative drug efflux protein), CMS2903 (encoding a putative drug resistance dioxygenase), CMS2936 (encoding a putative multiple-antimicrobial-agent extrusion protein), and CMS3023 and CMS3049 (encoding a putative beta-lactamase). C. michiganensis subsp. sepedonicus growth in culture is often inhibited by the presence of other microbes in plant samples, making disease diagnosis by pathogen cultivation especially challenging and necessitating other, less-culture-dependent approaches (69, 76, 77). Thus, finding several antibiotic resistance genes was unexpected.
Exopolysaccharide production. The C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis chromosomes each contain four gene clusters for Wzx/Wzy-dependent biosynthesis of exported polysaccharide (designated EPS1 to EPS4 according to their order in C. michiganensis subsp. michiganensis [see Table S7 in the supplemental material]). Such gene clusters generally encode glycosyl transferases necessary for linking of sugars to form the oligosaccharide repeat unit, a Wzx flippase required for transport of the repeat unit across the cytoplasmic membrane, and a Wzy polymerase responsible for linking repeat units to form the polysaccharide. The gene clusters are broadly syntenic, and there are some notable differences between the subspecies. All four EPS clusters in C. michiganensis subsp. michiganensis appear to be intact and are likely to be functional. In C. michiganensis subsp. sepedonicus three of the four clusters appear to have been disrupted, and at least two are likely to have been inactivated.
The C. michiganensis subsp. sepedonicus EPS1 repeat unit polymerase gene (wzy1 [CMS2263]) has been interrupted, and probably inactivated, by insertion of an IS1121 element. The divergence in Wzy sequences suggests that there are differences in substrate specificity, making it unlikely that the polymerase encoded by one of the other three EPS clusters could compensate. The mutation in wzy1 is likely to ablate the function of the entire gene cluster due to the central role played by Wzy. Also, EPS2 in C. michiganensis subsp. sepedonicus has been grossly disrupted by recombination between IS elements. The central portion of the gene cluster, including genes encoding two glycosyl transferases (CMS2390 and CMS2391), Wzx flippase (CMS2389), and a candidate Wzy polymerase (CMS2400), has been translocated to a distant region of the genome flanked by a pair of IS1121 elements and replaced by another IS1121 and a CDS (CMS0087) encoding a putative glycoside hydrolase. This rearrangement is likely to have disrupted the regulation of the gene cluster, probably rendering it nonfunctional. Further disruption of the C. michiganensis subsp. sepedonicus EPS2 gene cluster is evident in CMS0084, which has a nonsense mutation at codon 871 and deletions in the 3' region compared to the equivalent CDS in C. michiganensis subsp. michiganensis.
EPS3 and EPS4 seem to be largely intact and possibly functional in C. michiganensis subsp. sepedonicus, although an IS1121 element in the 3' end of the galE gene in EPS3 may disrupt the function of the protein product and may have polar effects on the expression of the rest of the gene cluster. Comparison of the EPS4 clusters of C. michiganensis subsp. sepedonicus and C. michiganensis subsp. michiganensis showed that although they are clearly related, they have different complements of genes in the central region, including the genes for Wzx flippase, Wzy polymerase, and alternative glycosyl transferases, indicating that they are likely to produce grossly different polysaccharides. Altogether, it appears that C. michiganensis subsp. sepedonicus has lost at least half of its ability to produce EPS due to genome degradation. Such polysaccharides are located at the cell surface, and their involvement in interactions with the environment and particularly host organisms has been well documented. This correlates well with the notion that C. michiganensis subsp. sepedonicus recently adapted to a narrowed niche where such interactions are less variable. Additional genetic studies are needed to demonstrate the specific gene set required for EPS biosynthesis and to reconcile reports on sugar composition and the contribution of EPS to virulence in C. michiganensis subsp. sepedonicus (37, 82).
Conclusions. While other members of the species C. michiganensis can grow in a variety of environmental and plant-associated niches, C. michiganensis subsp. sepedonicus is almost entirely restricted to the vascular system of its host plant. Analysis of the C. michiganensis subsp. sepedonicus genome revealed numerous correlations with this endophytic lifestyle and suggested that that there was recent specialization for life in this restricted niche and that the organism has a reduced ability to exploit previously occupied complex niches outside the plant.
Tolerance to generation of pseudogenes, expansion of IS elements, genome rearrangements, and the associated disruption of operons suggest that there was relaxation of selective pressure or an increase in fixation of mutations due to genetic drift during the recent evolutionary history of C. michiganensis subsp. sepedonicus. These events may have been due to passage through an evolutionary bottleneck associated with niche acquisition and adaptation in C. michiganensis subsp. sepedonicus (62, 63). The bottleneck may have involved a change from a plant-associated generalist lifestyle, where the organism thrived in soil and on various plant surfaces and was able to opportunistically exploit plant wounds, to a lifestyle where the bacterium could live successfully within the plant vascular system without a need for movement outside that niche. This change may have been triggered by loss or acquisition of a function leading to enhanced success within the newly acquired niche. One possible trigger is the observed disruption of C. michiganensis subsp. sepedonicus's ability to produce surface polysaccharides. EPS is considered a virulence determinant in many bacterial plant pathogens because of its central role in wilt induction, host colonization, and biofilm formation, and it is thus likely to be important in physical interactions with the host (1, 42). It is feasible that alteration of C. michiganensis subsp. sepedonicus surface polysaccharides or some other genetic change altered the host interaction such that reducing the host response to invasion by the pathogen enhanced survival within the vascular system. In this regard, it is interesting that C. michiganensis subsp. sepedonicus appears to have lost the ability to produce a plant defense detoxification enzyme.
Adaptation to the narrower niche of the vascular system would have allowed disruption of genes whose products are no longer required. Bacteria on leaf surfaces must adapt to an ever-changing set of environmental signals by modulating their own gene expression (49). The abundance of pseudogenes in regulatory genes suggests that C. michiganensis subsp. sepedonicus lost some of its capacity to adapt to such harsh or varied environments. Pseudogene functions indicate that there was a reduction in nutrient diversity, allowing loss of catabolic enzymes and nutrient transporters. This generally reduced constraint on genome disruption would have also allowed the multiplication of IS elements and associated chromosomal rearrangements. The fact that pseudogenes have not been removed from the genome suggests that the bottleneck and subsequent events were relatively recent. This suggestion is supported by the intact status of the IS elements and the fact that the genome size is not reduced.
This project was supported by Initiative for Future Agriculture and Food Systems grant 2001-52100-11428 from the USDA Cooperative State Research, Education, and Extension Service, the Colorado Experiment Station, and the Minnesota Experiment Station.
Published ahead of print on 11 January 2008. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»