Previous Article | Next Article ![]()
Journal of Bacteriology, February 2009, p. 1180-1190, Vol. 191, No. 4
0021-9193/09/$08.00+0 doi:10.1128/JB.01058-08
Copyright © 2009, American Society for Microbiology. All Rights Reserved.

Department of Microbiology and Infection Control Science, Juntendo University, 2-1-1 Hongo, Bunkyo, Tokyo 113-8421, Japan,1 National Institute for Basic Biology, National Institutes of Natural Sciences, Nishigonaka 38, Myodaiji, Okazaki 444-8585, Japan,2 Wellcome Trust Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom3
Received 30 July 2008/ Accepted 3 December 2008
|
|
|---|
|
|
|---|
M. caseolyticus, previously classified as Staphylococcus caseolyticus (34), was reportedly isolated from cow's milk, bovine organs and food-processing factories. Phylogenetic relationship analysis based on 16S rRNA sequences revealed that, in addition to Staphylococcus, Bacillus species are also closely related to M. caseolyticus. The morphology of this organism is globular; however, the cell size is larger than for staphylococci.
Since the genomes of macrococci are much smaller than those of staphylococci (34), we considered that knowledge of the macrococcal genome would be important in elucidating the evolution of staphylococci along with its acquisition of pathogenic potential to humans. We describe here a unique metabolic feature of M. caseolyticus as inscribed in its genome and the discovery of a primordial form of a mecA gene complex, the causative genetic determinant of the notorious hospital pathogen methicillin-resistant S. aureus (MRSA).
|
|
|---|
Shotgun sequencing and contig assembly.
The whole-genome sequencing was performed basically as described previously (1, 2, 40) with slight modifications. Briefly, total DNA was fragmented to 2 and 6 kb in length to be cloned into pUC118 plasmid vector for the construction of a genomic library, followed by sequencing. Gaps unable to be sequenced were closed with primer walking by reading 30 to 40 kb of the total DNA library cloned into a fosmid vector pCC1FOS (Epicentre Biotechnologies, Madison, WI). The sequencing was carried out by Hitachi High-Tech Fielding Co. (Tokyo, Japan) and Takara Bio, Inc. (Otsu, Japan). The total length of the shotgun sequence read was
13 Mbp, or the redundancy was 6 since the genome size was found to be 2.1 Mbp. Whole-genome sequence was assembled from each shotgun sequence read by using Arachne 2 (15). The accuracy of the assembly was confirmed by a physical mapping using visualization of restriction digestion pattern of the chromosomal DNA (OpGen, Madison, WI). We have deposited the whole-genome sequence of M. caseolyticus strain JCSC5402 to the DNA Database of Japan, with accession numbers AP009484 (chromosome) and AP009485 through AP009492 (plasmids pMCCL1 to 8), respectively.
Determination of ORFs, structural RNA, insertion sequences, phages, and plasmids.
Determination of open reading frames (ORFs), structural RNAs, and annotations were performed as described previously (2). Briefly, ORF candidates were initially extracted with the genome analysis-oriented In Silico molecular cloning software (v1.5; Insilico Biology, Yokohama, Japan), followed by individual inspection of each ORF candidate by manual searching for a start codon on the basis of ribosome-binding motifs, and then the ORFs coding for proteins were determined. tRNA and tmRNA genes were identified by using tRNAscan-SE (24) and by a web-based software (http://www.indiana.edu/
tmrna/), respectively. An illustration of the GC contents on a genome map and the G-C skew [(G – C)/(G + C)] were drawn by using In Silico molecular cloning software.
Insertion sequences were also found by the web-based software IS Finder (http://www-is.biotoul.fr/is.html). Insertion of prophage sequences was identified by the presence of typical phage gene structures, including genes for phage integrases, terminases, capsid, tail, and phage-related amidases (4). Plasmids were recognized since the sequence redundancy increased more rapidly than other regions upon shotgun sequencing, and their sequences were circularized with relatively small numbers of the shotgun reads upon assembly. Plasmids were also recognizable by sequences with genes for plasmid replication proteins.
Cloning of mecAm gene from M. caseolyticus JCSC5402 plasmid pMCCL2. Whole genome was isolated from M. caseolyticus JCSC5402 as described previously (2). By using this as a template, PCR was performed with the primers 5'-CCCGGATCCAGTACAATAACCCAAGGAA-3' and 5'-CCCGGATCCGGGAACAATAGTTCCTCATT-3', which allow amplification of mecAm structural gene and its promoter region. After digesting the PCR product with a restriction endonuclease BamHI, the Macrococcus mecA (mecAm)-containing fragment was cloned into the corresponding site of E. coli-S. aureus shuttle vector pYT3 (20), yielding pYMMA.
Elimination of pMCCL2 from M. caseolyticus JCSC5402. A 10–6 dilution of an overnight culture of M. caseolyticus JCSC5402 in antibiotic-free tryptic soy broth medium was grown overnight again and repeated this for six times. A 0.1-ml portion of the final 10–6-diluted culture was then plated onto drug-free tryptic soy agar. A few hundred of the grown colonies were replica plated onto both drug-free and oxacillin-containing (10 µg/ml) tryptic soy agar plates, followed by the isolation of oxacillin-susceptible colonies. A β-lactamase test using nitrocephin was performed, and negative colonies were tested to determine whether they possessed pMCCL2 by treatment of the cells with S1 nuclease followed by analysis by pulsed-field gel electrophoresis (5) compared to native JCSC5402. Loss of pMCCL2 was also confirmed by pulsed-field gel electrophoresis-based Southern hybridization using the PCR-amplified mecAm gene fragment as a probe.
Genomes used for comparative analysis and chromosome structure. Genome sequences used for comparative analysis and chromosomal structure analysis are listed in Table 1.
|
View this table: [in a new window] |
TABLE 1. Chromosomal sequences of various species used in this study
|
Chromosomal structure analysis, phylogenetic analysis, and the tree diagram display. Orthologous genes among various species were aligned by using a clustering algorithm named DomClust (42, 43), which is based on hierarchical clustering procedure for constructing orthologous groups at a domain level, and then constructed an alignment of syntenically conserved regions designated "core structure," using the CoreAligner program (44). This procedure allows identifying the order of orthologous groups that retains the conserved gene orders to the greatest possible extent in each genome. Among the core genes extracted by the CoreAligner, those that were conserved in all of the tested genomes in one-to-one correspondence were extracted (297 genes). Multiple sequence alignments were generated by CLUSTAL W program (41). From these alignments, the conserved aligned blocks suitable for phylogenetic analysis were then extracted by Gblocks program (7). The resulting sequences were concatenated and were subjected to the CLUSTAL W program to construct a phylogenetic tree using the neighbor-joining method (33). The phylogenetic tree was drawn using the FigTree program (http://tree.bio.ed.ac.uk/software/figtree/).
|
|
|---|
|
View this table: [in a new window] |
TABLE 2. Comparison of the M. caseolyticus chromosome to those of S. aureus and B. subtilis
|
![]() View larger version (48K): [in a new window] |
FIG. 1. Circular representation of the chromosome of M. caseolyticus JCSC5402. The first (outermost) circle shows the nucleotide position in base pairs. Red arcs indicate prophages, whereas yellow shows the insertion sequences. The second circle shows ORFs oriented in the forward direction, whereas the third circle indicates those oriented in the reverse direction. Green bars in the second and the third circles represent unique ORFs of M. caseolyticus, whereas blue bars show the ORFs that are the most similar to staphylococcal genes. Orange bars indicate the first-hit Bacillus ORFs, and red bars indicate those whose homologs are not found in staphylococcal genomes and yet first-hit Bacillus ORFs. Black bars indicate ORFs whose most homologous genes were not found in either Bacillus or Staphylococcus. The fourth and fifth circles show genes for transfer RNAs and rRNAs, respectively. The sixth circle represents the regions with high GC content ( 50%; purple) and low GC content (blue). The window size and shift increment for GC content were 5 kbp and 0.1 kb, respectively. The seventh circle shows the G-C skew with positive (yellow) and negative (blue) values.
|
MCCL1 to -3) were identified in the chromosome of strain JCSC5402. Of 113 ORFs carried by the phages, 33 are unique to M. caseolyticus, whereas 45 are similar to staphylococcal phage proteins, 23 are similar to those of Bacillus spp., and 12 are similar to those of other species such as Enterococcus, Streptococcus, Listeria, Lactococcus, and Mycoplasma spp. This may mean that the prophages of M. caseolyticus are mobile across great ranges of gram-positive bacterial genera. Three insertion sequences (ISs) were found in the JCSC5402 genome. The number is much smaller than that in S. haemolyticus strain JCSC1435, in which the insertion sequences cause frequent genome rearrangements (40). On the other hand, the strain JCSC5402 harbors as many as eight plasmids (pMCCL1 to -8) that range in size from 2 to 80 kb. Each of the plasmids carries a unique replication unit; therefore, the replications of the plasmids seem to be compatible with one another. None of the plasmids except pMCCL2 harbor known antibiotic resistance genes. Some genes were found to be involved in DNA modification and plasmid mobilization; however, most of other genes remain functionally unknown. Therefore, how they are maintained in the cells and what benefit each plasmid confers to the strain are unknown. Genes for replication proteins (rep) of the eight plasmids are most homologous to those of S. aureus (pMCCL3, -4, and -5), Bacillus thuringiensis (pMCCL1 and -2), Pediococcus acidilactici (pMCCL6), Tetragenococcus halophilus (pMCCL7), or Enterococcus faecalis (pMCCL8). The taxonomic distribution of the most similar genes for the 98 ORFs in the eight plasmids is as follows: 31 ORFs, Staphylococcaceae; 12 ORFs, Enterococcaceae; 11 ORFs, Bacillaceae; 3 ORFs, Lactobacillaceae; 3 ORFs, Listeriaceae; 4 ORFs, gram-positive bacterial families; 3 ORFs, mycoplasmas; and 3 ORFs, gram-negative bacterial families, in addition to 28 unique genes to M. caseolyticus strain JCSC5402. This suggests that the macrococcal plasmids are also omnipresent mobile elements in large varieties of bacteria belonging to the phylum Firmicutes (gram-positive bacteria) and frequently recombine with other plasmids.
Figure 2 illustrates similarity plot data of the chromosome of M. caseolyticus strain JCSC5402 with that of S. aureus strain MW2 (top panel) or B. subtilis strain 168 (bottom panel). Dots are rarely found in the MW2 chromosomal regions of ca. 0 to 500 kb and ca. 2,250 to 2,830 kb (Fig. 2, top panel), indicating that the regions contain genes whose orthologs are absent from the chromosome region of JCSC5402. The region corresponds to the "oriC environ" unique in staphylococcal species. The oriC environ contains many exogenous genes acquired across species barrier and is postulated to contribute to the evolution of staphylococcal species (40). The similarities to the macrococcal chromosome are distributed in the other regions of MW2 (Fig. 2, top panel). There is an inversion in the 2,000-kb region in the M. caseolyticus genome that corresponds to approximately 500 to 600 kb in the S. aureus MW2 genome. Since this region includes ribosomal and tRNA genes at the both ends, an inversion may take place due to a homologous recombination. Figure 2 (bottom panel) shows that the similarity with B. subtilis chromosome is much smaller than with S. aureus chromosome, but the homologous sequences are diffusely distributed throughout the B. subtilis genome. These data reconfirm that the genes in the M. caseolyticus chromosome are more similar to those in S. aureus chromosomes than to those in B. subtilis chromosomes.
![]() View larger version (8K): [in a new window] |
FIG. 2. Genome alignment between M. caseolyticus JCSC5402 and S. aureus MW2 (top) or between M. caseolyticus JCSC5402 and B. subtilis 168 (bottom). Red dots indicate homologous regions that were drawn by using In Silico molecular cloning software. The top panel includes genomic island locations and their names in either purple (unique to M. caseolyticus JCSC5402, vertical scripts) or sky blue (unique to S. aureus, horizontal scripts). SCCmec is a determinant for methicillin resistance (17). The Sa and Saβ are pathogenicity islands present in all of the S. aureus genomes thus far sequenced, whereas the Sa3 element is found in some S. aureus strains, including MW2 (2, 3). Sa elements represent prophages.
|
|
View this table: [in a new window] |
TABLE 3. Functional classification of ORFs in the M. caseolyticus JCSC5402 chromosome and taxonomic distribution of BLAST best-hit entries to the ORFs
|
The starch-digesting amylase gene of Bacillus species is also present in M. caseolyticus but is absent from staphylococcal genomes. The reason why M. caseolyticus possesses amylase gene is unknown. Interestingly, we also identified glycogen biosynthesis genes in M. caseolyticus, suggesting that the organism could store glucose as its polymerized form. The ability of M. caseolyticus to polymerize glucose and digest starch implies its efficient utilization of glucose in an environment where glucose shortage is a great concern. Both amylase and glycogen biosynthesis genes are the most similar to those of Bacillus species.
Staphylococcal pathways absent in M. caseolyticus. Reflecting its shorter genome size, M. caseolyticus lacks many biochemical pathways that are present in staphylococci. We failed to find genes similar to siderophore biosynthesis genes in M. caseolyticus JCSC5402 genome. The organism also lacks most of the known iron uptake transporters found in staphylococci. These facts indicate that M. caseolyticus colonizes in the environments where the organism does not have to be "aggressive" upon iron import. Such environments should also help M. caseolyticus to live in the absence of many vitamins and amino acid biosynthesis enzymes. Strain JCSC5402 lacks the full sets of genes required for panthothenate and biotin synthesis. It also lacks glutamate, tryptophan, leucine, and histidine biosynthetic pathways, indicating that the strain requires these vitamins and amino acids for its growth. Sugar utilization ability is also limited: the strain lacks complete sets of sugar metabolism pathways for trehalose, maltose, mannitol, and lactose.
We also found that M. caseolyticus lacks all virulence determinants identified in S. aureus (3) except for one gene showing 54% amino acid sequence identity to hemolysin A of Bacillus cereus, explaining the fact that this organism has not been thus far recognized as a human pathogen.
Regulatory systems. Recent studies have revealed that most of the virulence genes in S. aureus are expressed under the control of regulatory systems known as global gene regulators. With regard to such systems, no homologs for agr (28) or sarA (8) regulator genes are present in M. caseolyticus, which is consistent with the fact that the organism does not have toxin genes whose expression is known to be under regulation of agr and sarA. Therefore, it is likely that S. aureus virulence genes were acquired together with their regulatory systems after the divergence of the genera Staphylococcus and Macrococcus.
Two-component systems. Eleven sets of two-component systems were identified in the M. caseolyticus chromosome. Ten of them are the most similar to those in staphylococcal species that are well conserved throughout the entire sequence length. Six of the ten were found in both genera Staphylococcus and Bacillus, whereas four are not present in Bacillus spp. The other set, a sensor kinase (MCCL1698) and a response regulator (MCCL1701), hit those of Clostridium tetani and Clostridium perfringens, respectively, as the most similar orthologs.
The orthologs for the two-component system vraSR that serves as an upregulator of cell wall synthesis in S. aureus (22) were found in M. caseolyticus JCSC5402 (vraR, MCCL1614; vraS, MCCL1615). The vraSR system is widely distributed in gram-positive bacteria, suggesting that the regulator system is an important system originated early in the evolutionary history of gram-positive bacteria, although they are nonessential genes for cell viability (22). The orthologs for S. aureus two-component systems, phoPR (MCCL1358-9), nreBC (MCCL0141-2), lysSR (MCCL0109-10), srrAB (MCCL1137-8), and arlSR (MCCL1065-6) that are involved in alkaline phosphatase synthesis, nitrate reduction system, control of the cell wall lysis rate, respiratory response, and sugar transport, respectively, are also present in M. caseolyticus. The physiological function(s) of other four two-component systems are unknown.
A recent study has revealed that a two-component system graSR of S. aureus is involved in the vancomycin resistance mechanism in vancomycin-intermediate S. aureus (27). The graSR system was not found in the M. caseolyticus genome, supporting the notion that the system is dispensable for the viability in S. aureus (27).
Antibiotic resistance. We initiated the present study aiming to pursue the dissemination of drug-resistant microbes through the stock farm products. Quite a few beta-lactam resistant bacteria were obtained (K. Kuwahara-Arai et al., unpublished data), in which M. caseolyticus strain JCSC5402 was unique in its multiple-drug resistance profile. Table 4 shows that strain JCSC5402 harbors various antibiotic resistance determinants including beta-lactams, aminoglycosides, and macrolides.
|
View this table: [in a new window] |
TABLE 4. MICs of various antibiotics for S. aureus N315 and N315/pYMMA (mecAm) and M. caseolyticus JCSC5402 and JCSC5402/ pMCCL2
|
pMCCL2), a finding consistent with the fact that the plasmid pMCCL2 also carries resistance genes to aminoglycosides (aacA) and macrolides (ermB). The results suggest that the plasmid pMCCL2 is responsible for acquisition of resistance to those antibiotics, although coelimination of other plasmids was not completely assessed, and therefore it is still possible that the other plasmids harbor uncharacterized antibiotic resistance genes. Nevertheless, it is highly likely that pMCCL2 is the only plasmid that confers antibiotic resistance to the macrococcal cells because we found no resistant genes in other plasmids.
![]() View larger version (25K): [in a new window] |
FIG. 3. Tandem location of beta-lactam resistant genes mecA (designated mecAm) and blaZ in M. caseolyticus JCSC5402/pMCCL2 compared to mec and bla complexes in S. aureus N315. The numbers in the purple shaded areas represent mutual amino acid sequence identities.
|
Relatedness of chromosomal structure among species and their phylogenic relationship. We identified orthologous genes and their arrangements in the chromosomes of M. caseolyticus and related bacteria to infer the evolutional relationship among these bacterial species. We initially identified orthologs among various species by using a clustering algorithm DomClust (42, 43), which is based on hierarchical clustering procedure for constructing orthologous groups at a domain level, and then constructed an alignment of the syntenically conserved regions, or "core structure," by using the CoreAligner program (44), which finds the order of orthologous groups that retains the conserved gene orders to the greatest possible extent in each genome. (Fig. 4). In Fig. 4, colored lines across species indicate locations of the genes in the resulting core structures, and the color gradation (from red through yellow to green) shows the location of each set of orthologs on the M. caseolyticus chromosome. This analysis can identify the conserved chromosomal structures among M. caseolyticus and the genera Staphylococcus and Bacillus, structures that are likely to have been vertically inherited from their common ancestor. Although several large-scale chromosome inversions are visible with certain species, it is apparent that the latter genera have extra chromosomal regions studded among the core structures where the genetic information useful for their own habitats and biological activity are stored.
![]() View larger version (60K): [in a new window] |
FIG. 4. Distribution of conserved ortholog groups among Bacillaceae and Staphylococcaceae families and Listeria innocua. After identifying common orthologs among various species, we constructed the conserved chromosomal structure ("core structure") on the basis of the consensus arrangement of the conserved orthologs. An ortholog group in the resulting core structure is indicated as a colored line across horizontal black line representing a chromosome. To simplify the figure, only universally conserved, one-to-one correspondence ortholog groups are shown. To visualize chromosomal rearrangement of the core structure, color gradation is assigned according to the location on the M. caseolyticus JCSC5402 chromosome from red to yellow to green. The replication origins (oriC) are located at the center.
|
![]() View larger version (25K): [in a new window] |
FIG. 5. Phylogenetic relationships among gram-positive bacteria. The tree diagram was generated by using the concatenated protein sequence of 297 orthologous core genes that are conserved in all of the tested genomes in one-to-one correspondence. These genes are shown as colored lines in Fig. 4.
|
|
|
|---|
The shorter chromosomes of M. caseolyticus compared to those of staphylococcal species are rather similar to those of streptococci, which are known to lack many genes for the biosynthesis of small molecules, and therefore require many nutrients. It is, however, unknown which M. caseolyticus genes compensate for the shortage of such biosynthetic pathways, since the substrates of many of the genes regarded as transporters due to their possession of the motif sequences are unknown. For example, only one gene was identified as a ferrichrome transporter in the M. caseolyticus genome in contrast to S. aureus, in which more than ten genes are involved in iron transport. It is therefore still unknown how M. caseolyticus utilizes iron efficiently based on analysis of the whole-genome sequence. Nevertheless, it may be that the genome became shortened after the divergence of the genera Bacillus and Macrococcus, presumably because Macrococcus adapted to a nutrient-rich habitat.
The phylogenicity of the 297 core genes and our chromosome-clustering algorithm showed a close evolutionary relatedness of strain JCSC5402 with Staphylococcus and Bacillus spp., despite the great differences in genome size and morphology. The small chromosome of the strain JCSC5402 may reflect the essential ancestral chromosome before its divergence into the three species. This view is consistent with the result of the chromosome clustering analysis (Fig. 4), which clearly illustrates the well-conserved distribution of orthologous gene clusters across the chromosomes representing the three separate genera.
By assuming this view, it seems that the ancestral bacterium for family Staphylococcaceae must have downsized its chromosome by losing the genes that were not essential in the new habitat, or animal body, after the divergence from the family Bacillaceae. The large genomes of the Bacillus species, as being represented by 4.2 Mb with B. subtilis or 5.2 Mb with B. anthracis, harbor the genes encoding such features as rod-shaped cell morphology, spore formation, the production of insect toxins, peptide antibiotics, and antifungals. Such genes should have been unnecessary in the new environment to which the ancestor for macrococcal and staphylococcal species was adapted. Following parasitization of an animal body, the speciation into macrococcal and staphylococcal species would have started. Although Macrococcus remained avirulent to the animal, the speciation into Staphylococcus, especially S. aureus, was an aggressive one accompanied by the increase of genome size with accumulation of arrays of virulence genes against the animal host. The oriC environ is considered to have contributed much to this increase of genome size. In the case of S. aureus, the oriC environ contains genes encoding capsular polysaccharide biosynthesis (25), the restriction-modification system (13), immunoglobulin-binding protein A, tissue-binding proteins, and some superantigens (31). A notorious methicillin resistance determinant is also one of the genes transferred by SCC into the oriC environ (17). Thus, it is tempting to speculate that the differentiation of staphylococcal species was greatly promoted by the development of the SCC-mediated gene transfer system in an early phase of the speciation of genus Staphylococcus.
We found that strain JCSC5402 possesses various genes associated with drug resistance. This reconfirms the presence of active selective pressure by chemical agents in the animal husbandry. Figure 3 illustrates the most remarkable finding in the JCSC5402 genome: the mecA-carrying plasmid with a unique cluster of genes, i.e., mecIm-mecR1m-mecAm-blaZm. This may be the ancestral form of the methicillin-resistance determinant of the intractable hospital pathogen MRSA. An intensive study on the mobility and detailed nature of the mecAm gene complex is currently under way.
Published ahead of print on 12 December 2008. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»