Previous Article | Next Article ![]()
Journal of Bacteriology, June 2005, p. 3698-3707, Vol. 187, No. 11
0021-9193/05/$08.00+0 doi:10.1128/JB.187.11.3698-3707.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Infection Control Science, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan,1 Department of Bacteriology, Juntendo University, 2-1-1 Hongo, Bunkyo-ku, Tokyo, Japan,2 Tokyo Metropolitan Institute of Public Health, 3-24-1, Hyakunin-cho, Shinjuku-ku, Tokyo, Japan3
Received 2 December 2004/ Accepted 1 March 2005
|
|
|---|
|
|
|---|
Staphylocoagulase has been studied in two ways. One approach was done by defining the structure and function via purification and examination of the characteristics (14, 19, 24, 25). It was reported that the protein binds to prothrombin with a 1:1 stoichiometry to form staphylothrombin, and the complex induces plasma coagulation by converting fibrinogen into fibrin. This would activate prothrombin without the usual proteolytic cleavages (14, 25). Crystal structure analysis of the first 325 amino acids of staphylocoagulase variants bound to
-thrombin or its precursor prethrombin 2 showed that staphylocoagulase inserted its N-terminal peptide into the activation pocket of prethrombin 2 and changed it allosterically to the active form (12). Kinetic studies also confirmed the importance of its N-terminal two amino acids in mature staphylocoagulase (12).
The second approach examines the diversity of staphylocoagulase in antigenicity as well as the gene, both of which are used as epidemiological markers (13, 17, 45). To date, staphylocoagulase has been classified into 10 serotypes, based on the differences in antigenicity via inhibition test of the clotting activity using specific antibodies against each serotype (23, 45).
The structures of three staphylocoagulase serotypes, namely type I, II, and III, have been reported, based on their nucleotide sequences (21, 22, 38). Comparison of the three deduced amino acid sequences and crystal structural analysis indicated that staphylocoagulase is composed of six distinct segments: (i) 26-amino-acid signal sequence, (ii) N-terminal D1 region, (iii) D2 region, (iv) highly conserved central region, (v) 27-amino-acid repeat region, and (vi) 5-amino-acid C-terminal sequence. The D1 and D2 regions consist of
-helices. Through these two domains, staphylocoagulase contacts prothrombin with its two surface sites, the 148 loop and exosite I. The N-terminal amino acids 1 to 270 that are involved in the D1 and D2 regions of the three serotypes are less homologous, with identity of nearly 50%. The repeat regions comprising 27-amino-acid tandem repeats are highly polymorphic in the number of repeats. Although the serotypes do not correlate with the polymorphism of 27-amino-acid tandem repeats (23), both the N-terminal region and the C-terminal region have been used for S. aureus differentiation by analyzing their diversity using amplified DNA fragments and subsequent restriction fragment length polymorphism analysis.
It is very curious that such a large number of serotypes have been found in a singular protein produced by a species. In this study, in order to understand the molecular basis giving rise to these structural variations, we determined the nucleotide sequences of seven coa genes and compared both nucleotide and deduced amino acid sequences of 10 serotypes of staphylocoagulase. Furthermore, aiming to understand the molecular evolution of the serotypes, we sequenced the upstream and downstream regions right after the coa, compared the coa flanking regions of each serotype, and found that they were highly homologous. The reasons a lot of staphylocoagulase serotypes are found in an S. aureus species are discussed.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Characteristics of S. aureus strains in this study
|
DNA preparation. Chromosomal DNA was extracted using Isoprant (Nippon Gene Co., Ltd., Tokyo, Japan) according to protocols recommended by the manufacturer with a slight modification. Briefly, cells were cultured in BHI broth at 37°C overnight. A 1.5-ml aliquot of the culture was poured into a tube, and cells were collected by centrifugation. The pellet was resuspended with 400 µl of SMM buffer (0.5 M sucrose, 0.1 M disodium maleate, 0.002 M MgCl2; pH 6.5), 20 µl of 2 mg/ml lysostaphin (Wako Pure Chemical Industries, Ltd., Osaka, Japan) was added, and the mixture was left at 37°C for more than 20 min until protoplasts were formed. Cells were collected by centrifugation, and their chromosomal DNA was extracted as recommended by the manufacturer.
DNA sequencing. coa genes and their flanking regions were amplified by PCR using chromosomal DNAs as template. DNA fragments were amplified using primer sets designed based on the S. aureus N315 genome sequence: coa-F (5'-CACAGATGACCCAGCATTAGCTG-3') and coa-R (5'-CGTTACGGCATGGTTACGATGTG-3'); coaup-F (5'-CTTGCATATGAATATGAGTC-3') and coaup-R (5'-AAGCTAGATGCAACTGCTAATG-3'); and coadown-F (5'-GACATATGGTCCTAGAGTAACA-3') and coadown-R (5'-AACTGGTAAGCCATTACGTA-3'). The reaction mixtures consisted of 100-µl total volume solutions containing 10 ng of genomic DNA, 10 pmol of each primer, 0.4 U of Ex Taq DNA polymerase (Takara Bio Inc., Shiga, Japan), 10 µl of 10 Ex Taq buffer, and 20 mM of each deoxynucleoside triphosphate. Amplification was performed with a PCR system 9700 (Applied Biosystems), and the parameters were an initial 1 min at 94°C and 30 cycles of 30 s at 94°C, 1 min at 55°C, and 1 min at 72°C. The PCR products were purified using a High Pure PCR product purification kit (Roche Molecular Systems Inc.). Sequencing reactions were performed with fluorescent dideoxy chain termination chemistry using the BigDye Terminator version 1.1 cycle sequencing kit (Applied Biosystems). DNA sequencing was performed using an ABI Prism 3100 genetic analyzer (Applied Biosystems).
MLST. Multilocus sequence typing (MLST) was preformed as described previously (11). Briefly, the inner regions of seven housekeeping genes (arcC, aroE, glp, gmk, pta, tpi, and yqiL) in S. aureus were amplified by PCR, and their nucleotide sequences were determined using an ABI Prism 3100 genetic analyzer. Strains were defined by the alleles present at the seven loci (the allelic profile), and each unique allelic profile was assigned a sequence type (ST) based on the MLST website (http://www.mlst.net). STs were then correlated to each other by using the program BURST (based upon related sequence types) in the MLST website.
Genetic analysis. Multiple alignments were performed with CLUSTAL X software program (version 1.8) (43). The alignment was adjusted manually for any obvious misplacement of gaps on the basis of physiochemical properties of amino acids. The phylogenetic tree was created using the maximum likelihood method in the GCG paupsearch program in the Wisconsin GCG package (Accelrys Inc.) Support for individual nodes on the tree was determined by bootstrap analysis. The phylogenetic trees were visualized with Treeview (34).
Nucleotide sequence accession numbers. The sequences of the coa genes and their flanking regions of S. aureus strains 104, Stp-28, no. 55, Stp-12, Ku, 17573, and 19 have been deposited in the DDBJ, EMBL, and GenBank databases under accession numbers AB158549, AB158550, AB158551, AB158552, AB158553, AB158554, and AB158555, respectively.
|
|
|---|
Table 2 shows the characteristics of coa in the 10 serotypes. The lengths of these genes are rather diverse. They ranged from the shortest size of 1,902 bp (type VII) to the longest size of 2,280 bp (type VIII), and the average length was 2,034 bp. The numbers of 81-bp tandem repeats located at the 3' terminus were diverse, too. They ranged from the smallest number of repeats, five (NCTC 8325, MW2, and 19), to the largest number of repeats, nine (Ku). The differences in nucleotide sequences of the 81-bp tandem repeats and the number of tandem repeats were reported and have been used for epidemiological markers as well (42). When we assigned alleles defined by nucleotide differences in the 81-bp repeat units that constitute the tandem repeat, 36 of 63 were new alleles (V3 to G5 in Table 2). The average GC content was 35.7%, which was slightly higher than that of the whole genome sequence of S. aureus N315 (29%) (28). Table 3 shows overall identities of the nucleotides and their deduced amino acid sequences of 10 coa genes. The nucleotide identity was 77.1% on average, and amino acid identities and similarities were 75.3% and 79.8%, respectively. The highest value of amino acid identity was 88.4% (type IV versus type IX), and the lowest was 69.5% (type V versus type VII).
|
View this table: [in a new window] |
TABLE 2. Comparison of 10 staphylocoagulase genes
|
|
View this table: [in a new window] |
TABLE 3. Nucleotide and amino acid identities among all serotypes
|
It was reported that staphylocoagulase could bind fibrinogen in addition to prothrombin (6). Cheung et al. purified a protein that binds to fibrinogen by using a fibrinogen column, and they designated it FbpA (9). However, after nucleotide sequence determination of the gene (fbpA), they found that its structure was very similar to coa, and they reported that it should belong to the coa family. When we compared the deduced amino acid sequence of FbpA to that of the 10 staphylocoagulase serotypes, we found that FbpA is nearly identical to type X staphylocoagulase, with an identity of 98.0%. The nucleotide identity between fbpA and type X coa was 98.5%.
Structural comparison of staphylocoagulases among 10 serotypes. Figure 1 shows the multiple alignments of the deduced amino acid sequences of staphylocoagulases of all 10 serotypes. We found that all deduced amino acids were composed of six regions as previously reported by Fridrich et al. (12). The six regions are as follows: (i) 26-amino-acid signal sequence, (ii) N-terminal D1 region, which contains the prothrombin-activating site and the prothrombin-binding sites, (iii) D2 region, which is required for high-affinity binding of prothrombin through pro-exosite I, (iv) highly conserved central region, (v) 27-amino-acid repeat region, and (vi) 5-amino-acid C-terminal sequence.
![]() View larger version (123K): [in a new window] |
FIG. 1. Deduced amino acid sequences of 10 serotypes of staphylocoagulase compared using the CLUSTAL X program. The corresponding nucleotide sequences were determined using DNA fragments of the following strains: 104 (type I coa), N315 (type II), NCTC 8325 (type III), Stp-28 (type IV), no. 55 (type V), Stp-12 (type VI), MW2 (type VII), Ku (type VIII), 17573 (type IX), and 19 (type X). The highly conserved residues among 10 staphylocoagulases are shown in gray to highlight them. The identical or conserved residues are marked with an asterisk, colon, or period at the top. The residues which are in contact with the 148 loop or exosite I reported by Friedrich et al. (12) are indicated as closed or open triangles, respectively. PA, prothrombin-activated domain; CT, C-terminal sequence.
|
The central regions located between the D2 region and the tandem repeats were relatively more conserved, and the averages of amino acid identity and similarity were 86.7% and 88.0%. Three strains had insertions that were not carried by other types. They were as follows: 18 amino acids (GTQGKIVGRSKYPTMEQH) in the type V staphylocoagulase of strain no. 55, 18 amino acids (TIQGVTAEGPKYPTMEQH) in the type VIII staphylocoagulase of strain Ku, and 11 amino acids (SVTLPSITGES) in the type X staphylocoagulase of strain 19.
Tandem repeats composed of 27 amino acids were located at the C terminus in all staphylocoagulases we investigated. The number of tandem repeat units varied, but the 27-amino-acid sequences were relatively conserved. The average amino acid identities among the 10 serotypes was 92.9%. The C-terminal 5-amino-acid sequences were completely identical among all serotypes.
Structural relation to vWbp. S. aureus produces another blood coagulation-causing protein, von Willebrand factor-binding protein (vWbp). Its overall structure is very similar to that of staphylocoagulase, and it is classified into the same zymogen activator and adhesion protein family as staphylocoagulase (3). The deduced amino acids of two regions, called D1 and D2 of vWbp, showed homology to D1 and D2 regions in type I staphylocoagulase, with identities of 30.7% and 28.3%, respectively. Interestingly, vWbp is allelic, too (4). Among seven S. aureus strains whose genome sequences have been published (2, 16, 18, 28), the gene (vwb) is not always intact. It was truncated in three strains, NCTC 8325 (staphylocoagulase type III), MW2 (staphylocoagulase type VII), and MRSA252-2 (staphylocoagulase type IV). When we compared vwb genes in four other strains, two alleles were found. The first allele was found in two strains, COL (staphylocoagulase type III) and MSSA476 (staphylocoagulase type VII), and the second one was found in N315 (staphylocoagulase type II) and Mu50 (staphylocoagulase type II). The vwb gene in strain Newman (3) and two truncated vwb genes in NCTC 8325 and MW2 belonged to the first allele. The amino acid identity and similarity of vWbp between the two alleles represented by S. aureus N315 and S. aureus COL were 70.9% and 76.3%, respectively. The amino acid identities between them are rather low in D1, D2, and the central region, with identities of 56.9%, 53.9%, and 59.7% and similarities of 66.2%, 58.6%, and 70.1%, respectively, whereas the identities and similarities are very high in the vWb binding site and the C-terminal region, with identities of 96.2% and 96.0% and with similarities of 96.2% and 97.6%, respectively. These regions showing low homology are similar to those of staphylocoagulase.
coa flanking regions are highly homologous. Determination of the entire genome sequences of all seven S. aureus strains showed that coa is located at approximately 1 o'clock on the chromosome. To investigate whether all coa genes are located at the same position, we determined the nucleotide sequences of coa flanking regions in strains whose genomes had not been determined. The strategy and results are summarized in Fig. 2. In the case of S. aureus N315, its coa is located between two open reading frames (ORFs), SA0221 (encoding a hypothetical protein) and SA0223 (encoding an acetyl-coenzyme A acetyltransferase homologue). By using the sets of primers indicated in Fig. 2, two types of DNA fragments, each spanning the upstream and downstream regions of coa, respectively, were amplified by PCR, using chromosomal DNA of the seven reference strains as templates, and sequenced. All 10 strains carried two ORFs corresponding to SA0221 and SA0223 that are located at upstream and downstream of coa. Nucleotide identities of the regions corresponding to the two ORFs are shown in Fig. 2. They were highly homologous in nucleotide identities, with averages of 98.4% and 99.0%, respectively. In addition, noncoding regions flanking coa were well conserved (Fig. 2). Our data showed that the regions flanking coa are nearly identical, while coa sequences located within them are very diverse, especially in the D1 and D2 regions and 81-bp repeat regions.
![]() View larger version (26K): [in a new window] |
FIG. 2. Organization of the region in and around coa and comparison of the nucleotide sequences of coa and surrounding regions. ORFs are shown by bold arrows. The locations of the primers used in this study are indicated by short arrows. The numbers in the boxes show average nucleotide identities of each of the genes or regions among the 10 serotypes.
|
![]() View larger version (17K): [in a new window] |
FIG. 3. Maximum likelihood trees constructed using GCG paupsearch and visualized with Treeview software. Nucleotide sequences used for comparison were as follows: A, nucleotide sequencesof coa other than the repeat region and C-terminal sequence; B, regions surrounding coa which were concatenated sequences of SA0220 noncoding region 1, SA0221 noncoding region 2 on the left region flanked by coa and noncoding region 3, and SA0223 on the right region flanked by coa; C, housekeeping genes, which were concatenated parts of seven housekeeping genes used for MLST. Branches with significant bootstrap support (more than 50%) are indicated. Group 1 strains are highlighted by black boxes; the remaining are group 2 strains.
|
Correlations among serotypes of coa and genetic backgrounds. To determine the correlations between coa alleles and the genetic backgrounds of the strains, we investigated the strains carrying the same serotype of coa but with different genetic backgrounds and the strains carrying different types of coa but with identical genetic backgrounds as determined by ribotyping.
The nucleotide sequences of type IV coa in six S. aureus strains (three of them belonged to ribotype 4 and three of them belonged to ribotypes 47, 48, and 49) were highly homologous, with nucleotide identities of more than 99.9%. The difference in genotypes inferred by ribotyping was also confirmed by MLST. Three ribotype 4 strains, 85/2082, 85/3907, and 86/961, belonged to CC8, and three strains of different ribotypes, MR108, 93/H44, and 85/2232, belonged to CC30. These data suggested that the same coa types are located in S. aureus strains of different genotypes.
Furthermore, we determined nucleotide sequences of type III coa of three strains which belonged to ribotype 4 and CC8. The nucleotide identities of coa of three strains, NCTC 10442, 61/6219, and 86/4372, were highly homologous, with identities of more than 99.9%. The data suggested that different types of coa are located on S. aureus strains of the same genetic background, ribotype 4 and CC8.
|
|
|---|
The finding of antigenic variation in staphylocoagulases produced by S. aureus dates as far back as 55 years (10, 39). Since then, 10 serotypes of staphylocoagulases have been identified. Using purified staphylocoagulases as antigens, antibodies specific to each type of staphylocoagulase are manufactured by Denka Seiken in Japan. The method of staphylocoagulase typing was developed based on coagulation inhibition with specific antibodies, and it has been used as a useful method for epidemiology of S. aureus (45). Here we show the genetic basis of those serotypes by nucleotide sequence determinations of staphylocoagulase genes of 10 serotypes.
The 10 staphylocoagulases investigated are composed of common regions with rather diverged regions among 10 serotypes. The N-terminal 33 amino acids comprised of the signal sequence (26 amino acids) and the first 7 amino acids in the D1 region containing the prothrombin-activating domain are exactly identical in all 10 staphylocoagulases. The central regions are relatively conserved. In contrast, D1 regions other than the first 7 amino acids and D2 regions are the most diverged regions among the 10 serotypes, with average amino acid identities of 52.8% and 60.2%, respectively.
Binding of staphylocoagulase to the prothrombin-binding region, which is an essential step for its activity, might be inhibited by antibodies. Purified antibodies that are specific to each serotype might recognize the amino acid sequences of the low-homology region, especially the D1 region rather than the D2 region or their three-dimensional structures, and prevent the binding of staphylocoagulase to the prothrombin-binding region. The fact that there are three pairs of serotypes with rather high amino acid identities (approximately 90%) in D2, namely, type II and type III, type I and type X, and type IV and type VI, also supported the above presumption.
Staphylocoagulase also binds to fibrinogen (6). Other than staphylocoagulase, seven proteins having fibrinogen-binding activity are known to be produced by S. aureus, e.g., clumping factor A (ClfA) (30), clumping factor B (ClfB) (32), major histocompatibility complex class II analog protein (Map) (31), extracellular adherence protein (Eap) (35), fibronectin-binding proteins A and B (FnbpA and FnbpB) (48), and extracellular fibrinogen-binding protein (Efb) (5). Among these proteins, Efb showed the highest similarity to staphylocoagulase. An identical sequence of 22 amino acids between a spacing of 9 amino acids in the N-terminal part of the mature protein showed homology to the tandem repeats of staphylocoagulase at the C terminus. Although Efb binds fibrinogen at two sites, which are the repeat regions at the N terminus and C terminus, only the N-terminal segment of the protein, containing the two 22-amino-acid repeats, could bind to soluble fibrinogen. The fact that Efb and staphylocoagulase compete for the same binding domain on fibrinogen might be due to their structural similarity (36). Staphylothrombin (a complex of staphylocoagulase and prothrombin) binds fibrinogen at the central region and repeat region. The staphylothrombin plays an important role in plasma coagulation by converting fibrinogen to fibrin.
Differences between staphylocoagulases produced by S. aureus and S. intermedius. Staphylocoagulase produced by S. intermedius is different from that of S. aureus. It contains sugar, fucose, and galactose, which are not found in the staphylocoagulase of S. aureus (26). They are different in antigenicity (20). The three amino acids at the N terminus of S. intermedius are Ser-Glu-Pro, whereas those of S. aureus are Ile-Val-Thr. Since the N terminus, including the three amino acids, is essential for prothrombin activation, this may explain the observation that clotting activities of these two coagulases are not identical (26).
Why are there so many serotypes of proteins that cause blood coagulation? From our results, we presume that antigenic variation in staphylocoagulases might be useful to evade the host immune response and/or to adapt to the different staphylocoagulase-prothrombin binding sites of mammalian species. Vaccines against the type b capsular polysaccharides of Haemophilus influenzae (Hib) have been used to reduce the incidence of invasive diseases caused by H. influenzae in industrial countries. However, as the use of the vaccines increased, the number of non-Hib strains in clinical isolates has also increased (29, 33, 49). The possibility of "serotype replacement" has been raising concerns. It can be postulated that staphylocoagulases changed their antigenicities by altering amino acids to evade the host defense system.
S. aureus could colonize and be pathogenic not only to human beings, but also to other warm-blooded animals. In mammals, prothrombin is one of the conserved proteins. However, its structure is not identical among various species; rather, it differs from species to species. The amino acid identity between human prothrombin and rat prothrombin is 79.7%, and that between human and cow prothrombin is 81.5%. Type IX staphylocoagulase of strain 17573 was isolated from a cow. Type I to VIII staphylocoagulases were carried in strains isolated from humans. Carter et al. reported that the 5'-terminal sequence of the strain AN3 staphylocoagulase was isolated from a lamb and that the HinfI restriction pattern of it was classified as type 3 (8). When we compared the two 5'-terminal nucleotide sequences of AN3 staphylocoagulase and type IX staphylocoagulase, we found they were exactly identical. Therefore, it might be speculated that type IX staphylocoagulase strains may be inclined to colonize animals. Other 5'-terminal sequences of coa isolated from animals were slightly different from coa types I to X. The nucleotide sequences were similar between HinfI type 1 and type I coa, HinfI type 2 and type VI coa, HinfI type 4 and type V, HinfI type 5 and type VI coa, and HinfI type 6 and type V coa. The amino acid identities of the D1 region between each pair were 98.6%, 99.3%, 94.1%, 94.3%, and 94.8%, respectively.
Correlations between staphylocoagulase serotype and genetic background of S. aureus.
The average coa nucleotide identities among 10 serotypes was 77.1%. It was lower than the identities between the whole genomes of S. aureus, since three methicillin-resistant S. aureus strains possess high nucleotide sequence identities: 99.7% between N315 and Mu50, 94.8% between MW2 and N315, and 94.7% between MW2 and Mu50 (2). Most of the nucleotide differences in the S. aureus chromosome are due to the insertion of some DNA regions, such as genomic islands (Gislands) and genomic islets (Gislets) (1, 2, 15). The Gisland is defined as a relatively large chromosomal region (10 to 200 kbp) whose GC content differs from the chromosome and which was acquired from other bacteria by lateral gene transfer. It encodes most of the genetic information involved in virulence and drug resistance (41, 44). In contrast to Gisland, Gislet is composed of a single gene, which is allelic, and its flanking regions are highly conserved (15). No recombinase-like protein was encoded in Gislet, and so an insertion sequence is not involved in this category. In addition to the whole genome sequences of seven S. aureus strains, data from this study indicated that coa should be regarded as a member of Gislets, since all 10 coa serotypes were found to be located at the same position on the chromosome between the highly conserved regions. Carter et al. previously reported that some strains with different pulsed-field gel electrophoresis patterns and MLST have the same types of staphylocoagulase, judging from the HinfI-random fragment length polymorphism pattern of the N-terminal fragment of coa (8). We have shown that seven strains of two different STs, CC8 and CC30, carried the same type IV coa. Furthermore, different types of staphylocoagulase, namely type III and type IV, were carried by strains of the same genotypes, CC8. The large chromosomal replacement in the S. aureus chromosome was reported by Robinson et al. (40). Since the size of the replaced region is at least
557 kbp and the region involves coa, this might explain the reason why different coa are carried in the same genotypes. However, further studies are needed to conclude this.
When we examined phylogenetic relatedness between coa and other chromosomal regions, we found small discrepancies between them, i.e., the type I coa of strain 104 belonged to group 2, whereas the flanking regions of coa and the housekeeping genes in the chromosome belonged to group 1 in their phylogenetic trees. Similarly, coa of NCTC 8325 is located at a slightly different position from the flanking regions of coa in the phylogenetic tree. These data strongly suggest that some S. aureus strains acquired coa by recombination, although its mechanism remains unknown. S. aureus is a part of our normal flora, and it persists in the human body by altering its genetic traits successfully. Hence, it could be possible that S. aureus strains might alter their staphylocoagulase type to adapt to host immunity and/or interspecies differences of prothrombin.
This work was supported by a Grant-in-Aid for 21st Century COE Research and Grant-in-Aid for Scientific Research on Priority Areas (13226114) from The Ministry of Education, Science, Sports, Culture and Technology of Japan.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»