Previous Article | Next Article ![]()
Journal of Bacteriology, August 2005, p. 5376-5386, Vol. 187, No. 15
0021-9193/05/$08.00+0 doi:10.1128/JB.187.15.5376-5386.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
and
Christopher G. Dowson
Infectious Disease Research Group, Department of Biological Sciences, University of Warwick, Coventry CV4 7AL, United Kingdom
Received 1 September 2004/ Accepted 28 April 2005
|
|
|---|
|
|
|---|
(2-3) and
(2-6) linked to galactose or
(2-6) linked to N-acetylgalactosamine (6). The pneumococcus contains several loci with sequence similarity to neuraminidases; nanA and nanB are expressed and possess activity (3, 6). A third putative neuraminidase, nanC, remains to be characterized (35, 37). The role of S. pneumoniae neuraminidase activity is not fully elucidated; however, evidence indicates that NanA is involved in colonization and virulence. One suggested contribution of NanA to colonization is to increase the availability of ganglioside receptors that are important for pneumococcal adherence (40). A nanA mutant has a significantly reduced ability to colonize tracheal epithelium (39) and persist in a chinchilla model of nasopharyngeal colonization (38). Furthermore, this mutant was cleared more efficiently in a chinchilla middle ear infection model (38). In contrast, little difference was identified in an intraperitoneal infection model (4).
Evidence suggests that NanA may contribute to long-term colonization by modifying both other nasopharyngeal organisms and host proteins. NanA has been demonstrated to desialylate the cell surfaces of Neisseria meningitidis and Haemophilus influenzae (35). Both these species reside in and possibly compete for the same host niche as pneumococci, and desialylation of lipopolysaccharides of competitors may well provide an advantage. In addition, NanA may contribute to a protease-independent mechanism to modify the function of host glycoproteins that bind to the pneumococcus and protect the airway (24). All isolates screened have been found to produce sialidase activity (21, 31). In patients with pneumococcal infection, a direct correlation exists between levels of N-acetylneuraminic acid in cerebrospinal fluid and development of coma and adverse outcome (31). Recently, immunization with NanA has also been demonstrated to afford protection against nasopharyngeal colonization and experimental otitis media (27).
The housekeeping genes of S. pneumoniae show little variability with sequence divergence ranging from 1 to 2% (7, 11). However, due to the natural transformability of this species it evolves largely though homologous recombination and not point mutations (12). Intra- or interspecies (9) recombination can occur, resulting in the transfer of an entire gene(s) or gene fragments with >20% nucleotide sequence diversity (9). This homologous recombination results in the formation of mosaic genes. Mosaic structure was first identified in the formation of low-affinity penicillin binding proteins found in highly penicillin resistant pneumococci resulting from the selective pressure of penicillin therapy (7). It is therefore not surprising that host immune selection has also led to horizontal transfer involving genes encoding some exposed pneumococcal surface proteins. Loci displaying evidence of this gene transfer include the capsule, pspA, pspC, and igaP (5, 16, 18, 33).
In addition, published data demonstrate some sequence divergence within the 3' region of nanA (Fig. 1). The function of this 3' region of nanA is unclear, as a clone lacking the 232 N-terminal amino acids still possessed neuraminidase activity (6). Dowson et al. (8) previously identified a serotype 14 clinical isolate (Pn13) that lacked three tandem 60-bp repeats present within R36A nanA (encoding amino acids 898 to 957). Furthermore, proximal to the 60-bp repeats, R36A possessed a 183-bp region (bp 2508 to 2691) that diverged 36% from the nucleotide and 34% from the amino acid sequence of Pn13. In addition, Pn13 contained a 15-bp duplication (nucleotide 2431) upstream of these alterations. Sequencing of a serotype 4 (TIGR4) genome also revealed sequence diversity in the 3' region of nanA, which was identical to Pn13 with the exception of an 11-bp deletion within TIGR4. This deletion results in a change in reading frame and termination of the gene at amino acid 804 (32, 37).
![]() View larger version (46K): [in a new window] |
FIG. 1. Nucleotide (A) and amino acid (B) alignment to demonstrate diversity within the 3' region of nanA from R36A, Pn13, and TIGR4. Numbering of the sequence starts from the first putative ATG within the R36A published sequence (X72967 [6]). Identical nucleotides and amino acids are illustrated by a dot, while absent sites are represented by a dash. The location of block D is boldfaced. The three 60-bp (20-amino-acid) direct tandem repeats are underlined, and a vertical line indicates the start of each repeat. A 7-bp sequence present at the start of each repeat and immediately distal to the repeats (A) is double underlined. The 15-bp (5-amino-acid) duplication within the Pn13 and TIGR4 sequence is italicized. The asterisk indicates that the protein terminates at that residue (B).
|
|
|
|---|
Preparation of chromosomal DNA. Chromosomal DNA was prepared from all the isolates used, following the protocol described previously (45).
PCR analysis. PCRs were performed under standard conditions with 30 cycles of 95°C for 1 min, X°C for 1 min, and 72°C for 1 min per kilobase of predicted product size where X°C represents an appropriate annealing temperature for the primer pair used. Products were visualized by agarose gel electrophoresis on 1.0 to 1.5% agarose, depending on the size of the PCR product, in the presence of 1 µg ml1 ethidium bromide. Details of oligonucleotides used in this study are provided in Table 1. For all PCRs appropriate positive controls were performed resulting in products of approximately the predicted sizes.
|
View this table: [in a new window] |
TABLE 1. PCR primers utilized in this study
|
Direct sequencing. A fraction of 5' and 3' PCR products were purified by passage through the QIAquick PCR purification kit (QIAGEN) and sequenced using amplification and internal primers with an ABI 373A system.
Bioinformatics. Nucleotide divergence from RFLP data was estimated using Nei3.bas (courtesy of B. Spratt, Imperial College, London, United Kingdom) based on the algorithm of reference 30. Primary sequence analysis was performed using the package DNASTAR. The bioinformatics tools located on http://ncbi.nlm.nih.gov/BLAST were utilized to investigate the similarity of sequences identified to those already present in the databases. Alignments were performed using MEGAMolecular Evolutionary Genetics Analysis, version 1.01 (25). Mosaic structures resulting from recombination events were detected by the maximum chi-squared test (P < 0.001) (28) and Sawyer's runs test (34). Programs for both tests were written in C++ for the Macintosh by Nick Ross (University of Sussex).
Neuraminidase activity.
Neuraminidase activity assays were performed on aliquots of sonicated stationary-phase cultures using the fluorimetric substrate 2'-(4-methylumbelliferyl)-
-D-N-acetylneuraminic acid (MUAN) as previously described (24).
Southern blot analysis.
Southern blot analysis was performed using the Boehringer digoxigenin (DIG) system according to the manufacturer's instructions. Five micrograms of chromosomal DNA from a nanAR (nanA allele containing 60-bp repeats) and a nanA
R (nanA allele lacking 60-bp repeats) pneumococcal strain and 10 other streptococcal isolates (three S. oralis isolates, one S. mitis isolate, three S. sanguinis isolates, one S. cristatus isolate, one S. constellatus subsp. constellatus isolate, and one S. ratti isolate) was restricted with EcoRV. The repeat and block D probe used was generated by restricting an R36A PCR product obtained by amplification using primers 9 and 2 with PvuII. These primers restrict the product 32 bp before the end of the repeat sequence. The resulting 624-bp fragment was purified from an agarose gel using a QIAQuick gel extraction kit (QIAGEN) and labeled with DIG-High Prime (Boehringer). Hybridization was allowed to proceed overnight at 42°C in EasyHyb solution (Boehringer). Posthybridization washes were performed at room temperature using 2x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-0.1% (wt/vol) sodium dodecyl sulfate followed by 0.1x SSC-0.1% (wt/vol) sodium dodecyl sulfate.
Nucleotide sequence accession numbers. Sequences of the 12 novel nanA 5' and 10 novel nanA 3' alleles have been submitted to GenBank and assigned the accession numbers AJ968992 to AJ969014. In addition, the S. oralis gene fragments were submitted to GenBank and assigned accession numbers AJ968986 to AJ968991.
|
|
|---|
Diversity and the mechanisms of evolution within 5' nanA. All 106 5' nanA PCR products, obtained by amplification with primers 1 and 3, were restricted independently with seven frequently cutting restriction enzymes to determine their sequence diversity. Comparison of restriction patterns resulted in the identification of 53 distinct RFLP profiles (5' RFLP). The nucleotide divergence between any two RFLP alleles as estimated using Nei3.bas was found to vary from 0.11% to greater than 10% (data not shown) (30). For this method of analysis, values above 10% are not accurate. The most common restriction profile identified, 5' RFLP9, was identified among 12.5% of isolates screened including strain R36A.
PCR products from 12 isolates with different 5' RFLP patterns were sequenced in order to further examine the extent and nature of genetic diversity within the 5' region of nanA (Table 2). Diversity between these sequenced PCR products ranged from 0.4% to 17.2% at the nucleotide level and from 0.4% to 14.8% at the amino acid level (accession numbers AJ968992 to AJ969003), confirming the high level of divergence initially identified by RFLP (Fig. 2A).
|
View this table: [in a new window] |
TABLE 2. Details of the nanA 5' PCR products sequenced and their nucleotide divergence from R36A
|
![]() ![]() ![]() View larger version (163K): [in a new window] |
FIG. 2. Diversity within the 5' region of nanA. Nucleotide (A) and amino acid alignment (B) and schematic (C) of the 5' published sequences from R36A, TIGR4, and 12 other pneumococci selected to represent a range of 5' RFLP profiles. The numbering refers to the position within the published sequence starting at the first putative ATG within the R36A published sequence (X72967 [6]). To allow reference to the location within the published sequence, residues within the 48-bp illegitimate insertion are not numbered. Only residues displaying diversity are presented. Absent sites are shown by dashes, while dots represent identical residues (A and B). Blocks A, B, and C are indicated by underlining, italics, and boldface, respectively (A and B). The location of the recombinant blocks within the 5' schematic is represented by diagonally striped boxes (block A), diamond-patterned boxes (block B), and white boxes (block C) (C). The 48-bp insertion is shown in black (C).
|
In addition to interspecies homologous recombination, alterations occurring by point mutations (or intraspecies recombination with an allelic variant) were identified. The frequency of point mutations identified was relatively low and the changes largely synonymous, with sequence divergence excluding the recombinant blocks ranging from 0.15 to 1.02% at the nucleotide level and 0.29 to 1.48% at the amino acid level (data not shown).
Despite the diversity identified within nanA all 11 strains selected for sequencing were shown to possess neuraminidase activity. Although other pneumococcal neuraminidases could contribute to the detected activity, the assay employed was optimized to detect NanA activity.
The origin of mosaic blocks identified within the 5' region of nanA. In order to investigate if any related streptococcal species were the donors of recombinant block A or C, primer pair 5 and 6 and pair 7 and 8 were designed to amplify fragments of 523 bp and 512 bp from recombinant blocks A and C, respectively. Sixty-six isolates representing 18 different streptococcal and two Abiotrophia species (19) were screened using both primer pairs. Block A primers (5 and 6) amplified a band of the predicted size for eight S. oralis isolates although a further 15 S. oralis isolates including the type strain (NCTC11427) were found to be negative by PCR. Sequencing six of these PCR products from S. oralis revealed that they were closely related to block A (5.9 to 6.1%) (accession numbers AJ968986 to AJ968991). The sequence most closely related to recombinant block A (Cr51) diverged only 5.9% at the nucleotide level and 3.2% at the amino acid level. Diversity between the six S. oralis sequences ranged from 2.3 to 5.9% at the nucleotide level. No amplification products were visualized for any of the 66 streptococcal isolates following amplification with block C primers (7 and 8) or for the whole region encompassing block A to block C (primers 5 and 8).
Diversity within the 3' region of nanA.
In order to investigate the distribution of the previously identified 3' sequence divergence (Fig. 1) and to investigate any additional sequence diversity within this region of nanA, the 3' region of the gene (bp 2383 to 3126) was amplified, successfully, from all 106 isolates using primers 4 and 2. To investigate the distribution of the previously identified 60-bp repeat sequences, the PCR products were visualized alongside R36A and Pn13 controls. The amplification products could be divided into five different groups based on size. Amplification from 94 isolates resulted in a product of approximately the same size as that from Pn13 (579 bp), known to contain no repeats and designated nanA
R. The remaining 10 isolates (9.6%) gave larger PCR products; details of these isolates containing putative nanARN (N indicating the number of repeats) alleles are presented in Table 3. Sequencing confirmed that the four different product sizes observed correlated with sequences containing one, two, three, or four 60-bp tandem repeats (Fig. 3). Sequencing also revealed limited diversity within the repeat sequences. Ten nucleotide variants were identified that code for five closely related 20-amino-acid sequences, including the three distinct R36A repeat sequences.
|
View this table: [in a new window] |
TABLE 3. Details of the 10 nanAR-containing isolates
|
![]() View larger version (24K): [in a new window] |
FIG. 3. Schematic of diversity identified within the 3' region of nanA. The numbering starts at the corresponding location within the coding sequence of the published sequence starting at the first putative ATG within the R36A published sequence (X72967 [6]). Block D is represented by diagonally striped boxes, and the 60-bp repeat sequences are represented by dark gray boxes, while the 15-bp repeat sequences are represented by light gray boxes.
|
![]() View larger version (87K): [in a new window] |
FIG. 4. Southern blot of block D and the 60-bp repeats to isolates of other streptococcal species. The repeat sequences and block D from R36A were DIG labeled and used as a probe. Five micrograms of chromosomal DNA from each strain was restricted with EcoRV, electrophoresed, and transferred to a membrane, and the Southern blot assay was performed. Isolates included were Cr7 (nanA R) (lane 1), Cr48 (nanAR4) (lane 2), three S. oralis isolates (lanes 4, 5, and 6), one S. mitis isolate (lane 7), three S. sanguinis isolates (lanes 8, 9, and 10), one S. cristatus isolate (lane 11), one S. constellatus subsp. constellatus isolate (lane 12), and one S. ratti isolate (lane 13). Numbers at right are molecular sizes in base pairs.
|
R isolates, primers were designed to exploit sequence alterations at the 5' end of this region between R36A (primer 9) and Pn13 (primer 10). Two reactions were performed for all 94 nanA
R isolates and relevant controls utilizing primer pair 9 and 2 and pair 10 and 2. All nanA
R isolates lacked full-length block D, giving a product of approximately the predicted size (444 bp) with primers 10 and 2. No product was visualized when using primer 9.
Sequencing of the 3' PCR products also demonstrated diversity in the numbers of 15-bp direct repeat sequences (nucleotide 2416) (Fig. 3). To further investigate the diversity in the number of 15-bp repeats, nanA
R 3' PCR products were digested with restriction enzymes that cut just distal to the 15-bp sequence, resulting in a small fragment that increased by 15 bp for each repeat. The restriction digests demonstrated that 16.5% of the isolates contain one 15-bp repeat, 79% contain two repeats, and 4.5% contain three repeats (data not shown). Neither sequencing nor restriction digests showed evidence of the deletion event that results in an early termination of the TIGR4 NanA, suggesting that this sequence alteration is not common within pneumococci. With the exception of these repeat sequences, and block D, the 3' regions of the sequenced nanAR are highly similar.
Origin of the 3' 60-bp repeat sequences and block D.
The predicted 20-amino-acid repeat sequence and block D identified within the 3' region of R36A are not present elsewhere within the NanA sequence, the available pneumococcal genomes, or other microbial genomes. Nevertheless, the 20-amino-acid repeats share up to 75% amino acid identity with repeat sequences within the 3' region of a proposed ß-galactosidase from S. pneumoniae (accession no. BAB91370.1). However, these repeats are not present within the predicted amino acid sequence of the other sequenced pneumococcal ß-galactosidases. The repeats also share 41% and 38% sequence identity to repeats within the immunoglobulin A1 proteases of S. oralis and S. sanguinis, respectively. A Southern blot assay was performed to investigate the donor of the repeat sequences and proximal divergent region using R36A as a probe (Fig. 4). This probe hybridized to a chromosomal digest of a nanAR4-containing isolate (Cr48) used as a positive control and very weakly to one fragment from the negative control nanA
R isolate (Cr7). Two fragments of DNA from each of two S. oralis isolates including the type strain (NCTC11427) reacted with the probe. A third S. oralis isolate did not hybridize with the probe under the conditions used. A weak interaction with the probe was also detected for the S. sanguinis type strain (NCTC7863).
|
|
|---|
Interspecies homologous recombination transfer within nanA has resulted in mosaic genes involving four different recombinant blocks (blocks A, B, C, and D). Block B has a high level of nucleotide diversity from the R36A sequence; however, there is only a single amino acid alteration, implying that this region is conserved between neuraminidases of different species. The three remaining recombinant blocks possessed high levels of both amino acid and nucleotide alterations. The level of nucleotide divergence for each recombinant block is greater than 30% and much higher than the 20% identified within mosaic pbp genes (10, 26), suggesting that a high selective pressure exists for these alterations. Multiple recombination events have occurred within a single allele, as indicated by the presence of more than one recombinant block within the nanA sequence of several isolates (Fig. 2). It is highly likely that multiple genes acted as donors, as two blocks (A and B) are within the same region of nanA. Despite the different sizes of individual blocks identified, the high sequence conservation of each block suggests that a single donor has been responsible for the introduction of each region into pneumococci. The fact that all partial blocks identified share one boundary with the full-length block suggests that a single interspecies recombination event with the donor has occurred for each block, followed by transfer of nanA regions between different S. pneumoniae strains.
The 20-amino-acid repeat sequences were identified within the nanA 3' region of 9.6% (10/104) of isolates screened. The number of 60-bp repeats varied (nanAR1 to -4) as has been seen within other streptococcal surface proteins, presumably due to slipped strand mispairing or recombination (44). A full-length block D was located proximal to the 20-amino-acid repeats from six of the nanAR sequences but none of the nanA
R isolates. Of the remaining four nanAR isolates, one contained a partial block D sequence while the remaining three nanAR alleles shared sequence identical to that from Pn13 with the addition of one repeat. As repeat sequences have been identified without block D, it is possible that they were introduced by two independent recombination events. Moreover, the introduction of the repeats may have facilitated introduction of block D. However, it seems more likely that they were transferred during a single recombination event and that the nanAR1 alleles were formed by recombination between a nanAR and a nanA
R isolate. This recombination could occur within a 7-bp sequence present at the start of each repeat and immediately distal to this location in all isolates (Fig. 1) and at any point distal to the repeat. Besides attachment to the cell surface the function of the 3' region is unknown (6). However, the nature of these recombination events suggests that there is a selective pressure upon this region.
Many surface-exposed and secreted proteins of streptococci contain repeats, for example immunoglobulin A1 protease of S. pneumoniae (43) and M proteins of S. pyogenes (17). The wide distribution of these repeat structures implies that these regions must be functional. It has been suggested that the presence of repeats within a protein may provide means for immune evasion (20) or posttranslational or postsecretional processing or aid retention of the protein in the membrane (20). Therefore, NanA containing repeats may have a selective advantage under certain environmental conditions. A function, preventing capsule expression, has been assigned to apparently random spontaneous repeats arising within the capsule locus of S. pneumoniae when grown in Sorbarod biofilms (41, 42). However, the physiological conditions apparently driving these events are as yet unclear.
The donors of the blocks identified within nanA are most likely related streptococci. Many streptococci, including S. oralis, S. pyogenes, S. agalactiae, S. intermedius, and some S. mitis isolates, are known to produce sialidase activity (15, 22, 47), although little is known about the sequences, cellular locations, or diversity of these sialidases. In addition, other streptococcal species may also contain previously undetected neuraminidases. While the donor(s) of blocks B and C as well as the 48-bp insertion (at residue 253) is unknown, the donor(s) of block A, the nanA 3' repeats, and/or block D is clearly related to sequences within some S. oralis isolates. The evidence provided for positive identification of a donor species in pbp genes was that the nucleotide variation between the S. mitis genes and recombinant blocks in resistant pneumococci was within the range of variation naturally found among pbp genes from different isolates of S. mitis (9). Applying these same principles to the data presented in this paper indicates that an S. oralis strain is the donor of block A, as the nucleotide divergence between block A from nanA and that from S. oralis (5.9 to 6.1%) is within the range of that between the six putative donor isolates of S. oralis (2.3 to 5.9%). The presence of sequences similar to the 3' repeats and/or block D in two of three S. oralis isolates suggests that a member of this species may also have been the donor for these sequences. Further Southern blot assays would be necessary to determine if both the repeats and proximal region are present in these isolates. In addition, it should be noted that another species could have been responsible for introduction of these sequences to both S. pneumoniae and S. oralis. Both the sequence and Southern blot assay support published reports of extensive genetic variation within S. oralis (1, 46, 48).
Despite the high level of diversity within nanA, it is still lower than that identified within the pneumococcal genes pspA and pspC, where multiple recombination events have led to complex mosaic alleles (5, 16). In the case of pspA, the blocks are so diverse that the boundaries of the recombinant blocks are impossible to determine. Hollingshead et al. (16) suggest that this extraordinary degree of mosaicism indicates the importance of this surface protein as a natural target for host defense against the pneumococcus. This indicates that pspA is often the target of positive or negative selection in the interaction between the pneumococcus and its host. It is possible that the high level of diversity identified within nanA, another surface-associated pneumococcal protein, indicates that this protein is under selective pressure due to antibodies elicited by the host, although no data exist to demonstrate that antibodies to NanA are elicited during human colonization. Sequence alteration may also affect the ability of the organism to utilize different naturally occurring sialic acids. Further studies would be required to determine whether any of the sequence alterations observed do affect evasion of the host immune system or enzyme kinetics. However, as there are no amino acid alterations identified within the aspartic boxes or predicted active site, alterations may be involved in immune system evasion.
The high level of sequence variation observed in this study is contradictory to that found by Hakenbeck et al. (14), who used DNA chip-based hybridization techniques and identified no variation within nanA, although mosaic structure was identified within pbp2x. Hakenbeck et al. used only 20 pneumococcal isolates in their research, and it is therefore possible that the oligonucleotides utilized do not include sequences representing mosaic blocks (14).
We can place the variation identified within the nanA gene in the wider context of pneumococcal diversity, as these 104 clinical and carried isolates were presented within a previously published multilocus restriction typing (MLRT) study (29). Eighty-three MLRTs were identified within the 104 strains. There are of course some examples of two strains that share similar nanA alleles and genetic background. However, the majority of strains show no association between nanA RFLP profiles and specific lineages or anatomical sites of isolation. The 11 strains selected for sequencing represented 11 different MLRTs. Furthermore, the MLRT data support multiple recombination events leading to the distribution of blocks within the S. pneumoniae population. For example, Cr51 and Cr33 (both containing the 48-bp insertion sequence) contained divergent restriction profiles at six of the nine loci utilized in the MLRT study.
This paper demonstrates that nanA genes from different S. pneumoniae isolates are divergent due to the presence of recombinant blocks, repeats, and an insertion. Recombination with multiple donors has occurred, at least one case of which appears to be an S. oralis isolate. The results of these recombination events have spread horizontally between pneumococci. The recombination event within the 3' region of nanA resulting in the introduction of the repeats and block D suggests a function for this region that is not essential for neuraminidase activity. The conservation of residues postulated to be involved in the active site or aspartic boxes implies that identified sequence divergence may provide a selective advantage in avoiding the host immune system. Furthermore, this suggests that nanA is an important target of the immune system in the interaction between the pneumococcus and host.
Present address: Department of Statutory and Exotic Bacterial Diseases, Veterinary Laboratories Agency, Addlestone, Surrey, United Kingdom KT15 3NB. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»