Previous Article | Next Article ![]()
Journal of Bacteriology, July 2004, p. 4228-4237, Vol. 186, No. 13
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.13.4228-4237.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Christian van Delden,3 Burkhard Tümmler,2 José Luis Martínez,1 and Fernando Rojo1*
Departamento de Biotecnología Microbiana, Centro Nacional de Biotecnología, CSIC, Campus U.A.M., Cantoblanco, E-28049 Madrid, Spain,1 Klinische Forschergruppe, Pediatric Pneumology, Medizinische Hochschule Hannover, D-30625 Hannover, Germany,2 Department of Genetics and Microbiology, University of Geneva, CMU, CH-1211 Geneva 4, Switzerland3
Received 11 December 2003/ Accepted 24 March 2004
|
|
|---|
|
|
|---|
An important question regarding P. aeruginosa is whether the strains isolated from infected patients correspond to specialized strains adapted to clinical habitats or whether the virulence of this bacterial species results from a set of traits that are present in most or all strains from any environment. Several data support the view that the strains isolated from nonclinical environments (referred to hereafter as environmental strains) are indistinguishable from clinical isolates in terms of several genotypic, taxonomic, or metabolic properties (2, 6, 10, 11, 17, 32, 33). Furthermore, a recent whole-genome analysis of the presence or absence of strain-specific genes within a set of 18 strains isolated from clinical and nonclinical habitats revealed no correlation between genome content and infection type as well as a remarkable conservation of genes, including those encoding most known virulence factors (48).
Current knowledge indicates that the P. aeruginosa genome is made up of a mosaic of a conserved core and variable accessory segments (10, 14, 17, 31, 41). The core genome is characterized by a conserved synteny of genes and a low average nucleotide substitution rate (about 0.5%). Only 2.5% of the coding sequences exhibit significantly higher sequence diversity. Clone- or strain-specific genome islands define the variable part of the chromosome (4, 10, 16, 19, 21, 32, 36) and lead to fluctuations in the genome size, which can range from 5.2 to 7 Mbp (36).
Single nucleotide polymorphism (SNP) genotyping of six genes of the core genome in a collection of 19 environmental and clinical P. aeruginosa strains and a restriction fragment length polymorphism (RFLP) analysis of their chromosomes revealed a high sequence conservation in four of the genes and a high frequency of recombination within the chromosome, leading to a random association of alleles (17). This indicates that the P. aeruginosa gene pool is in linkage equilibrium. Due to the frequent exchange of genomic islands and accessory DNA segments, P. aeruginosa populations would consist of a series of equivalent genotypes formed by related strains (termed clones), which form a net-like population structure (17). A different collection of 73 environmental and clinical isolates collected from diverse sources was studied by combining the analysis of some phenotypic traits, the DNA sequences of three genes, and amplified fragment length polymorphism pattern analysis (28). The results showed that the isolates could be grouped into clusters but that the clusters obtained by the different experimental approaches were not always congruent. Amplified fragment length polymorphism analyses suggested the existence of groups of strains (clones) having a related chromosome structure, although the relatedness and organization of this clonal population were obscured by DNA rearrangements and insertion of large DNA segments at conserved regions of the otherwise highly conserved chromosomal backbone. The detection of clonal complexes suggested a transient epidemic-like spread of certain genotypes. This epidemic distribution of P. aeruginosa populations is more clearly detected when the strains analyzed are collected in a small region or in a specific environment (3, 22, 27).
A clear view on the structure and dynamics of P. aeruginosa populations is relevant both to understand the biology of this ubiquitous bacterial species in different habitats and to perform epidemiological studies to trace nosocomial infections (27). Current analyses on this topic have been performed focusing either on a few markers on many strains or on many markers on only a few strains (3, 6, 10, 17, 22, 27, 28, 48). However, a consistent picture of P. aeruginosa populations would benefit from the analysis of large numbers of strains by using many, rather than a few, genetic markers and by using highly discriminative tests. To this end, 111 strains were selected, mainly from our own collections of more than 5,000 isolates, to gather a group containing strains from a broad range of sources and habitats (both clinical and nonclinical environments) and in which most of the strains represented a unique clone according to the SpeI macrorestriction fragment fingerprint, which is the current "gold standard" for genotyping P. aeruginosa (25). We have looked for SNPs in seven conserved loci of the core genome in these 111 strains. This has led to find a combination of 14 SNPs which, together with the type of fliC gene variant present (a or b), allows the assignment of a highly discriminative signature to individual strains. Assuming linkage equilibrium, we could expect free recombination of loci and one unique SNP marker genotype per clone. However, in some cases, an SNP genotype was shared by two or more strains. The possible meaning of these findings is discussed.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. SNP- genotypes of P. aeruginosa strains from different origins
|
PFGE. Samples were analyzed as described in reference 33, with minor modifications. P. aeruginosa strains were grown overnight at 37°C in Luria-Bertani medium, centrifuged for 10 min at 1,500 x g, and suspended in 75 mM NaCl and 25 mM EDTA (pH 7.4) to a concentration of 5 x 109 cells/ml. The cell suspension was mixed 1:1 with 2% (wt/vol) low-melting-point agarose to prepare the agarose plugs. The embedded cells were digested for 48 h at 56°C with proteinase K (1 mg/ml in 0.5 M EDTA [pH 9.5], 1% [vol/vol] N-lauryl-sarkosine). Agarose plugs were equilibrated and stored in 10 mM EDTA and 10 mM Tris-HCl (pH 7.4) at 4°C until used. One-third to one-half of a plug was equilibrated in SpeI buffer (50 mM NaCl, 6 mM Tris-HCl [pH 7.5], 10 mM MgCl2). Digestion was performed overnight at 37°C in 90 µl of enzyme buffer with 4 U of SpeI, 0.1 mg of bovine serum albumin/ml, and 5 mM dithiothreitol. Separation of DNA fragments was performed by PFGE in a CHEF-DR III apparatus (Bio-Rad), with a 1.5% (wt/vol) agarose gel, 0.5x Tris-borate-EDTA buffer, and a linear ramping from 8 to 50 s for 24 h, 12 to 25 s for 22 h, and 1 to 14 s for 14 h. DNA fragments were visualized by ethidium bromide staining. Genotypes were defined from the evaluation of SpeI fragment patterns as described in reference 30.
SNP analysis by DNA sequencing. To search for SNPs in alkB2, this gene was sequenced in 20 strains (listed below). To this end, genomic DNA was PCR amplified by using either AlkB2-1 and AlkB2-2 or AlkB2-3 and RR1-20 as primers (Table 2). The DNA fragments obtained were sequenced with the same primers as above.
|
View this table: [in a new window] |
TABLE 2. Primers used for PCRs and endonucleases used to analyze SNPs at the indicated loci by RFLP
|
Analysis of fliC. P. aeruginosa strains contain one of two variants of the fliC gene (a type and b type), which encode flagellins of different molecular weights (1). Therefore, a PCR approach was undertaken to identify which variant was present in each strain. The oligonucleotides used for PCR, indicated in Table 2, rendered DNA fragments of 997 or 1,018 bp (a type fliC) or of 1,300 bp (b type fliC).
Nucleotide sequence accession number. The EMBL GenBank accession numbers for alkB2 are as follows: strain RR1, AJ33602; strain 892, AJ633605; strain K9, AJ633603; strain G7, AJ633606; strain SG1, AJ633604; strain DSM 1128, AJ633607; strain DM, AJ633608; strain HJ2, AJ633609; strain 63741, AJ633610; strain SG31, AJ633611; strain ATCC 10145, AJ633612; strain ATCC 15691, AJ633613; strain ATCC 33356, AJ633614; strain ATCC 33818, AJ633615; strain PAK, AJ633616; strain 19G12, AJ633617; strain CECT 119, AJ633618; strain ATCC 15524, AJ633619; strain CHA, AJ633620.
|
|
|---|
Determination of SNPs in selected genes.
Seven loci were analyzed for SNPs suitable for discrimination of P. aeruginosa strains. The loci were chosen according to several criteria: they should be highly conserved, they should belong to different categories (regulatory, structural, or metabolic genes), and they should be evenly distributed through the P. aeruginosa genome. Several SNPs had already been described at oriC (the origin of replication), citS (citrate synthase), ampC (chromosomal beta-lactamase), oprI (outer membrane protein OprI), fliC (flagellin) (17), and oprL (28). A subset of these reported SNPs in which the less frequent sequence variant was present in more than 15% of the analyzed strains were selected for our analyses. Sequencing of alkB2 in P. aeruginosa strain RR1 had shown the presence of several SNPs relative to strain PAO1 (24). To determine whether any of these SNPs could be useful to our study, alkB2 was sequenced in 20 P. aeruginosa strains (strains and EMBL GenBank accession numbers are indicated in Materials and Methods). Forty-two SNPs were found. Thirty-nine of them were present in less than 15% of the strains analyzed and were therefore discarded. One of them (at position 1017, with numbering relative to the translation start site) was present in all but the reference strain PAO1 and was discarded as well. Two SNPs were detected that changed in more than 15% of the 20 strains, namely G
A at position 471 and A
G at position 1047. These two SNPs were chosen for further analyses. Therefore, the complete set of SNPs selected to analyze the P. aeruginosa population structure included one at oriC, two at citS, one at oprI, seven at ampC, one at oprL, and two at alkB2 (Table 1). P. aeruginosa strains are known to encode either a- or b-type flagellins that differ by 35% in primary structure (1, 39). Since these two fliC variants (a type and b type) can be used as a discrimination test (39, 40), the presence of each variant was also evaluated as a discrimination tool.
An important aspect to take into account when using SNPs for strain typing is whether the loci and the SNPs considered show genetic linkage or are rather randomly associated (this is, they are in linkage equilibrium). Previous work had shown that oriC, citS, ampC, oprI, and fliC are not genetically linked (17). Furthermore, the seven SNPs selected at ampC have a random association as well (40). As will be shown below, the complete set of SNPs analyzed in this work are also in linkage equilibrium. Therefore, the selected SNPs can be used as a way to discriminate between strains.
Analysis of the selected SNPs in the complete P. aeruginosa strain collection (composed of 111 strains) was performed by RFLP of PCR-amplified DNA, since in most cases, one of the variants of the SNP corresponds to a target for a restriction enzyme which is missing in the alternative variant. In the case of oprL, where such a target was not available, an artificial restriction site was created at one of the oligonucleotides used in the PCR. The results of the RFLP analyses are shown in Table 1. The profile of SNPs for each strain is referred to as the SNP genotype. Strain PAO1 was taken as the reference strain, and its sequence for each SNP is indicated in Table 1. For the other strains, the sequence is shown only when it differs from PAO1. The genes citS, ampC, oprI, and fliC, as well as the oriC locus, had been previously fully sequenced in some of the strains tested, namely strains 892, K9, G7, DM, 63741, ATCC 10145, DSM 1128, HJ2, ATCC 15691, ATCC 33356, ATCC 33818, ATCC 21776, H2, SG1, and SG31 (17). The results obtained by RFLP were fully consistent with the reported sequence data, indicating that the method is reliable. Furthermore, in the case of the 20 strains in which the alkB2 gene had been sequenced, the RFLP method also gave consistent results.
Linkage analysis and frequency of SNPs. To analyze the possible linkage among all of the SNPs characterized, we determined the index of association (IA), which is a measure of linkage disequilibrium (38). IA is defined as VO/(VE1), where VO is the observed variance and VE the expected variance of the mean number of SNPs at which two P. aeruginosa strains differ. Calculations were performed by using the software available at http://www.mlst.net. The value of IA was estimated by generation of 1,000 randomized data sets under the assumption of random association of loci. Using each genotype as a unit, as described before (17), the value of IA was 0.161, which indicates no evidence of association among the different SNPs analyzed, with a significance level of P < 0.001. The SNP set can therefore be used for strain typing.
An additional important requisite for an SNP to be useful in strain typing is that the less-frequent variant should be present in a significant fraction of the clones, with 50% being the ideal value. Table 3 shows the frequency of each SNP in the P. aeruginosa collection screened. The less-frequent variants of the SNPs selected in this study were present in 15 to 49% of the strains, values that are high enough to consider these SNPs suitable for strain typing. Since no linkage was detected among the SNPs characterized, the theoretical discrimination efficiency of this set of SNPs can be estimated by multiplying the accumulated frequencies of all of the SNPs considered. If this hypothesis is correct, the discrimination ability of this set of SNPs ranges from 2.7 x 103 in the worst case (a strain with all of the most frequent SNPs) to 1.5 x 108 in the best case (a strain with all of the less-frequent SNPs).
|
View this table: [in a new window] |
TABLE 3. Frequencies of SNPs
|
![]() View larger version (44K): [in a new window] |
FIG. 1. Shared SNP profiles. The locations of the SNPs at each gene are indicated both by position and by the restriction enzyme used in the RFLP analysis, identified in Table 1. Strain groups are as follows: 1, PAO1, 3D10, SFQ47, and 18F8; 2, ZW88 and PT20; 3, K9 and VSF17; 4, G7 and SS1; 5, ZW92 and DGA138; 6, RP1, HJ2, ZW79, NCC81, and 60B5; 7, DSM 288 and DSM 1253; 8, ATCC 15691 and A5670; 9, AL5846 and PT36; 10, PT22, CHA, and ATCC 14886; 11, 892 and 63741; 12, SG1 and SG31; 13, ATCC 15522, ATCC 15524, and ATCC 15528. Changes relative to strain PAO1 are shaded. The expected frequency of each SNP combination is indicated in the rightmost column, obtained by multiplying the frequencies at which each of the 14 individual SNPs appears in the strain collection analyzed, plus the type of fliC, as deduced from Table 3.
|
PFGE analysis. To gain additional information on the genome organization of the strains sharing the SNP genotype, the SpeI fragment patterns of strains having the same SNP genotype were compared on adjacent lanes by PFGE (Fig. 2). Strains PAO1, 3D10, and 18F8, which share the same SNP profile, also show the same macrorestriction pattern. It was expected that strains 3D10 and 18F8 share the same SpeI macrorestriction pattern, since they are clonal variants intentionally introduced as an internal control in the typing assays. However, it was not expected that they would share the pattern with strain PAO1. Strain SFQ47, which shares the SNP genotype with PAO1, 3D10, and 18F8, has a different PFGE profile. Similarly, strains RP1 (clinical, Germany; clone J), HJ2 (clinical, Germany; clone M), ZW79 (clinical, United Kingdom), NCC81 (clinical, Spain), and 60B5 (clinical, Switzerland) share the same SNP genotype but have different SpeI macrorestriction patterns. Strains PT22 (environmental, Germany), CHA (clinical, France), and ATCC 14886 (environmental) show PFGE profiles that, although related in their fragment pattern, differ in several bands in spite of having the same SNP genotype. Finally, strains VSF17 (isolated in Spain) and K9 (Germany), which also share the SNP profile, have different macrorestriction patterns. Similar PFGE assays (data not shown) indicated that strain pairs G7 (clinical, Germany; clone G) and SS1 (clinical, Germany; clone K), DSM 288 and DSM 1253, ATCC 15691 (type strain) and A5670 (clinical, Germany), and PT20 (environmental strain, Germany) and ZW88 (clinical, United Kingdom) have different macrorestriction profiles, although they share identical SNP genotypes.
![]() View larger version (117K): [in a new window] |
FIG. 2. SpeI macrorestriction profiles of strains sharing the same SNP genotypes, performed by PFGE. The strains analyzed corresponded to groups 1 (PAO1, 3D10, SFQ47, and 18F8), 2 (VSF17 and K9), 6 (RP1, HJ2, ZW79, NCC81, and 60B5) and 10 (PT22, CHA, and ATCC 14886), as defined in the legend to Fig. 1.
|
|
|
|---|
A detailed analysis of the results obtained showed that several strains shared the same SNP genotype. Some of the strains sharing the same SNP profile had different SpeI macrorestriction profiles. In other cases, the SpeI macrorestriction profiles were clearly related, although they differed in several bands. An interesting case is that of isolates PT22, ATCC 14886, and CHA, which shared their SNP genotype and had different, although related, SpeI macrorestriction profiles. Strains CHA and ATCC 14886 shared not only the SNP genotypes but the full sequence of all the loci analyzed in this work, which highlights the relatedness of these two strains. Strain CHA is a highly virulent clinical isolate, whereas strains PT22 and ATCC 14886 were isolated from nonclinical (environmental) habitats. This observation supports the view that P. aeruginosa isolates that thrive in nonclinical habitats have probably all needed traits to infect mammals (2). Another interesting example is that of strains 3D10, 18F8, SFQ47, and PAO1, which are clinical isolates from different countries. On the whole, 13 groups of strains sharing SNP genotypes were found, 8 of which (groups 1, 2, 3, 4, 6, 7, 8, and 10) (Fig. 1) include strains that have different SpeI macrorestriction profiles. Strains K9 and VSF17 shared an SNP genotype that should appear with a very low theoretical frequency, 9.8 x 107, a value that suggests that this finding is not casual. Previous analyses have demonstrated that some PFGE-defined clones have widespread distribution (33). Indeed, two of the five strains from group 6 (Fig. 1), which have different SpeI genotypes but the same SNP genotype, belong to the most abundant clones in our strain collection (strains RP1 and HJ2, which belong to clones J and M, respectively). This collection includes isolates from numerous sources of diverse geographical origin (33). It is thus conceivable that these overrepresented genotypes define strains that are widely distributed. Notably, the PAO1 genotype is also overrepresented. Strain PAO1 was originally isolated form a burned patient in Australia 50 years ago (45). This suggests that the most prevalent strains are maintained in the environment for at least several decades, which may be the consequence of a better ecological adaptation. It should be noted that the bias deliberately introduced in our strain collection to avoid clonal variants has to be taken into account when using our results to evaluate the P. aeruginosa population structure. It is clear that our collection underscores the possible epidemic structure of natural populations, but we still detected the presence of some prevalent strain types that had a highly related core sequence (same SNP genotype), although they differed to a substantial extent in the SpeI macrorestriction profile.
Our results show that the use of RFLP for SNP genotyping can be a very powerful technique to discriminate between strains, but it is still unable to discriminate among highly related strains (for example, those having the same SNP profile but different macrorestriction patterns). This can have practical consequences when attempting to use this method to perform epidemiological studies to monitor an outbreak of a particular strain. To this end, the discriminatory ability should be improved even more by the addition of other markers or traits, for example, the presence or absence of genomic islands or of genes known to be present in some but not all P. aeruginosa strains.
The accumulation of molecular data in recent years and the growing evidence of the occurrence of horizontal gene transfer among bacteria in nature (reviewed in reference 37), have led to consideration that bacterial populations are not invariably clonal but range from the highly sexual Neisseria gonorrhoeae to the almost strictly clonal Salmonella (38). In this work, we have used both PFGE and SNP genotyping to analyze the P. aeruginosa population structure in a large and heterogeneous collection of strains. The sequence analysis of the loci studied here indicate that the core genome of P. aeruginosa is highly stable during evolution, since very few changes (even at the third position of the amino acid codons) were detected in strains isolated from different places and within a 50-year time lapse (17; this work). A similar result has been recently obtained for Escherichia coli with an in vitro model of bacterial evolution (20). On the other hand, the results obtained with the PFGE analysis support the view that, during evolution, large fragments of DNA (genome islands) can be excised from the genome or integrated into it, which then results in different macrorestriction patterns when analyzed by PFGE (16, 32) and confers genome diversity to this bacterial species (19). The population structure of P. aeruginosa is still under discussion, although recent data point to the idea that this species can display an epidemic population structure (28). The conservation of the SNP genotypes and the divergence of SpeI macrorestriction patterns in strains sharing the same SNP profile agree with the idea that the core genome of P. aeruginosa is highly conserved and that its evolution and structure rely more on acquisition, loss, and rearrangements of genome islands and genome islets than on point mutations. Therefore, and as noted earlier (17, 28), RFLP analysis of the chromosome and SNP analysis of individual genes measure different evolutionary forces. Horizontal gene transfer has an important role in bacterial adaptation to different habitats. In particular, P. aeruginosa populations can exchange large DNA blocks that integrate at specific sites (4, 17-19, 21, 32, 36). P. aeruginosa is characterized by its great biochemical and ecological versatility, and the genome size can vary as much as 30% in different isolates. In other words, horizontal gene transfer may have a more important role than point mutations on the adaptation of P. aeruginosa to different habitats.
This work was supported by grant QLK2-CT-2001-01339 from the Vth framework program of the EU and by grants BIO2000-0939 and BIO2001-1081 from the Spanish Ministry of Science and Technology. C.v.D was supported by grant FN 3231-51940.97 from the Swiss National Foundation and grant OFES 01.0122 from the Swiss Federal Office for Education and Science. P.G. was supported by a grant from the Mukoviszidose e.V.
Present address: Universitätskinderklinik, Universität Düsseldorf, D-40225 Düsseldorf, Germany. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»