Previous Article | Next Article ![]()
Journal of Bacteriology, April 2006, p. 2375-2382, Vol. 188, No. 7
0021-9193/06/$08.00+0 doi:10.1128/JB.188.7.2375-2382.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Departments of Microbiology and Immunology,1 Medicine, Stanford University School of Medicine, Stanford, California 94305,2 Centers for Disease Control and Prevention, Atlanta, Georgia 30333,3 Department of Infectious, Parasitic and Immune-Mediated Diseases, Istituto Superiore di Sanità, 00161 Rome, Italy,4 Women's and Children's Hospital, North Adelaide, South Australia 5006, Australia,5 VA Palo Alto Health Care System, Palo Alto, California 943046
Received 7 November 2005/ Accepted 10 January 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The Bordetella genus of respiratory pathogens provides a rich model with which to explore questions of evolution of pathogenesis and host adaptation. The three species comprising the "classical" bordetellae have much in common, including very similar mechanisms of pathogenesis and a high degree of sequence similarity in shared genes, but they have different host ranges and abilities to survive outside a host. Bordetella bronchiseptica, which has a broad mammalian host range and the most complete set of metabolic pathways of the three species, is thought to be similar to the ancestral form of the group from which the more-host-restricted variants evolved by genome reduction (7, 28). Bordetella parapertussis infects only sheep and humans, and Bordetella pertussis is an obligate pathogen of humans.
B. pertussis causes whooping cough, a significant source of mortality against which vaccines have been widely used in developed countries for approximately 50 years. The medical importance of this pathogen has motivated numerous molecular and epidemiological investigations which have revealed several distinctive features of this species. One hallmark of B. pertussis is its extremely limited genetic variability, as assessed by multilocus enzyme electrophoresis (26, 37), multilocus sequence typing (8, 38), and previous microarray-based comparative genomic hybridization (CGH) analysis performed on a small number of strains (7). Because B. pertussis induces a long-lasting immune response in its hosts, which theoretically imposes selective pressure on the bacterium to evade immunity induced by prior infection and vaccination, the lack of polymorphism in the species is somewhat surprising and has been speculatively attributed to fitness costs associated with immune evasion or to frequent population bottlenecks (36).
Another unusual feature of B. pertussis is its exceptionally high load of insertion sequence (IS) elements. The sequenced strain, Tohama I, carries 261 IS elements, 238 of which are identical repeats of IS481 (28). These elements comprise over 6% of the genome and have transposed into many coding sequences, creating numerous presumably nonfunctional pseudogenes in Tohama I and also providing sites for homologous recombination throughout the chromosome. The IS elements appear to have expanded recently in the B. pertussis genome, as many fewer IS elements are found in B. bronchiseptica and B. parapertussis strains (28, 37).
As well as the loss of functional genes due to disruption by IS elements, several lines of evidence have led to the inference that B. pertussis has undergone significant genome reduction (7, 28). In addition, very little evidence of horizontal gene acquisition was observed in the genome sequence of the single strain so far examined (7, 28). To gain a better understanding of how this pathogen is evolving to meet the pressures of its restricted host niche without the benefit of novel genetic input, we obtained strains that represented a wide diversity of temporal and geographic isolation characteristics, as well as strains from before and during a large epidemic, and strains from hosts with different vaccination status. We subjected these strains to CGH, subtractive hybridization, global expression profiling, and gene order analysis. Consistent with previous studies, we found very little variation in gene content between strains. However, variation in gene order may provide a source of genetic variability necessary for evolution of this species. Additionally, we found that whole-genome expression profiles of a recent clinical strain changed significantly over 12 laboratory passages, indicating that despite extremely restricted variability in gene content, B. pertussis can alter gene regulation quickly when introduced to a new environment.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Microarray design and construction. An expanded Bordetella microarray was constructed based on that described in reference 7. A total of 5,670 PCR products, representing 97.4% of the B. pertussis Tohama I open reading frames (ORFs), 97.9% of the B. parapertussis 12822 ORFs, and 98.5% of the B. bronchiseptica RB50 ORFs, were printed in duplicate on poly-L-lysine-coated glass slides. Details of microarray printing have been previously reported (7).
Comparative genomic hybridization. Genomic DNA from each strain was labeled and hybridized to the arrays along with mixed reference DNA (consisting of equimolar quantities of genomic DNA from each of the three sequenced strains used in construction of the array). Technical details are in the methods of the supplemental material. The threshold for calling an array element "not detected" was determined individually for each array based on its distribution of log ratios, as previously described (7). Regions of difference (RDs) were defined as two or more adjacent array elements (separated by a maximum gap of one probe) not detected in at least 3 (of 137) strains. Data have been deposited in ArrayExpress (http://www.ebi.ac.uk/arrayexpress/) with accession number E-TABM-54.
Subtractive hybridization.
Suppression subtractive hybridization was carried out using the PCR-Select bacterial genome subtraction kit (Clontech, Mt. View, CA), with modifications as previously described (7). The tester DNA was a pool of equal amounts of genomic DNA from five B. pertussis strains chosen to represent different geographical locations and time periods (Bpe37, Bpe43, Bpe183, WCH22, and Bp12464). A control tester pool was prepared by adding an equimolar amount of HaeIII-digested
X174 phage DNA after completing the initial RsaI digestion step of the protocol. The driver DNA was a pool of equimolar amounts of genomic DNA from the sequenced strains B. pertussis Tohama I, B. parapertussis 12822, and B. bronchiseptica RB50, as well as 40 fragments isolated from an ovine isolate of B. parapertussis that were not found in the sequenced Bordetella strains (data not shown). From the subtraction using the control tester pool, 52 randomly chosen fragments were sequenced. From the subtraction using the experimental tester pool, 480 fragments were recovered, cloned, reamplified, and printed on microarrays. The microarrays were hybridized with each of the five strains in the tester pool versus the driver pool, and the 10 spots from each array with the highest ratios of tester versus driver were chosen for sequencing.
Microarray-based chromosomal mapping. Genomic DNA was prepared in agarose plugs, digested with SwaI, and resolved by pulsed-field gel electrophoresis (PFGE) essentially as described elsewhere (25; see also the Methods section in the supplemental material). Each band was excised and treated with ß-agarase (New England Biolabs, Ipswich, MA) according to the manufacturer's instructions. DNA from each band was isopropanol precipitated in the presence of 30 µM spermine, 70 µM spermidine, and 100 mM NaCl and then labeled and hybridized to microarrays as described in the methods section of the supplemental material.
Analysis of transcript abundance profiles. Mid-log-phase cultures were diluted in fresh Stainer-Scholte medium to a calculated absorbance of 0.05 at 600 nm. Half of each culture was removed to another flask, and MgSO4 was added to a final concentration of 75 mM to modulate the strains into the Bvg phase. When cultures reached mid-log phase (optical density at 600 nm, 0.8 to 2.6), 1 ml of culture was centrifuged at 16,000 x g for 30 seconds, the supernatant was removed, and the cell pellet was frozen at 80°C. Total RNA was isolated using the RNAqueous-4PCR kit (Ambion, Austin, TX) with an additional lysis step using lysozyme (0.4 mg/ml). RNA labeling and hybridization are described in the methods section of the supplemental material. Significance analysis of microarrays (SAM) (35) was performed using all spots with data in at least 67% of strains (triplicate cultures were averaged together).
| RESULTS |
|---|
|
|
|---|
|
Pseudogenes were overrepresented in those RDs that consisted of Tohama I genes (Table 1). In addition, two RDs (RD1 and RD19) appeared to be decaying prophages. Genes in the functional category "adaptation," which includes those involved in iron uptake and copper resistance, also were overrepresented in the RDs, suggesting that these genes may be unused or possibly even detrimental to these bacteria in some situations (see Table 1 for explanation of functional categorization of genes). The categories "metabolism of small molecules" and "macromolecule metabolism and ribosomal genes" were significantly underrepresented in the RDs, indicating that these genes are under selection to be retained, as expected given their essential functions. Interestingly, although the chemotaxis and motility loci in Tohama I contain several pseudogenes and IS481 elements and B. pertussis motility has never been observed (3), no genes in this functional category appeared in any of the RDs, although the number of genes was too small for this underrepresentation to reach statistical significance. Notably, no known virulence factors were found among the RDs.
|
The opposite trend was observed for RD13 (BP2627-2629), RD16 (BP3104-3109), RD18 (BP3314-3322), and RD23 (BB0917- 0921), all of which were detected predominantly in strains collected in the postvaccine era. In each case, some prevaccine strains did contain the genes in these RDs, suggesting that the deleted strains were replaced by other strains circulating at the time that contained these genes.
The only strain with unusually high divergent gene content was the ATCC type strain 18323, which also has been differentiated from most B. pertussis strains by other typing methods (5, 23, 26, 40). This strain had the largest proportion of genes called "not detected" (5.3%), and the locations of not-detected regions frequently did not align with such regions in any other strain. In addition, strain 18323 contains genes found in other Bordetella species but not present in any other B. pertussis strains (5, 23, 26, 40). The fact that this strain is missing genes that are present in every other B. pertussis strain surveyed indicates that such genes are not essential for survival of the bacterium but may be important for achieving high levels of circulation within human populations (only one other B. pertussis strain similar to 18323 has been reported [5, 40]).
Investigation of B. pertussis gene acquisition by subtractive hybridization. A limitation of microarray-based CGH is that it can detect only genes that are present in the strain(s) used to construct the microarray. Therefore, to determine if other strains contain additional DNA, we performed subtractive hybridization. Genomic DNA from five B. pertussis strains from different countries and time periods was pooled and subjected to suppression subtractive hybridization, using a driver pool of DNA from the sequenced strains of B. pertussis, B. parapertussis, and B. bronchiseptica, supplemented with sequences from an ovine isolate of B. parapertussis. Four hundred eighty recovered fragments were screened by microarray hybridization analysis to find those most likely to be unique to the tester pool and 36 of these were sequenced, but BLASTN analysis indicated that all of them shared very high (or complete) nucleotide identity with sequences found in the driver pool. Thus, no novel sequences were recovered from the tester pool.
To gain an estimate of the sensitivity of this technique, the tester DNA pool was spiked with an equimolar quantity of HaeIII-digested
X174 phage DNA. Seventy-two percent of the
X174 spiked DNA (3,858/5,386 bp; 6 of 11 fragments) was recovered after sequencing 52 randomly chosen fragments from this subtraction. Because the experimental subtractive hybridizations included an additional screening step, we expect that the technique should be sensitive enough to detect most unique B. pertussis genes, and almost certainly any large insertions, that are present in the tester sequences.
Detection of genome rearrangements by microarray-based chromosomal mapping. Recombination between the many IS elements in B. pertussis may result in chromosomal deletions and inversions, depending on the orientation and location of the repeated sequences. Such chromosomal rearrangements are common in B. pertussis (32, 33). Many of the deletions observed by CGH appear to be the product of recombination between two IS elements that were in the same orientation on the chromosome. We employed a novel technique to investigate the extent of genome rearrangement and the role of IS elements in mediating it. Genomic DNA was digested into large fragments, and the gene content of each fragment was determined using microarray hybridization. SwaI digestion of Tohama I, the sequenced strain of B. pertussis, produced five bands of the predicted sizes, as assessed by PFGE. When DNA from each of these bands was hybridized to microarrays, the array elements detected showed excellent correspondence with the genes expected to be on each fragment according to the genome sequence, confirming the accuracy of this technique (data not shown).
Digestion with SwaI of genomic DNA from independently isolated B. pertussis strains resulted in a remarkable diversity of PFGE banding patterns (Fig. 2, top panel). Determination of the genes present in each of the bands produced by SwaI digestion of three isolates, Bpe43, Bpe336, and Bpe337, revealed that they are distinguished from Tohama I by several genomic rearrangements (Fig. 2, bottom panel). Each of the bands from these strains contained one to four blocks of genes which were contiguous in the Tohama I genome, indicating that multiple chromosomal inversions had occurred during the divergence of these strains to produce different arrangements of the same genetic material. At each apparent recombination junction, an IS481 element was present in the sequence of Tohama I, providing strong evidence that most of the chromosomal rearrangements are mediated by IS elements.
|
Analysis of differences in transcript abundance and Bvg regulation between pre- and postvaccine strains and laboratory-passaged strains. CGH and subtractive hybridization indicated that gene content was very similar among all isolates, but most intergenic regions are not represented on the array and neither method is able to detect small genetic alterations (e.g., point mutations, frameshifts, and short insertions or deletions) which could produce significant changes in expression levels. Indeed, several studies have found variation in gene expression even among strains with similar gene content (11, 12, 16, 24, 39). In addition, IS-mediated chromosomal rearrangements could cause genes to be moved to positions near different transcriptional regulatory motifs, resulting in different expression patterns.
To look for correlations between expression profiles and epidemiology of B. pertussis, we chose four Dutch strains from the prevaccine era and four from the postvaccine era, since these collections showed the most variation in gene content (although still quite limited). Only nine array elements were found to differentiate between the two groups of strains (determined by SAM; false discovery rate of 0% [data not shown]). These genes did not group together into apparent operons or categories of functional significance, and there were many more strain-specific differences than differences between the prevaccine and postvaccine groups. Therefore, we next chose to focus on a smaller number of strains with known relationships to each other.
The ancestral and passaged descendant (224 passages on plates) of Tohama I, a laboratory strain, showed very few differences in transcript abundance under Bvg+ (virulent) and Bvg (avirulent) conditions (four genes with a 20% false discovery rate and 10 genes with a 9% false discovery rate, respectively). In contrast, there were 38 and 41 genes (false discovery rate < 2%) with different transcript abundances in the ancestral and passaged strains of the clinical isolate Bpe280 under Bvg+ and Bvg conditions, respectively, after only 12 passages (Fig. 3). Of particular interest, transcripts of two genes involved in capsule biosynthesis were detected at lower levels in the passaged strain under Bvg conditions, which may provide an explanation for the conflicting reports about whether B. pertussis produces a capsule (see the discussion in reference 28). Similarly, transcripts of several genes for chemotaxis and motility were present at lower levels in the passaged strain under Bvg+ conditions, although it is not known whether these proteins are expressed. The fim3 gene, encoding an adhesin, also had lower transcript levels in the passaged strain of Bpe280, and the same pattern was seen in Tohama I when a lower stringency was used for determining expression differences.
|
| DISCUSSION |
|---|
|
|
|---|
However, not all bacterial pathogens exhibit substantial variation in gene content. Some species, such as Pseudomonas aeruginosa, maintain large genomes which confer adaptability to diverse environments (41). Other species, such as Yersinia pestis and B. pertussis, have evolved from more-generalist ancestors primarily through extensive genome degradation (1, 28). While gene acquisition also appears to have played a role in the evolution of Y. pestis (42), analysis of Bordetella genome sequences (28) and data presented here indicate that little or no gene acquisition has occurred during the divergence of B. pertussis from B. bronchiseptica.
In the large and diverse collection of strains surveyed here, only 6.4% (248 of 3,849 array elements) of the B. pertussis genome was found to be variably present. No known virulence-associated genes were located in RDs, nor were RDs exclusively associated with collections of strains from either before or after vaccination. Although some sets of RDs are missing in almost all the same strains, no two RDs show complete linkage. Instead, the pattern of RDs across the strains has a striking mosaic quality, suggesting that the RDs are not solely vertically inherited. Several nonexclusive factors may account for this observation. Some regions may be hot spots for gene loss, either due to a local feature of the genome (e.g., accessible chromosomal structure or spacing of IS elements) or because the genes in these RDs are nonessential. In this scenario, we would expect that in the absence of new or frequency-dependent selection pressures, all strains eventually would lose all variably present genes. The fact that this state is not observed may indicate that the strains are in an intermediate stage of genome reduction.
Another possibility is that variability in RDs is maintained in the population due to a form of frequency-dependent selection (21), such that an uncommon type of strain is favored due to acquired host immunity against the more common circulating strains. This model assumes that each RD encodes an immunogenic protein, the deletion of which would confer a selective advantage if the acquired ability to evade immunity outweighed the fitness cost of losing gene products in the RD. If the population size was large enough to avoid stochastic extinction, the proportions of various types of strains in the population might fluctuate indefinitely. The mosaic structure of RDs also could be maintained due to independent events of gene loss and interstrain gene transfer, a process that might occur because of a selective advantage conferred by temporary possession of the RDs or as a by-product of the large numbers of IS elements in the genomes.
Homogeneity in gene content among B. pertussis strains belies an extraordinarily high amount of genome rearrangement in this species, as first demonstrated by low-resolution chromosomal mapping (32, 33). Homologous recombination between high-copy-number IS481 elements is a likely mechanism for chromosomal rearrangement (28). Using a technique combining PFGE and microarray hybridization, we discovered multiple sites of genome rearrangement in five strains. All but one of the apparent recombination junctions in these strains were adjacent to IS481 elements. However, the rate of IS-mediated rearrangements during natural infection and transfer or lab passage is unknown. We found no changes in gene content of SwaI fragments in the laboratory strain Tohama I after it was passaged over 200 times on plates or in a recent clinical strain after 12 passages. Most previous investigations have not detected differences in PFGE types (predominantly using XbaI restriction patterns) after a small number of lab passages (2, 15), although one report indicated changes in PFGE types after approximately 10 passages (4). Comparison of PFGE typing of one strain used for production of a Swedish whole-cell vaccine revealed different patterns between vials prepared over an 8-year span, and occasionally within the same vial, but no changes were observed when several isolates were serially subcultured eight times (2). Laboratory conditions may not stimulate or select for these rearrangements to the same degree as natural infection conditions, and so we have initiated investigations into whether rearrangement occurs during chains of transmission of B. pertussis from one host to another.
Although the significance of nonprogrammed chromosomal rearrangement for bacterial evolution has not been heavily studied, some observations suggest that spontaneous chromosomal rearrangement may play a role in the evolution of other bacteria that bear high loads of insertion elements. For example, 9 out of 10 sucrose-resistant Burkholderia mallei mutants had deletions of sacB that were apparently mediated by recombination between flanking IS copies (27). In addition, several authors have proposed that the large number of IS elements in Y. pestis may contribute to rapid adaptive microevolution (34, 42).
Even in bacteria without large numbers of repeated sequences, chromosomal rearrangement has been observed in strains or species that are found only in specific niches, including host-specialized Salmonella species (13) and P. aeruginosa isolates from chronic lung infections in cystic fibrosis patients (17). Furthermore, impaired host colonization of a recombination-deficient Helicobacter pylori mutant strain suggests a functional role for chromosomal rearrangement in H. pylori pathogenesis (29). Like mutator phenotypes, which are observed in some pathogenic species (14, 19, 20), high-frequency chromosomal rearrangement may provide transient variation for rapid adaptation to a new host environment in host-restricted or niche-adapted strains such as B. pertussis.
The high IS load of B. pertussis may reflect a recent proliferation of the repeat elements during an expansion into a new niche that imposed less selective pressure against the concomitant disruption of genes (28), and the perceived "maintenance" of IS elements may be completely nonadaptive. In fact, IS-mediated gene disruption in B. pertussis could be deleterious most of the time, but perhaps the concomitant generation of diversity provided by genome rearrangement is important for the continuing survival of B. pertussis in its ever-changing human niche.
Effects of genome rearrangement might be manifested at the level of transcription, because genes near the rearrangement breakpoints could be put under the control of new transcriptional regulatory elements. Comparison of expression profiles from ancestral and passaged strains of Tohama I and Bpe280 (a recent clinical isolate) revealed a number of differences between the ancestral and passaged isolates, some of which may be due to undetected genome rearrangements or other mutations. For a bacterial species with constrained gene content, such as B. pertussis, alteration of expression patterns is an efficient way to respond to selective pressures. Several reports support the proposition that closely related strains, including those derived by lab passage or growth in a single host, can show significant differences in expression profiles (6, 11, 12, 16, 24, 39). The observation that expression profiles change over relatively few passages has significant implications for the study of B. pertussis, because the lack of a natural animal host for this species requires many investigations of gene regulation and function be conducted in vitro, often with laboratory strains that have been passaged many times. In a broader evolutionary sense, the finding that B. pertussis displays variation in gene expression provides counterbalance to the high level of gene content homogeneity observed in this species and suggests that this pathogen can respond to the dynamic pressures of host immunity by altering regulation of a restricted set of genes, without the benefit of the genetic input that appears to play a significant role in the evolution of other pathogens.
| ACKNOWLEDGMENTS |
|---|
This study would not have been possible without the skill and generosity of many people who provided strains from their B. pertussis collections, especially Frits Mooi (RIVM, Bilthoven, The Netherlands). We thank Sin-Yee Liew and Mari Nakamura for assistance with microarray preparation. We are grateful to Alok Saldanha and Pat Brown for helpful discussions about mapping gene order.
| FOOTNOTES |
|---|
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |