Previous Article | Next Article ![]()
Journal of Bacteriology, February 2008, p. 1473-1483, Vol. 190, No. 4
0021-9193/08/$08.00+0 doi:10.1128/JB.01207-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Sara A. Sandstedt,2,
Paul J. Bonthuis,1
Jennifer L. Geelhood,1
Kevin L. Nelson,1
William C. T. Unrath,1
Mathew A. Diggle,3
Mary J. Theodore,4
Cynthia R. Pleatman,4
Elizabeth A. Mothershed,4
Claudio T. Sacchi,4,
Leonard W. Mayer,4
Janet R. Gilsdorf,2,5 and
Arnold L. Smith1,6
Microbial Pathogens Program, Seattle Biomedical Research Institute, Seattle, Washington,1 Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan,2 Stobhill Hospital, Scottish Meningococcus and Pneumococcus Reference Laboratory, Glasgow, United Kingdom,3 Meningitis and Special Pathogens Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, CDC, Atlanta, Georgia,4 Department of Pediatrics and Communicable Diseases, University of Michigan Medical School, Ann Arbor, Michigan,5 Department of Pathobiology, University of Washington School of Public Health, Seattle, Washington6
Received 27 July 2007/ Accepted 26 November 2007
|
|
|---|
|
|
|---|
Efforts to understand and control NTHI disease have been hampered by the diversity of these bacteria. Many of the surface antigens that have been studied display interstrain and intrastrain heterogeneity as a result of both sequence divergence and phase variation. It is increasingly recognized that NTHI isolates also vary in genetic content. We use the term island to refer to a genetic locus (one or more genes) that occurs in some but not all strains. As used here, the term does not imply that the locus is known to be readily transferred between strains or is thought to have been recently acquired. Islands whose function is known include the hmw, hia, and hif adhesin loci; the region flanked by infA and ksgA containing lipopolysaccharide (LPS) biosynthetic loci lic2BC or losAB; and the three loci required for the biochemical reactions used in biotyping (18, 27, 30, 34, 52). Several strains have recently been sequenced, and each contains numerous islands that are not present in the other sequenced strains (25, 47). Subtractive hybridization studies have identified other islands (55). Certain islands appear to be more common in disease isolates than in isolates from healthy subjects (17, 34, 44), suggesting that the genetic content of NTHI strains affects their likelihood of causing symptomatic infection.
Until recently, distinguishing H. influenzae strains from each other or evaluating their relatedness has been difficult. Epidemiological studies have commonly analyzed strains by gel electrophoresis of bacterial proteins, restriction fragments, or randomly amplified sequences (1, 39, 43). These methods yield patterns that vary from lab to lab. Within the last few years, two systems for typing H. influenzae have been developed, based on sequence polymorphisms of either housekeeping genes (multilocus sequence typing [MLST]) (35) or genes encoding 16S rRNA (16S typing) (46). Each of these two systems has the advantage of yielding unambiguous data, so that a database can include strains typed in several laboratories. The MLST system has so far been more widely adapted.
The primary goal of this project was to analyze the phylogenetic structure of H. influenzae, with particular focus on NTHI. Second, we sought to evaluate the extent to which phylogenetic groups of NTHI strains differ in content of genetic loci that may affect pathogenesis. A collection of clinical isolates for which we had previously used biotyping and PCR to detect several genetic islands (11) were typed by MLST in the present study, and we extended the biotyping and PCR studies to NTHI strains that were already in the MLST database.
We performed both profile-based and sequence-based analyses of the MLST database. Profile-based analyses tally the shared and differing alleles for each pair of sequence types (ST) without consideration of the actual sequences of the differing alleles. Acquisition of a new allele by genetic exchange (potentially altering several bases) has the same effect on the outcome of profile-based analyses as a single-point mutation. In contrast, for sequence-based analyses, alteration of an allele at several bases affects the outcome of the analysis to a greater extent than alteration at a single base. Because H. influenzae strains are known to undergo genetic exchange, it was not certain whether a sequence-based analysis would identify distinct phylogenetic groups, particularly within the nontypeable strains. We performed a rigorous analysis of the MLST sequences using a maximum-parsimony method and identified 13 distinct clades within the H. influenzae population. All except two of the clades corresponded closely to groups identified using eBURST, a profile-based analysis (16). Ninety-eight percent of encapsulated strains and 65 percent of the NTHI strains in the MLST database could be assigned to one of the 13 clades, while 21 percent of NTHI strains were members of a single eBURST group that did not correspond to any clade. We found that each of these phylogenetic groups had a different distribution of the genetic islands that we examined. These observations suggest that horizontal exchange has not completely blurred evidence of ancestral relationships among H. influenzae strains.
|
|
|---|
The program TNT (P. A. Goloboff, J. S. Farris, and K. Nixon, TNT: tree analysis using new technology, program and documentation, 2003 [available at http://www.zmuc.dk/public/phylogeny/tnt]) was utilized to conduct maximum-parsimony analyses of the concatenated sequences, using both constrained and random sectorial searches with search parameters optimized every 10 hits. The new technology tree drifting and tree fusing algorithms were used (21). The program was run until 4,545 equally most parsimonious trees were collected and the strict consensus tree stabilized. The majority-rule consensus of the resulting equally most parsimonious trees was calculated in TNT. The resulting majority-rule tree, plotted using ST65 as the outgroup, is shown in Fig. 1. Table S1 in the supplemental material lists the position of each ST on the tree. Frequency jackknife support values were also calculated with TNT (22), using 1,000 replications and a probability of removal for each character of 36%.
![]() View larger version (33K): [in a new window] |
FIG. 1. Maximum-parsimony majority-rule consensus tree for 359 ST, plotted with ST65 as the outgroup. Colored regions of the tree numbered 1 through 13 indicate branches (clades) that were found in 100% of trees, with the exception of clades 9 and 11, which were found in 92% and 81% of trees, respectively. The tree is aligned with a plot showing the serotype and the eBURST group for each ST. Each ST corresponded either to a single serotype or to one or more NTHI strains. H. influenzae biogroup aegyptius (H. ae.) strains are plotted separately from other NTHI strains. The ST for 15 sequenced strains are indicated on the tree. Table S1 in the supplemental material lists the position of each ST on the tree as well as its eBURST group and the serotype of associated strains.
|
(ii) 16S rRNA typing. The 16S rRNA genes were amplified using external primers, and the 1,595-bp products were sequenced as described previously (46). Maximum-parsimony analysis was performed on the complete set of 16S sequences, as described above for MLST sequences, yielding 2,230 equally most parsimonious trees.
Bacterial strains. The MLST database includes 322 NTHI isolates, of which 296 were included in our phylogenetic analyses. (When more than one strain with the same ST was obtained from the same subject, we excluded all except one isolate.) Of the 296 independently isolated strains, 206 were available for laboratory studies. These included 52 pediatric isolates of diverse sources used in an earlier study (11), 25 of the 27 otitis isolates from Finland that were used to develop the H. influenzae MLST scheme and have been used in several other studies (2, 7, 28, 35), 88 isolates submitted to the Active Bacterial Core Surveillance program at the Centers for Disease Control and Prevention that had been used to develop the 16S typing system (46), and several isolates for which partial genome sequences have recently been reported. All strains were confirmed to be H. influenzae based on the requirement for β-NAD+ and hemin and were judged not to be H. haemolyticus based on recently described criteria (37). Serotyping was performed by PCR as described previously (13). bexA primers yielded a PCR product from control strains of each serotype and failed to amplify a product from any of the NTHI strains.
Genetic and phenotypic heterogeneity. (i) Biotyping. Biotypes were assigned according to Kilian's system (30), using API 20E strips to detect biochemical reactions as previously described (11).
(ii) PCR of genetic islands. The hic, hif, hia, hmw, lic2BC, and losAB loci were detected by PCR as described previously (11). For these loci, PCR was initially carried out using flanking primers, and positive results were confirmed by partial sequencing of the PCR product or by amplification with combinations of internal and external primers, as described previously (11). To confirm the presence of the hia and hmw loci, PCR was performed using flanking primers (Table 1) as well as the internal primers described previously (11). For hmw, a multiplex PCR with primers flanking both the hmw1 and hmw2 loci was performed. The two products were not clearly resolved by gel electrophoresis, so the lack of one hmw locus would not have been detected. Both hmw and hia are known to differ in sequence from strain to strain (4, 33). We did not attempt to assess the sequence heterogeneity of these loci; rather the internal primers that we used were selected to hybridize with conserved regions of the hia and hmwA sequences.
|
View this table: [in a new window] |
TABLE 1. Primers used for PCR detection of genetic islands
|
Statistics. Frequency data were evaluated by Fisher's exact probability test, calculated online at the VassarStats website for statistical computation (http://faculty.vassar.edu/lowry/VassarStats.html). The resulting P values were subjected to the Bonferroni adjustment for multiple comparisons on a set of data (42).
Nucleotide sequence accession numbers. The sequences of the novel restriction-modification loci have been submitted to GenBank and assigned accession numbers EU285588 (R2846.1179-80, putative type II restriction system), EU296480 (R2866.503-507, probable type I restriction system), and EU296479 (R2866.1857-58, probable type III restriction system).
|
|
|---|
Jackknife resampling using frequency differences (22) indicated low support (<50%) for most of the clades identified in the majority-rule consensus tree. Notable exceptions include clades 1, 5, and 8, with jackknife values of 66%, 72%, and 87%, respectively. While resampling support was low for most of the tree, many of the clades identified using the MLST housekeeping genes also correlate with other characteristics, including serotype, biochemical traits, and other genetic traits (see below), indicating that the clades have biological relevance. In addition, the bootstrap and jackknife resampling methods are considered conservative, erring toward low confidence values (15, 48).
We also utilized eBURST, an alternative method of analyzing MLST data, to define groups in which each ST is identical at five or more of the seven sequenced loci to at least one other ST in the group. Ten such groups contained five or more ST. Each eBURST group mapped to a single region of the majority-rule tree, in most cases corresponding closely to a clade or to a serotype-specific cluster within a clade (Fig. 1; see Table S1 in the supplemental material). In contrast, the 40 ST in eBURST group 2 are not in any of the clades identified by the parsimony analysis.
These data provide the first evidence that the NTHI population contains several distinct genetic groups: of 296 independently isolated NTHI strains in the MLST database, 80% have ST in one of 10 clades (192 strains) or in eBURST group 2 (67 strains).
Clustering of encapsulated strains within serotype-specific clusters. The clustering of encapsulated strains on the tree shown in Fig. 1 is consistent with that observed in previous studies. A large multilocus enzyme electrophoretic study by Musser et al. (40) and a subsequent smaller MLST study by Meats et al. (35) each identified two major clusters of type b strains, with much smaller numbers of type b strains elsewhere on the phylogenetic tree. Each study also found type a strains to be clustered in three sites on the tree and a single cluster each for types c, d, e, and f. Since publication of the original H. influenzae MLST database, 194 type b strains have been added, with 96 new ST. We found that the vast majority of these new ST fall into the same two type b-specific clusters described previously but that 12 type b strains are outside of the clusters. The presence of type b capsular genes was confirmed by PCR for the two of those strains that were available to us. Twenty-nine strains of types a, c, d, e, and f have been added to the MLST database, and with the exception of two type d strains (confirmed by PCR), all fall into the previously described clusters. It was proposed previously that strains of types e and f and certain type a strains have a common ancestor (35). This was confirmed in the present study, in which these strains were all in clade 2. Nine NTHI strains have ST outside of the main NTHI clusters. The lack of capsular biosynthetic genes in each of these strains was confirmed by PCR. These strains may have lost capsule biosynthesis genes relatively recently, as seven of these NTHI strains are within the large type b group (clade 3), and the other two are most closely related to type e strains.
Overall, the addition of 525 strains to the MLST database since its original publication has not significantly altered our understanding of the genetic relationships among encapsulated H. influenzae strains. In contrast, the NTHI-specific regions of the MLST tree are now sufficiently developed that distinct groups can be defined.
Genetic and phenotypic comparison of NTHI clades. We sought evidence for common features among NTHI strains within a phylogenetic group (i.e., within one of the 13 clades or in eBURST group 2). NTHI strains are known to be heterogeneous in the presence and structure of adhesins and other outer membrane proteins, in LPS structure, and in metabolism. Of 296 independently isolated NTHI strains in the MLST database, 206 were available to us for laboratory study. This collection, which includes 52 strains studied previously (11), was evaluated by biotyping and by performing PCR to detect the hmw adhesin loci and the LPS biosynthetic loci flanked by infA and ksgA. PCR results were confirmed using a second set of primers and in some cases by partial sequencing of the product. For each of these traits, we found significant differences in distribution across the phylogenetic groups (Fig. 2; Table 2). For example, 7 strains of clade 1 were all biotype III; of 12 clade 2 strains studied, seven were biotype I; clade 8 strains (n = 7) were all biotype II; and clade 11 strains (n = 15) were all biotype V. Of 49 strains in eBURST group 2, all except 5 were biotype II. Each of these phylogenetic groups was equally uniform with regard to the hmw loci (Fig. 2; Table 2) and nearly so for the content of the infA-ksgA locus (Table 2). These data suggest that there may be biological differences among clades, resulting in biochemical differences as well as in the presence of genes affecting the structure of bacterial surface components, including both outer membrane proteins and LPS.
![]() View larger version (37K): [in a new window] |
FIG. 2. Characterization of NTHI strains and correlation of biotype and hmw locus with clustering on the majority-rule consensus tree. The tree differs from that in Fig. 1 in that each ST found in more than one NTHI strain was expanded to allow plotting of data for each strain, while serotype-specific clusters were collapsed (indicated by triangles). The tree is aligned with a plot showing clade (as in Fig. 1), biotype, the presence of the hmw loci (determined by PCR), and the type of specimen from which the strain was cultured. Commensal, strains cultured from oropharyngeal or nasopharyngeal swabs from healthy children; otitis, strains from middle ear aspirates from children with otitis media; eye, strains from children with conjunctivitis; mucosal, strains from other mucosal samples, primarily sputum or endotracheal aspirates; invasive, strains from blood or cerebrospinal fluid. The biotype and PCR data, including the results of PCR of the LPS biosynthetic locus flanked by infA and ksgA, are summarized in Table 1.
|
|
View this table: [in a new window] |
TABLE 2. Phylogenetic groups differ in distribution of genetic islands
|
|
View this table: [in a new window] |
TABLE 3. Further comparison of phylogenetic groups
|
In contrast, there was no apparent correlation between phylogeny and the clinical or geographic source of strains, with the exception of H. influenzae biogroup aegyptius. H. influenzae biogroup aegyptius strains from the Brazilian purpuric fever clone are all ST65 (not in any of the 13 clades), while epidemiologically unrelated conjunctivitis strains are ST70 to -77, clustered in or near clade 4. Of the remaining NTHI strains in the MLST database, the clinical source is known for all except 17. Isolates cultured from middle ear aspirates (n = 52) or from systemic infections in children (n = 57) or adults (n = 100) were all broadly distributed across the phylogenetic tree (Fig. 2). Consistent with a previous report that commensal strains are less likely than otitis isolates to contain hmw (10), we noted that clade 2 isolates, which are uniformly hmw negative, were more frequently commensal than other isolates. Correspondingly, strains of eBURST group 2, 90% of which are hmw positive, are less likely to be commensal. These differences are not statistically significant. Moreover, as the database contains relatively small numbers of nasopharyngeal isolates from healthy children (n = 35) or from sputum or other nonconjunctival mucosal sites (n = 17), it would be premature to draw any firm conclusions about phylogenetic differences between commensal isolates and those from disease.
Comparison of maximum-parsimony analysis with rapid MLST analyses. As a practical matter, the maximum-parsimony analysis used to generate the majority-rule tree required hundreds of hours of computer time, so it is unlikely to be repeated routinely as new ST are added to the database. We sought to determine whether any of the readily available tools for analyzing MLST data would cluster strains in a way that approximates the results of the maximum-parsimony analysis. A comparison between eBURST and maximum parsimony is shown in Fig. 1. As noted above, while several eBURST groups correspond closely to one of the 13 clades, the two sets of groups are not equivalent. eBURST group 2 contains a set of 36 ST that are not in any of the 13 clades. Conversely, clades 4, 6, 9, and 13 do not contain eBURST groups larger than five ST. The NTHI-specific cluster within clade 2 is also not in an eBURST group. Thus, while analysis of the MLST database by eBURST will identify several clusters of closely related strains, it will not identify all such clusters.
The lineage assignment algorithms in the START2 suite of sequence typing software (29) allow rapid generation of four trees from MLST data: neighbor-joining trees and unweighted-pair group method using average linkages trees based on either concatenated sequences or profiles (i.e., the set of seven allele numbers for each ST). We used the START2 algorithms to analyze the complete MLST database. Of the four resulting trees, the sequenced-based neighbor-joining tree was the one in which clustering of ST was most similar to the tree generated by maximum-parsimony analysis (Fig. 3). The main differences between the two were that the neighbor-joining analysis divided clade 3 (the large type b cluster) into three regions of the tree and clade 13 (NTHI) into two clusters. This method should allow tentative assignment of a newly typed strain to a cluster of strains that are likely to have a similar set of genetic islands.
![]() View larger version (20K): [in a new window] |
FIG. 3. Comparison between different analyses of the complete MLST database. The dendrogram was generated from MLST sequences by the neighbor-joining method, using software in the START2 suite of MLST analytic tools. In alignment with each ST on the tree is plotted the serotype of associated strains, and the assignment of the ST to clade and eBURST group as previously described is indicated. Table S1 in the supplemental material lists the position of each ST on this tree and on three other trees generated using other programs in START2.
|
![]() View larger version (23K): [in a new window] |
FIG. 4. Comparison between 16S typing and MLST. The dendrogram is a majority-rule tree derived from maximum-parsimony analysis of 84 16S sequences. Each branch is labeled with a red numeral representing the percentage of trees contributing to the majority-rule consensus tree for that branch. Branches representing encapsulated strains are shaded in color and labeled by serotype. For each branch, the MLST phylogenetic group (clade or eBURST group 2) for the same strains is indicated by a colored arc. Strains with 16S ST 3, 4, 5, 31, and 80 (indicated by red circles) each belonged to more than one MLST phylogenetic group.
|
|
|
|---|
The finding that there are several distinct genetic groups within the NTHI population raises the question of how these groups are maintained. There is no obvious effect of geography, as most clades have members that were isolated in more than one European country as well as in several regions of North America. The finding that hmw-containing strains are clustered in certain clades suggests that clades may differ in their interaction with epithelial cells. It is known that many strains lacking hmw contain the gene for a different adhesin, Hia (10, 11, 51, 52). HMW adhesins bind sialylated glycoproteins, while the ligand(s) of Hia has not been identified. It is known that hmw-containing strains differ from hmw-negative strains in the specificity of binding to several epithelial cell lines (54). It is thus conceivable that strains in different clades occupy different niches within the nasopharynx. Finally, there may be restriction barriers between clades. Several restriction endonucleases in H. influenzae strains are known (45, 49), and the recently sequenced genomes revealed the existence of additional restriction-modification systems. For example, the 86-028NP genome contains three restriction systems not found in the Rd KW20 genome sequence, including the HaeII system (25). PCR analysis in the present study suggested that the HaeII locus and three novel restriction-modification loci are each limited to strains in a small number of phylogenetic clusters. Such heterogeneity in restriction systems may limit genetic exchange between randomly chosen strains.
We analyzed the MLST database using two methods that differ in their assumptions. TNT is a parsimony method that assumes descent with modification (50), uses full sequences (which makes it more sensitive to recombination), and quickly explores tree space to find the most parsimonious trees (41). eBURST additionally assumes evolution by clonal complex formation (16), uses allelic profiles (which makes it more sensitive to stochastic error [9]), and forms groups based on single-locus variants in alleles, with founders dependent on the number of single-locus variants. eBURST does not link ST that differ by more than two alleles, whereas TNT will construct their entire evolutionary history. Despite the differences between these methods in assumptions about the evolutionary process, in the data used, and in their sensitivity to recombination and stochastic errors, the results were surprisingly similar (Fig. 1). As expected, the eBURST analysis focused on the recently diverged tips of the tree. Most of the differences between the analyses were that TNT linked some strains that eBURST did not link. This most likely occurred where there were small differences between more than two alleles.
Comparison of the results of eBURST and TNT maximum-parsimony analyses provides some evidence for the relative importance of recombination and mutation in different parts of the H. influenzae population. Turner et al. recently described the outcome of eBURST analysis in three simulated populations differing in rates of mutation and recombination (53). For a clonal complex in which recombination is rare or is only moderately frequent, a plot of eBURST relationships yields a radial pattern (53). In the H. influenzae MLST database, the ST in eBURST group 1 form this type of pattern (see Fig. 7 in reference 53), suggesting descent from the founder, ST6. The fact that essentially the same strain cluster (clade 3) (Fig. 1) was identified by the sequence-based TNT analysis is consistent with relatively rare recombination among this group of type b strains.
A second type of eBURST pattern has no clear founder and is described as "straggly." This occurs when eBURST groups strains that differ at only a single MLST locus even though they are not descended from a recent common ancestor. This pattern occurs in populations with frequent recombination (53). eBURST group 2, which contains approximately 23% of the NTHI strains in the database, has a straggly pattern (not shown). The interpretation that these strains do not form a radial clonal complex is consistent with their not forming a single monophyletic branch in the TNT analysis. Interestingly, we found that strains in eBURST group 2 are as similar in genetic content as strains within the TNT-defined phylogenetic groups (clades). The strains in this group may be closely related but undergo frequent intragroup recombination. More research on the effects of sampling on eBURST patterns will help determine if straggly groups in poorly sampled databases are indeed the result of recombination. Alternatively, the eBURST group can be explained as a large monophyletic group beginning at the top strain in eBURST group 2 and ending at the bottom strain in parsimony clade 2. The ancestral strains are included in eBURST group 2, but as lineages diverged, they may have accumulated mutations in more than two genes so that they could no longer be grouped by eBURST.
The third eBURST pattern described by Turner et al. (53) occurs in highly divergent populations, where each eBURST group contains only a very small number of ST. In H. influenzae, clade 13 does not correspond to an eBURST group; in other words, it is rare for strains in this group to share more than four MLST alleles with any other strain. However, the ST in clade 13 were shown by TNT analysis to be closely related, as they were clustered together in 100% of equally parsimonious trees. One interpretation is that in clade 13, mutation is frequent relative to recombination, resulting in large numbers of differences at the allele level, caused by a relatively few differences at the sequence level. These conclusions are limited by the potential differences between the simulations (53) and the actual content of the MLST database. The MLST database contains strains sampled with no consistent plan. It is highly likely that many intermediates connecting the MLST strains have not been sampled, which may affect eBURST patterns.
Our data have implications for the study of H. influenzae pathogenesis. The role of a genetic locus in disease is assessed in part by evaluating its distribution among strains from different clinical sources (10, 32, 44, 55). Recognition that many loci characteristically occur in certain phylogenetic clusters adds a different dimension to the distribution data. For example, the observation that hmw is more frequent in otitis media isolates than in isolates from healthy children was previously interpreted as indicating that the HMW adhesins have a specific role in pathogenesis of otitis media (10). The recognition that hmw-positive strains are clustered in certain phylogenetic groups may lead to identification of other loci that are common in those groups and are also candidates for virulence factors in otitis media. Otitis media strains in phylogenetic groups that are predominantly hmw negative may have a different set of virulence factors. Discovery of genes affecting complex phenotypes such as serum resistance will be aided by comparing strains within phylogenetic groups rather than by seeking genes common to all strains with a given phenotype. In some cases, such as evaluation of a vaccine candidate, it is desirable to study strains that are known to be genetically diverse. The phylogenetic tree thus provides a framework for design and interpretation of studies of H. influenzae biology.
An obvious question is whether clades differ in the frequency of association with disease or in its clinical presentation. For the NTHI strains currently in the database, we did not observe any correlation between clinical source and phylogenetic clustering. However, a limitation of this study is its sampling bias. Although asymptomatic colonization with NTHI is far more common than symptomatic infection, nearly 90% of the NTHI strains in the MLST database were isolated from patients with symptomatic infections. Although most NTHI infections are limited to the respiratory tract, strains from these mucosal infections are underrepresented in the database. Sixty percent of the NTHI strains in the database were cultured from normally sterile sites and typed as part of bacteremia surveillance studies carried out by public health laboratories in the United States and Scotland. While there have been epidemiologic studies characterizing NTHI strains from subjects who were not experiencing acute infections (10, 14, 17), few of those strains have been typed by MLST. Addition of these and other clinically diverse strains to the MLST database may result in development of new branches of the tree. We do not propose that the phylogenetic groups we describe should constitute an NTHI typing system. The majority-rule tree shown in Fig. 1 is our current best estimate of ancestral relationships among H. influenzae strains, but it is unlikely that this tree is complete.
In summary, a rigorous phylogenetic analysis of the MLST database showed for the first time that most NTHI strains are members of phylogenetic groups that are as distinct from each other as the six capsular serotypes. The clades that we identified are unexpectedly uniform in possession of genetic islands affecting metabolism, outer membrane protein composition, and LPS structure.
This work would not have been possible without the Haemophilus influenzae MLST website, hosted at Imperial College, London, and funded by the Wellcome Trust.
Published ahead of print on 7 December 2007. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
These authors contributed equally to this work. ![]()
Current address: Instituto Adolfo Lutz, Seção de Bacteriologia/Imunologia, São Paulo, Brasil. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»