Previous Article | Next Article ![]()
Journal of Bacteriology, March 2005, p. 2138-2147, Vol. 187, No. 6
0021-9193/05/$08.00+0 doi:10.1128/JB.187.6.2138-2147.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Genome Center,2 Program in Molecular and Cellular Biology,1 Department of Medicine,4 Department of Genome Sciences, University of Washington, Seattle, Washington3
Received 4 October 2004/ Accepted 9 December 2004
|
|
|---|
|
|
|---|
Iron is necessary for the growth and survival of almost all organisms, with the exception of Lactobacillus species and some intracellular parasites (3). Iron is incorporated into numerous protein moieties such as iron-sulfur clusters and participates in many cellular redox reactions. Although abundant on the surface of the Earth, iron is not readily accessible because it rapidly oxidizes to insoluble Fe3+ compounds under aerobic conditions. Bacteria obtain iron by a variety of mechanisms, including secretion and subsequent uptake of ferrisiderophores, direct uptake of ferrous iron, uptake of host iron-binding molecules, and proteolytic destruction of host proteins with subsequent iron scavenging (54). Mechanisms for iron acquisition via siderophores are diverse; at least 500 chemically distinct siderophores are known (69). Although almost all organisms require iron, excess iron is highly toxic in aerobic environments, particularly because of the production of hydroxyl radicals in the Fenton reaction.
Pyoverdines are siderophores made by members of the genus Pseudomonas (10, 37). Pyoverdine is the primary siderophore of P. aeruginosa, and pyoverdine mutants have attenuated virulence in multiple infection models (39, 66). Each strain of P. aeruginosa makes one of three pyoverdine types, each type having a distinct peptide side chain that is synthesized nonribosomally (40). The pyoverdine receptor is also type specific, transporting only its corresponding pyoverdine (14).
In a whole-genome-diversity study in which we compared two cystic fibrosis strains and one environmental strain of P. aeruginosa with the PAO1 reference strain, we found the pyoverdine region to be the most divergent alignable locus in the genome (64). Given the divergence in pyoverdine genes and the variety of different pyoverdine molecular structures, the pyoverdine locus appeared to be a possible target of diversifying selection. To determine the degree and patterns of diversity in pyoverdine genes, we sought to clone, sequence, and analyze the pyoverdine region from multiple strains.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Strains used in this study
|
Fosmid library creation and screening. P. aeruginosa genomic DNA was prepared by phenol-chloroform extraction and then further purified by ethanol precipitation. Genomic DNA was partially digested with restriction enzyme Sau3AI and size-separated by pulsed-field gel electrophoresis (Bio-Rad CHEF_DRII apparatus, 1% agarose gel, 14°C, 16 h, pulse ramp 1 to 6 s). DNA segments sized 35 to 52 kb were cut from the gel and recovered by electroelution in dialysis bags (SPETRA/POR molecular porous membrane tubing). Fosmid vector pFos1 was prepared as described (30) and genomic DNA fragments were ligated into the prepared vector with T4 DNA ligase (Roche, Basel, Switzerland).
The ligated vector was then packaged with Giga Pack III XL packaging extract (Stratagene), and E. coli cells were infected. Fosmid-containing colonies were picked into 384-well plates and also pooled into 96-well plates at eight colonies per pool. Fosmid-containing colonies were picked to 10-fold coverage of the genome, assuming average insert sizes of 40 kb.
Fosmid pools were screened with PCR at multiple sites upstream and downstream of the pyoverdine region. Primers were designed to amplify approximately 500-bp fragments and amplicons were spaced 1 to 3 kb apart. Once PCR-positive pools were identified, individual fosmid-containing cells were screened with PCR. Positive individual cells were regrown in 3 ml of L broth at 37°C for 18 h, lysed with 0.2 N NaOH with 1% sodium dodecyl sulfate treatment, and DNA was precipitated with isopropanol. Purified fosmids were retested with PCR and by digestion with multiple restriction enzymes.
Yeast capture cloning. Yeast capture cloning vectors were assembled via yeast recombination (45). PCR with tailed primers carrying homology to both vector and target DNA was used to build target segments. Primers used to amplify upstream of the pyoverdine region included sequences from genes pvdQ and pvdA, while primers used to amplify downstream of the pyoverdine region included sequences from genes PA2405 and PA2406. Targeting plasmid pEHS6 was assembled by transforming into S. cerevisiae the two target segments, a linearized yeast-E. coli shuttle vector, and a central fragment carrying the yeast wild-type CYH2 gene with the Amp/ori stuffer fragment (57).
Genomic DNA was prepared as described (34). After passing genomic DNA 20 times through a 26.5-gauge needle, sheared genomic DNA and the linearized yeast capture cloning vector were transformed into yeast spheroplasts (57). Plasmid DNA was prepared from pooled yeast colonies and transformed into E. coli cells (26). Yeast capture clones were screened by whole-cell PCR with primers specific to vector-insert junctions.
PCR was used to test for the presence of deletions in strains R' and ATCC O13 by designing primers at the boundaries of each potential breakpoint. Primers located inside putative deletions were designed with sequence from type III strain 206-12.
Multiple complete digest gels and DNA sequencing of fosmid and yeast capture clones. Multiple complete digest gels were used to verify the 35- to 52-kb size of each isolated fosmid insert or the size of each yeast capture clone (71). Restriction enzymes EcoRI, BstBI, and NcoI were used. When prior sequence data were available, fragment sizes from newly acquired fosmid or yeast capture clones were compared with those predicted from the sequence.
Shotgun libraries of fosmid and yeast capture clones were made by shearing and ligating prepared DNA into a pBluescript vector. Greater than eightfold sequence coverage was obtained for each clone by standard shotgun sequencing methods. Automated sequence finishing was done with Autofinish software (22), manual finishing was performed if necessary, and sequence assemblies were checked against multiple complete digest gel patterns. Sequences for strains PAO1, PA14, 1-60, and 2-164 were obtained from GenBank.
DNA sequence analysis.
All pairwise sequence alignments were performed with cross_match software, a banded implementation of the Smith Waterman algorithm (http://www.phrap.org/). Gene predictions were performed with Genemark (35). Protein domain predictions were performed with PFAM (6) and NCBI's Conserved Domain Database (36). The EMBOSS program lindna was used to generate windowed single nucleotide polymorphism (SNP) plots in Fig. 1, and the EMBOSS programs cusp and cai were used to estimate the codon adaptation index of a gene (58). The codon adaptation index of genes was calculated in reference to a codon table of all genes in the PAO1 genome. Nonribosomal peptide synthetase specificity was predicted with software at the website http://raynam.chm.jhu.edu/
nrps/ (13). PCR primer design was performed with Primer3 software (60). Hydrophobicity plots were generated with the algorithm of Kyte and Doolittle (32) with a window size of 15 amino acids. The program PROF 1.0 was used to predict the secondary structure of proteins (46).
![]() View larger version (24K): [in a new window] |
FIG. 1. Gene structure and patterns of variation across nine pyoverdine region sequences. Blue and green arrows on the reference sequence line indicate the direction of transcription. In the top panel, the eight other sequences have been aligned to sequence from the reference strain, PAO1; bands of color indicate the local density of single-base-pair differences. In the bottom panel, central-region sequences (each approximately 33 kb) have been aligned with an arbitrarily chosen reference for each type since the central regions are not alignable between types. Protein domains are overlaid on the gene structure of the nonribosomal peptide synthetase genes and are colored as follows: activation domain-red, condensation domain-yellow, thiolation domain-orange, thioesterase domain-brown. Since there is ambiguity in the precise positions of the deletion breakpoints in R' and ATCC O13, grayed regions show the range of possible breakpoints. While the deletion shown in ATCC O13 is a cloning artifact and that in R' is a true deletion, both presumably arose by similar mechanisms. Letters A to K indicate regions divergent within pyoverdine sequence types (see Table 3). Two type III genes were predicted to be smaller than colinear genes in other types: pvdP was shorter by more than 250 codons and pvdM was shorter by approximately 50 codons. For simplicity, no effort was made to reflect this length variation in the figure. The scale (in base pairs) is shown in the bottom right of the figure and is consistent throughout the figure. Pyoverdine genes pvdL (PA2424), pvdG (PA2425), and pvdX (PA2428) are not shown in the figure; these genes are not divergent between type I and II strains (no type III sequence is currently available). pvdY has a 468-bp coding region in type I and III strains and a 912-bp coding region in type II strains.
|
E(W) = N(w1, w2, ...wm1) N(w2, w3, ...wm)/N(w2, w3, ...wm1)
To measure how observed tetranucleotide usage deviated from the expectation, we calculated the frequency of divergence of a word F(W) as the ratio of the observed count N(W) to the expected count E(W). Linear regression analysis was then used to compare F(W) from two sequences, such as the pyoverdine region and the PAO1 genome, and produce an R2 value that estimated the similarity in tetranucleotide usage between the sequences.
Gene names were kept consistent with PAO1 gene naming conventions when predicted protein domains and synteny for genes from different types indicated obvious homologies. The nonribosomal peptide synthetase genes were given gene names consistent with the PAO1 nonribosomal peptide synthetase genes with a number added indicating their pyoverdine type. Type-specific genes were named based on predicted protein domains.
To search for additional type-specific genes outside the divergent pyoverdine region, nucleotide alignments were made between the genomes of type I strains PAO1 and PA14, the unfinished genome of a type II strain (data not shown), and the 0.5X shotgun sequence reads of three type II strains (64). Type III sequence data were not available for comparison outside of the pyoverdine region.
Tests for positive selection.
We used a maximum-likelihood method (73) to estimate whether a gene had experienced positive selection, and if so, we identified the specific codons that were under positive selection. Maximum-likelihood models allow one to test a great number of combinations of parameter values and give a score to each combination according to how well the parameters fit the data; the combination with the best score fits the given data best. Our data were a multiple alignment of nucleotide sequences obtained for each gene in the pyoverdine region. Also input to the models was a phylogenetic tree that was constructed for each gene. We used models whose parameters included the nonsynonymous-to-synonymous ratio
, the transition-to-transversion ratio
, and equilibrium codon frequencies
in each codon.
is a measure of how frequently single-base-pair differences change the amino acid coded for in a codon (as opposed to silent, or synonymous changes):
< 1 suggests purifying selection,
= 1 suggests neutral evolution, and
> 1 suggests positive selection. Values for
and
were estimated from the data, and codon frequencies were estimated from nucleotide base frequencies at each of the three codon positions. The genes with the highest scoring combination of parameter values that included a
greater than 1 were evaluated as being under positive selection.
In practice, maximum likelihood does not test all possible parameters. One must provide a starting point for each parameter, and the software will then incrementally change the parameter values to find a local high score. For each test, we used starting values of 0.1, 1.0, and 3.0 for
to ensure a broad sampling of likelihood space. When the likelihood scores from each of these starting values agreed, we were confident that we had obtained a true maximum and not a locally maximal score. Below are descriptions of two nested models that have detected positive selection in other systems and which we used to test pyoverdine genes for positive selection.
Model M0 assumed one class of
for all sites (aligned codons). Model M3 assumed three classes of
(
0,
1, and
2) each with a respective proportion of sites p (p0, p1, and p2). To compare the models, we calculated twice the log difference between the highest likelihood estimates of models M0 and M3. This difference was compared to a chi square distribution with four degrees of freedom to account for the difference in the number of parameters between the models. Evidence for positive selection in a gene required two conditions: model M3 predict a proportion of codons in a gene to have
> 1, and the likelihood score of model M3 to be significantly greater than the score of model M0.
To identify specific codons under positive selection, we compared two additional models. Model M7 assumed a beta distribution for values of
in a gene. Model M8 assumed a beta distribution with an added class of
for a proportion of sites p1. The beta distribution was used because it allowed for a range of
values in a gene, but only values of
between 0 and 1. If some codons in a gene were under positive selection, then model M8 would better fit the data because it allowed a fraction of codons to have
> 1. Two parameters, p and q, described the shape of the beta distribution. Values for p and q as well as
and p1 were estimated from the data. We compared twice the log likelihood difference between models M7 and M8 to a chi square distribution with two degrees of freedom. Evidence for positive selection in specific codons required two conditions: model M8 be significantly more likely than model M7, and model M8 predict a fraction of codons with
> 1. When positive selection was predicted, we could predict specific codons under positive selection by calculating posterior probabilities at each codon with Bayes' theorem. In other words, since we knew that a fraction of codons had a
value predicted by model M8, we could calculate the probability that each codon belonged to this class of
.
The program codeml from the PAML software package was used to test genes for positive selection (72). Amino acid multiple alignments were created with ClustalW (67). Alignments were then back-translated to nucleotide sequences and a neighbor-joining tree was generated with the program Neighbor with the phylogenetic software package PHYLIP (18).
To identify the locations of positively selected amino acids in FpvA, we aligned the amino acid sequence of FpvA from PAO1 to sequences from three E. coli proteins of known structure: ferric citrate transporter FecA (19), ferric enterobactin transporter FepA (8), and ferrichrome-iron transporter FhuA (20). Each of these proteins is in the same protein family as FpvA, and contains a TonB-dependent receptor domain (PF00593).
Nucleotide sequence accession number. Refer to Table 1 for sequence accession numbers.
|
|
|---|
The three pyoverdine sequence types correspond to the three known structural types of pyoverdine (see Fig. 1). The sequences of the same pyoverdine type have background levels of nucleotide divergence (approximately 0.5%) across the entire region analyzed, while the sequences of different pyoverdine types are highly diverged. The amount of divergence between types is variable in different genes and is highest in the genes located in the center of the region (see Fig. 2). The nucleotide sequences from the central part of the region, including pvdE, fpvA, pvdD, pvdI, and pvdJ, are not alignable between pyoverdine types. Figure 1 shows a multiple alignment and windowed SNP plot for all sequences, Table 1 lists the strains used in this study, and Table 2 lists the functions of individual genes in the region.
![]() View larger version (14K): [in a new window] |
FIG. 2. Mismatch in pairwise amino acid alignments between genes of each pyoverdine type. The highest-scoring alignment was used for each gene, and although some predicted gene boundaries differed, alignments spanned the smaller of any two aligned genes. Since the nonribosomal peptide synthetase genes pvdD, pvdJ, and pvdI were highly repetitive and mosaic in structure, the alignments given represent the best match to any nonribosomal peptide synthetase gene in the reference strain for a given type. Strains PAO1, 2-164, and R' were used as the reference strains for type I, II, and III pyoverdine, respectively.
|
|
View this table: [in a new window] |
TABLE 2. Functions of genes in the pyoverdine region
|
Approximately 30 kb downstream of the divergent pyoverdine region, pvdY is also divergent between types (Iain Lamont, personal communication). There are two highly diverged alleles of pvdY: type II strains have a 912-bp coding region, and type I and III strains have a 468-bp coding region. The two alleles are not alignable by pairwise nucleotide alignment and have 40% identity at the amino acid level. pvdY from type II strains contains an acetyltransferase protein domain (COG1670, RimL domain, e-value = 7e-06), and both alleles are similar to ribosomal acetyltransferases from other Pseudomonas species. Both alleles of pvdY have unusual codon usage: each allele has a codon adaptation index of less than 0.54, compared to the genome average of 0.68.
Pyoverdine genes adjacent to pvdY, including pvdL (PA2424), pvdG (PA2425), pvdS (PA2426), and pvdX (PA2428), have background levels of divergence between type I and II strains. pvdX, a gene of unknown function that is the only known pyoverdine gene to be expressed in iron-replete conditions, appears to be present in all types and is not divergent between types (68). It is notable that pvdL, a nonribosomal peptide synthetase that synthesizes the pyoverdine chromophore, was not divergent, while the nonribosomal peptide synthetase genes pvdI, pvdJ, and pvdD, which synthesize the pyoverdine side chain, were highly divergent.
We searched for other type-specific genes over an interval from 60 kb upstream to 60 kb downstream of the divergent pyoverdine region. Comparing sequences from strains of types I and II, we found only two candidate genes. Type II strains are missing most or all of gene PA2416. Type II strains also have a predicted gene with a 798-bp coding region between PA2423 and pvdL (PA2424); the gene did not contain a predicted protein domain but encoded a product similar to a hypothetical protein from Chromobacterium violaceum.
Outer membrane receptor gene fpvA and nonribosomal peptide synthetase genes pvdI, pvdJ, and pvdD are the most divergent genes in the pyoverdine region. Genes that function to synthesize and transport pyoverdine are the most divergent in the region. fpvA, the outer membrane pyoverdine receptor, is the most divergent gene by amino acid alignment (see Fig. 2). The next most divergent genes are immediately adjacent to fpvA and include the ABC transporter pvdE and the nonribosomal peptide synthetase genes pvdD, pvdJ, and pvdI. Other genes of moderate divergence between types include pvdP, pvdM, pvdN, pvdO, and PA2403. Type II strains also have moderate divergence in pvdA, fpvI, and fpvR and have a highly divergent downstream gene, pvdY.
Although genes in the central pyoverdine region are highly divergent by amino acid alignment (see Fig. 2), most colinear genes probably have related functions. Each type has genes colinear with fpvA, pvdE, and nonribosomal peptide synthetase genes, and the predicted secondary structures of the proteins encoded by both fpvA and pvdE are roughly conserved between types. These genes also share protein domains: fpvA genes have a TonB-dependent receptor domain (Pfam domain PF00593), pvdE genes have ABC transporter (PF00005) and ABC transporter transmembrane region (PF00664) domains, and nonribosomal peptide synthetase genes have activation (AMP-binding, PF00501), thiolation (phosphopantetheine attachment site, PF00550), condensation (PF00668), and thioesterase (PF00975) domains as well as putative epimerization domains. fpvA from each type and the nonribosomal peptide synthetase pvdD have been characterized in detail (1, 17, 50).
Pyoverdine type II and III sequences have type-specific genes. Type II sequences have a gene with a 966-bp coding region that contains an esterase/lipase protein domain (AES domain, COG0657, e-value = 1e-45) and is located between fpvA and the nonribosomal peptide synthetase pvdJ (2). The AES gene has 69% amino acid similarity and 27% nucleotide divergence from the gene PP4218 in the P. putida genome, which also has an esterase/lipase protein domain and is colinear with esterase/lipase protein domain within the P. putida pyoverdine region (43). There is no significant similarity between the esterase/lipase protein domain and genes in the finished Pseudomonas syringae genome DC3000 (11) or the unfinished Pseudomonas fluorescens genome Pf0-1 (GenBank accession number AAAT03000000).
Type III sequences have a gene with a 954-bp coding region that contains a siderophore-interacting protein domain (SIP domain, COG2375, e-value = 2e-52), and is located between pvdI (3) and PA2403. There is no significant similarity between the siderophore-interacting protein domain and genes in the available Pseudomonas putida, P. syringae, and P. fluorescens genomes.
Most intratype divergence is a result of recombination between types. There is approximately 0.5% nucleotide divergence between typical pairs of P. aeruginosa strains in most regions of the genome (28, 64). Although sequences of the same pyoverdine type primarily display background levels of divergence, there are 11 instances in the analyzed sequences of dramatically more pronounced divergence within a type (see Table 3). Most intratype differences are approximately 100 bp in length and are located within 400 bp of a region of intertype divergence. Since one allele is typically shared with another type, intratype divergence is the result of recombination between types. Recombination in P. aeruginosa is estimated to be very frequent (28, 47), so this evidence of recombination within the pyoverdine locus is not surprising. It is the swapping of divergent sequence between types that is notable. By shuffling sequences between types at the edges of divergent regions, recombination creates intratype divergence and may simultaneously erode the boundary regions of high intertype divergence.
|
View this table: [in a new window] |
TABLE 3. Regions of intratype divergencea
|
Strain 1-60 contains two examples of an entire gene that is normally type specific that has recombined with one of the other pyoverdine types. Strain 1-60 makes a type II pyoverdine and has predominantly type II sequence in the pyoverdine region. However, the PA2403 and pvdA alleles in 1-60 more closely resemble those in a type I strain. Strain 1-60 appears to be a hybrid, harboring a mixture of alleles typically found in type I or type II strains.
fpvA and PA2403 show evidence of positive selection. The extensive divergence in the pyoverdine region suggests an unusual evolutionary history. To discover whether positive selection contributed to the divergence, we tested each gene in the region for evidence of positive selection. These tests suggest that fpvA and PA2403, which accumulate nonsynonymous changes at an unusually high rate, are likely to have experienced positive selection (see Table 4).
|
View this table: [in a new window] |
TABLE 4. Evidence for positive selectiona
|
There is also evidence that positive selection has acted in PA2403. This gene, which is conserved in multiple Pseudomonas species, is of unknown function (55). PA2403 is not required for pyoverdine synthesis, but PA2403 mutants errantly produce pyoverdine in iron-replete conditions (44). With one exception, all of the positively selected amino acids reside outside the four transmembrane domains of the PA2403 protein (see Fig. 4). After protein folding, positively selected sites would reside on the same membrane face of the inner membrane, since two transmembrane domains separate the sites.
![]() View larger version (25K): [in a new window] |
FIG. 4. Hydrophobicity plot of PA2403 from PAO1 with codons predicted to be under positive selection superimposed. Four prominent hydrophobic regions (hydrophobicity > 2) likely represent transmembrane domains. Red asterisks represent individual codons predicted to be under positive selection, and asterisk size corresponds to posterior probability of positive selection. With one exception, all amino acids that are predicted to have been under positive selection fall between hydrophobic domains and likely reside on the same side of the membrane after protein folding. Hydrophobicity plots of PA2403 with sequence from pyoverdine type II and III genes give similar results.
|
Nonribosomal peptide synthetase-predicted amino acid specificities are consistent with pyoverdine structural peptide sequences. There is excellent correlation between the inferred amino acid sequence made by the nonribosomal peptide synthetase proteins and the primary structure of the pyoverdine that they produce. This correlation is possible because the nonribosomal peptide synthetase proteins are made of a series of cassettes, each of which catalyzes the addition of a particular amino acid to the pyoverdine. A cassette typically contains activation, thiolation, and condensation protein domains, which charge, bind, and connect, respectively, a specific amino acid to a growing peptide. The crystal structure of an activation domain allows homology-based association between cassettes and the amino acid they process during pyoverdine synthesis (13). For each pyoverdine type, the first activation domain in the nonribosomal peptide synthetase operon is specific for the first amino acid in the pyoverdine molecule, the residue closest to the pyoverdine chromophore. Continuing in the order of transcription through the nonribosomal peptide synthetase operon, activation domains are specific to the order of the amino acids in the pyoverdine, proceeding away from the chromophore (see Fig. 3 and 4). This order is consistent with the majority of other nonribosomal peptide synthetase enzymes.
![]() View larger version (23K): [in a new window] |
FIG. 3. Predicted amino acid specificity of nonribosomal peptide synthetase genes from the pyoverdine region. Protein domains are shown superimposed on genes, solid black rectangles representing activation domains and outlined rectangles representing other nonribosomal peptide synthetase domains. Amino acids in gray were not predicted due to lack of similarity to known sequences. Strain R' contained an in-frame deletion compared to type III strains that removed one amino acid from the pyoverdine side chain and fused two nonribosomal peptide synthetase genes. Only gene names for pyoverdine type I strains are listed. Underlined residues indicate D-isomers, but specific isomers were not predicted from structural data. fOHOrn, formyl-N5-hydroxyornithine; Dab, diaminobutryic acid; aThr, allo-threonine; c, cyclic.
|
Nonribosomal peptide synthetase enzymes are studied for their ability to generate many small molecules in a combinatorial fashion (2, 42), although attempts to manipulate the order and specificity of the nonribosomal peptide synthetase enzymes have encountered mixed success. The deletion in R' represents a natural example of nonribosomal peptide synthetase modification. This event also highlights the arbitrariness of the boundaries between nonribosomal peptide synthetase genes, since the in-frame fusion of two predicted nonribosomal peptide synthetase genes produced a functional product. It is unclear whether the novel peptide sequence in the R' pyoverdine confers any selective advantage; it is clear that deletion or duplication can rapidly change nonribosomal peptide synthetase genes because of their internally repetitive and highly modular structures.
|
|
|---|
It is interesting that alleles of pvdE or fpvA are more similar to genes from other soil bacteria than to other P. aeruginosa alleles. This situation is typical of a region under diversifying selection, where divergent alleles predating a speciation event can be inherited in both new species. However, since there is unusual codon usage and tetranucleotide usage in some pyoverdine types, horizontal transfer seems a more probable explanation for the trans-species polymorphism observed at this locus. Since the pyoverdine region does not show unusual G+C content, perhaps some pyoverdine genes originated in another organism with high G+C content. Other Pseudomonas species are primary candidates for the source of divergent alleles, but high similarity to fpvA was also seen in the high-G+C soil bacteria Agrobacterium tumefaciens and Azotobacter vinelandii.
The pyoverdine outer membrane receptor gene fpvA may drive diversity at the pyoverdine locus. To evolve a new pyoverdine type, both the nonribosomal peptide synthetase and the receptor must coevolve, maintaining their mutual specificity. Receptors are typically specific to a particular pyoverdine structure; however, multiple specificity has been reported, such as in the putative uptake of type II pyoverdine by the type III FpvA receptor (17, 38, 41). Our analysis identifies fpvA as the most divergent gene alignable between types in the region and under positive selection. fpvA is chromosomally located among the most divergent genes in the region and has the only intratype variation that is not a result of recombination between types.
Type II fpvA is an entry site for pyocin S3, a molecule made by some P. aeruginosa strains that is taken up by and lethal to some other strains (7). An alternative type I receptor, fpvB, was recently identified; the receptor is present in strains of each pyoverdine type and is regulated differently than fpvA, suggesting that there is selection to maintain alternative modes of regulation in the receptor (21). Viewed together, these aspects of fpvA function suggest a dynamic evolutionary history, where change in the receptor gene leads to further changes in the system of pyoverdine genes. Adjacent to fpvA, the nonribosomal peptide synthetase genes pvdI, pvdJ, and pvdD and the putative ABC transporter pvdE are also highly divergent and probably coevolve with fpvA. This coevolution would select against recombination events between types that separate these genes, particularly selecting against separation of the receptor gene and the nonribosomal peptide synthetase genes. Repeated rounds of receptor change followed by compensatory mutations elsewhere could result in rapid divergence between pyoverdine sequence types.
Diversifying selection seems to be acting on type-specific genes regardless of their chromosomal location. pvdY is separated by 30 kb from the divergent pyoverdine region but has type-specific divergence. This gene, whose function is currently unknown, presumably has a type-specific function in the pyoverdine system even though it is distantly located from other type-specific genes. It remains possible that there are still other type-specific pyoverdine genes elsewhere in the genome.
One of the most puzzling results of this study is the mixture of alleles in strain 1-60. Two genes in the type II strain have type I alleles, which are significantly divergent from and are present in place of type II alleles. To have become divergent, each allele must have had a selective advantage and must have evolved separately from alleles of other types for a relatively long period of time. Yet we now observe this mixture of alleles functioning together. If the alleles of different types can function together, how have the alleles been able to diverge from one another?
Lastly, we considered possible sources of diversifying selection at the pyoverdine locus of P. aeruginosa. Outer membrane protein genes such as the pyoverdine receptor fpvA are common targets for entry by phage or pyocins, and siderophore diversity may be a resistance mechanism. Siderophore diversity may also be a defense against ferrisiderophore stealing. Siderophore production is a cooperative behavior, since diffusion makes the ferrisiderophore available to any cell with an appropriate receptor. However, all cooperative activities invite cheating (23, 70). Many bacterial genome sequences, including the P. aeruginosa genome, contain numerous putative siderophore receptor genes without the corresponding siderophore synthesis genes (15, 49, 69). In iron-poor environments, it may be beneficial for a bacterial strain to make a siderophore that is distinct in structure from the major varieties present in the environment. This evolutionary dynamic could lead to continual generation of new siderophores whose selective advantage to a particular strain is constantly compromised when other strains acquire a compatible receptor.
We thank the staff of the University of Washington Genome Center. We are also grateful to Willie Swanson for advice on testing for positive selection, to Jean-Marie Meyer and Jane Burns for providing strains, to Ruolan Qiu for assistance with fosmid library construction, and to Arnold Kas for software assistance.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»