Previous Article | Next Article ![]()
Journal of Bacteriology, August 2004, p. 5442-5449, Vol. 186, No. 16
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.16.5442-5449.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Paulo A. Zaini,1,
Leandro M. Moreira,1 Ricardo Z. N. Vêncio,2 Adriana Y. Matsukuma,1 Alan M. Durham,3 Diva C. Teixeira,4 Hamza El-Dorry,1 Patrícia B. Monteiro,4,
Ana Claudia R. da Silva,1 Sergio Verjovski-Almeida,1 Aline M. da Silva,1* and Suely L. Gomes1*
Departamento de Bioquímica, Instituto de Química,1 Departamento de Estatística,2 Ciência da Computação, Instituto de Matemática e Estatística, Universidade de São Paulo, São Paulo,3 Fundo de Defesa da Citricultura, Araraquara, Brasil4
Received 20 February 2004/ Accepted 22 March 2004
|
|
|---|
|
|
|---|
Different strains of X. fastidiosa have been reported to infect a wide range of plants, including grapevines and citrus, almond, and pear trees, among others (26). In the United States for instance, Pierce's disease prevents profitable viticulture if leafhopper vectors are present at high densities (1, 14). In Brazil, citrus variegated chlorosis (CVC) is responsible for great financial losses to the citrus agroindustry, being detected in one-third of the citrus trees. Orange production quickly decreases in orchards affected by CVC, as fruits become hardened and of no commercial value. Interestingly, within the majority of host plants, X. fastidiosa behaves as a harmless endophyte (27).
Several X. fastidiosa strains have had their genomes completely or partially sequenced, and genome comparative analysis with different pathogenic strains of X. fastidiosa pointed to common candidate virulence determinants as well as strain-specific genomic signatures (4, 24, 32, 37). However, no information is available about the genome composition of nonpathogenic Xylella strains, which would contribute to more direct insights on pathogenicity mechanisms.
Genome-wide comparison between pathogenic and nonpathogenic strains within a species is a useful strategy for identifying candidate genes important for virulence. DNA microarray-based genome composition analysis is a good alternative to full genome sequencing and has been used in comparative studies to analyze various bacterial pathogens including Mycobacterium tuberculosis (3), Helicobacter pylori (29), Pseudomonas aeruginosa (38), Bacillus anthracis (28), Yersinia pestis, and Yersinia pseudotuberculosis (12).
In this report, we show that X. fastidiosa strain J1a12, which was isolated from citrus and is suitable for genetic transformation (6, 23), elicits few or no CVC symptoms when inoculated into citrus and tobacco plants. Furthermore, a DNA microarray-based genome composition analysis was performed by comparing J1a12 with strain 9a5c, which produces typical CVC symptoms (20) but is resistant to transformation with DNA in vitro (23), a drawback for its genetic manipulation. Our microarray data revealed that the great majority of the coding sequences (CDS) are highly conserved on both strains. However, 14 CDS were shown to be absent or highly divergent in the nonpathogenic strain. Expression profiling of both strains, PCR and reverse transcription (RT)-PCR with CDS-specific primers, and DNA sequence analysis were used to validate the genomic differences observed.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Evaluation of C. sinensis and N. tabacum plants inoculated with X. fastidiosa CVC strains 9a5c and J1a12
|
Data acquisition and normalization. Microarray slides were scanned by using a Generation III DNA scanner (Amersham Biosciences), and fluorescence intensity values (ICy3 and ICy5) from each spot were extracted by using ArrayVision, version 6.0, software (Imaging Research, Inc.). Raw fluorescence intensity and normalized data are available at the project site (http://verjo19.iq.usp.br/xylella/microarray/). Data normalization was carried out by LOWESS fitting on an M versus A plot (39), where M is the ratio of fluorescence intensities of the two measurements for each spot [defined as M = log2(ICy5/ICy3)] and A is the geometric mean of the fluorescence intensities [defined as A = 1/2 x log2(ICy5 x ICy3)]. The normalization script is available at the project site.
CDS classification process.
To determine hybridization noise and to estimate dynamic cutoff values for classifying a CDS as equally present in both strains (9a5c and J1a12) we used the hybridization data collected from three independent homotypic experiments (9a5c versus 9a5c). For this kind of experiment, also called self-self hybridization, the microarrays were cohybridized with strain 9a5c DNA separately labeled with either Cy3- or Cy5-dCTP analogs. As verified in the M versus A plot, there is a dependence of the hybridization intensity log ratio of each spot (M) with the mean log intensity of each spot (A). Thus, we have determined a cutoff value for each interval in the A axis by using kernel density estimators. We chose kernel density estimators (31) instead of the normal probability density function (16) because we experimentally derived the null distribution as the result of homotypic experiments and verified that it does not present a Gaussian behavior (further information is available at the project site [http://verjo19.iq.usp.br/xylella/microarray/]). The density distribution was integrated around the mode peak until 0.995 probability was reached, defining intensity-dependent noise threshold cutoff values (credibility intervals) based on experimental data from 9a5c versus 9a5c homotypic experiments, thus setting an interval where the hybridization ratio is considered to be 1:1 (e.g., 2.5 < M < 1.8, for the lowest accepted intensities at A = 2). These credibility intervals were subsequently used in the analysis of replicas of 9a5c versus J1a12 hybridization experiments to nonparametrically estimate the null distribution of the statistical test H0: CDS is present in both J1a12 and 9a5c strains. Spots outside the credibility intervals present strong evidence against a 1:1 ratio. Using these criteria, four categories were defined for the CDS in the J1a12 genome based on its orthologous 9a5c counterpart: (i) equally present in both strains, (ii) divergent in strain J1a12, (iii) highly divergent or absent in J1a12, and (iv) higher copy number in J1a12. Category i includes all of the CDS for which
60% of the replicas were inside the credibility intervals.
CDS presenting negative log ratio values outside the credibility intervals could be classified as divergent in strain J1a12 (category ii) or highly divergent or absent in J1a12 (category iii). To distinguish between these categories, we performed four control experiments where the microarrays were cohybridized with DNA from the sequenced grapevine strain Temecula (37) and strain 9a5c labeled with either Cy3- or Cy5-dCTP analogs. Next, we derived a correspondence between the hybridization intensity log ratio for each CDS amplicon and its respective nucleotide sequence identity in both strains. Amplicon sequences exhibiting nucleotide identities smaller than 20% were defined as category iii, highly divergent or absent. Their respective log ratio M was taken as the cutoff threshold (Mcutoff) of category iii. The log ratio value with the least false callings was an Mcutoff value of 1.7. At this threshold, 19 false positives and 23 false negatives were observed (0.76 and 0.92%, respectively). Category iii includes CDS with an M value of <1.7 and a P value smaller than 0.05 in a t test against the null hypothesis H0: M
1.7. Sequencing some amplicons from J1a12 that were outside the credibility intervals and checking the divergence between 9a5c and J1a12 sequences further supported the adequacy of the threshold. The remaining CDS that have a negative log ratio value, with an M value of
1.7, were considered divergent (category ii). Those CDS with positive log ratio values were considered to be present at higher copy numbers in J1a12 (category iv). CDS with low reproducibility (less than 60% of replicas in a single category) were excluded from the analysis.
Validation of microarray data. PCR and RT-PCR experiments were performed with CDS-specific primers to further investigate the status of CDS in the genome of strain J1a12 classified by using the microarray data. The reactions were carried out with genomic DNA or cDNA from strain 9a5c or J1a12 with 35 cycles of amplification. A sample of CDS presenting log ratios outside the credibility intervals, as determined by homotypic hybridization experiments (9a5c versus 9a5c), was chosen to perform the validation. Among the 64 CDS with negative log ratios (categories ii and iii), the following 33 CDS were randomly chosen for PCR validation: XF0077, XF0496, XF0497, XF0500, XF0663, XF0684, XF0696, XF0890, XF1250, XF1306, XF1581, XF1588, XF1589, XF1646, XF1663, XF1664, XF1686, XF1708, XF1709, XF1860, XF1863, XF1874, XF1877, XF1878, XF1884, XF1968, XF2193, XF2195, XF2307, XF2722, XF2768, XF2772, and XFb0001. In addition, the following CDS with positive log ratios (category iv) were also tested by PCR: XF0513, XF0514, XF0515, XF0516, XF0518, XF0519, XF0521, XF1933, XF1934, XF1935, XF1936, and XF1937. A 4-µl sample of each reaction mixture was electrophoresed in agarose gels, and DNA was stained with ethidium bromide. The amplicons were then classified by visual inspection as absent, same copy number, or more abundant in strain J1a12 in relation to strain 9a5c. In addition, DNA sequence determination was carried out for a few CDS. For that, PCR products obtained with primers based on neighboring CDS were cloned in pGEM-TEasy vector (Promega) and dideoxy sequencing reactions were performed with 100 ng of plasmid DNA in Big Dye Terminator sequencing reactions (Applied Biosystems) according to the manufacturer's instructions. Sequencing reactions were carried out with either CDS-specific or T7 promoter primers.
|
|
|---|
Similar results were observed with tobacco plants as the experimental host. As shown in Table 1, none of the plants inoculated with J1a12 presented symptoms or were colonized by the bacteria. In contrast, all tobacco plants inoculated with 9a5c presented the lesions characteristic of X. fastidiosa CVC infection, as previously described (21).
These results indicate that strain J1a12 shows a nonpathogenic phenotype, in contrast to the highly pathogenic behavior of strain 9a5c. In addition, Table 1 shows that plant colonization by strain J1a12 is very inefficient, as no bacteria were detected in the plant xylem by PCR experiments with a pair of primers (CVC-1 and 272-int) which are specific for Xylella isolates from citrus plants (25). This pair of primers amplifies a genomic region of approximately 500 bp, from chromosome position 1051239 to 1051745 of the 9a5c genome, encompassing 195 bp of CDS XF1100 and an intergenic region upstream of this CDS. It is important to stress that these primers can amplify the correct DNA fragment when directly tested in J1a12 in vitro cultures (6, 23).
Genotyping by DNA microarray analysis. To investigate whether the differences in phenotype observed between strains 9a5c and J1a12 could be associated with differences at the DNA level, we have constructed a DNA microarray encompassing 2,692 CDS, which represents 94.5% of all CDS described in the reference strain 9a5c (32). Total DNAs isolated from strains 9a5c and J1a12 were separately labeled with either Cy3- or Cy5-dCTP fluorescent analogs, and competitive microarray hybridizations were performed. Raw and normalized hybridization data are available at the project site (http://verjo19.iq.usp.br/xylella/microarray/). An initial screening revealed that 292 CDS presented either low signal intensity or poor reproducibility and were excluded from further analysis. As detailed in Material and Methods, the remaining 2,400 CDS were classified into four categories according to the normalized hybridization fluorescence intensity ratios of J1a12 over 9a5c DNA samples determined for each CDS. Among the 2,400 CDS, approximately 96% were found to have a log ratio (M) around 0 and were classified as equally present in both strains (category i). One example is shown in Fig. 1A. This figure shows an M versus A plot, i.e., normalized intensities log ratios (M) versus the average of log intensities (A) of all the replicas for a given CDS. This kind of graph shows the dependence of the ratio on the overall intensity of each spot, indicating that, for genes with low hybridization intensity signals (A values below 2), the observed ratios have a higher intrinsic dispersion, as determined by homotypic hybridizations. As a result, different cutoff values for M were used for different ranges of intensity (A) when classifying a CDS as equally present in both strains. M versus A plots showing the reproducibility of the data for each CDS are available at the project site.
![]() View larger version (37K): [in a new window] |
FIG. 1. Xylella strain J1a12 CDS classification based on DNA microarray hybridization ratios. Examples of CDS classified in each of the four categories are shown. (A) XF1621, classified as equally present in both 9a5c and J1a12 strains, category i; (B) XF0262, classified as divergent in J1a12, category ii; (C) XF0496, classified as absent or highly divergent in strain J1a12, category iii; (D) XF1937, classified as higher copy number in J1a12, category iv. Orange dots represent the results for all CDS in the microarray from three homotypic control experiments (9a5c labeled with Cy5 versus 9a5c labeled with Cy3). Graphs show the fluorescence intensity ratio M = log2(ICy5/ICy3) versus the fluorescence intensity mean A = 1/2 x log2(ICy5 x ICy3). Green dots represent the hybridization data (9a5c labeled with Cy3 versus J1a12 labeled with Cy5 or vice versa) from multiple replicas for the indicated CDS, where the fluorescence intensity ratio M = log2(IJ1a12/I9a5c). Similar graphs for each of the CDS are available at the project site (http://verjo19.iq.usp.br/cagexylella/private/).
|
|
View this table: [in a new window] |
TABLE 2. CDS divergent in strain J1a12
|
|
View this table: [in a new window] |
TABLE 3. CDS absent or highly divergent in strain J1a12
|
Despite all the information obtained with DNA microarray genotyping, it is necessary to stress that frameshifts and point mutations cannot be identified by this method. In addition, our DNA microarray experiments will not detect genes present exclusively in the unsequenced strain J1a12, impairing detection of genes that would eventually attenuate pathogenicity.
Validation of CDS classification. To further validate the distinction between CDS potentially absent or highly divergent and those classified as divergent, a sample of 33 CDS (listed in Materials and Methods) were analyzed by PCR with CDS-specific primers. All CDS tested were PCR negative for strain J1a12, including those with log hybridization intensity ratios between 0.5 and 1.7.
Figure 2A shows an example of a CDS (XF0077, encoding a fimbrial adhesin precursor) classified as absent or highly divergent by microarray analysis. PCR and RT-PCR experiments with CDS-specific primers corroborated its classification in this category. In addition, a PCR experiment with a pair of specific primers flanking XF0077 encompassing CDS XF0075 to XF0079 produced a smaller amplicon in strain J1a12 than in strain 9a5c (Fig. 2A). The exact position of the deleted region was determined by DNA sequence analysis, as described in the next section.
![]() View larger version (44K): [in a new window] |
FIG. 2. Validation of CDS classification. CDS-specific primers for XF0077 (A) and XF1968 (B) were employed to perform PCR amplifications with DNA from strains 9a5c and J1a12 (left panels) or to perform RT-PCR amplifications with total RNA (central panels). PCRs were also carried out with DNA of both strains and primers based on the sequence of the CDS flanking XF0077 (amplicon XF0075 to XF0079) or XF1968 (amplicon XF1967 to XF1969) (right panels). The sizes of the amplicons are shown in base pairs.
|
|
View this table: [in a new window] |
TABLE 4. Nucleotide and amino acid identity of selected divergent CDS of strain J1a12 compared to strain 9a5c
|
RNA expression studies by microarray hybridization comparing strains J1a12 and 9a5c were performed as an additional validation of CDS classification. RNA hybridization data are available at the project site (http://verjo19.iq.usp.br/cagexylella/private/). Hybridization signals were found to be at the background level for the 14 CDS classified as absent or highly divergent in strain J1a12. On the other hand, all of these CDS showed significant expression levels in strain 9a5c.
The expression levels of CDS that were classified as equally present in both strains (category i, see Materials and Methods) were studied under standard bacterial growth conditions. This class of CDS was chosen to eliminate possible hybridization artifacts due to sequence divergence. We found that among the 2,296 CDS in this category, which presented detectable hybridization intensity values, approximately 97% exhibited comparable RNA expression levels on both strains, i.e., differences in expression were smaller than twofold, with P values smaller than 0.05 in a t test. However, about 1% of the CDS presented a higher RNA expression level (twofold or more) in strain 9a5c and about 2% had higher expression in strain J1a12 (see Tables S3 and S4 in the supplemental material). No obvious correlation could be made between the CDS presenting differential expression and the phenotypes of each strain.
Functional characteristics of genes absent or highly divergent in strain J1a12. Among the 14 CDS classified as absent or highly divergent (Table 3), 10 have no similarity to known genes and no function could be assigned. Therefore, they will not be further discussed, although their involvement in pathogenesis cannot be excluded.
The X. fastidiosa strain 9a5c genome encodes three fimbrial adhesin subunits from type III pilus (XF0077, XF0078, and XF0080). Our microarray data classified XF0077 as absent or highly divergent, and we have confirmed its deletion in strain J1a12 by DNA sequence analysis. The deleted region, encompassing 1,050 nucleotides (Fig. 2A), extends from position 76845 to 77895 (numbers from the strain 9a5c main chromosome). XF0078 was classified as divergent in J1a12, and DNA sequence analysis has shown a 92.2% nucleotide identity with its ortholog in 9a5c (Table 4). The third fimbrial adhesin paralog (XF0080) was classified as equally present in both strains (category i). The three paralogs from 9a5c are similar to mrkD from Klebsiella pneumoniae and share high similarity with each other (XF0077 and XF0078 share 70% amino acid sequence identity and both display around 60% amino acid identity to XF0080). As reported for the adhesins of K. pneumoniae (30), the N-terminal region exhibits a greater degree of variability, probably conferring on strain 9a5c the ability to adhere to different bacterial and/or host cell components or even producing an extracellular matrix with greater bonding capacity. In K. pneumoniae, mrkD null mutants are fimbriate but nonadhesive (34) and the mrkD gene product is not required for biofilm formation (18). Proteomic and mass spectrometric analyses of whole-cell lysates and extracellular components have demonstrated that structural and adhesive subunits of fimbriae are ubiquitous in cultures of Xylella strain 9a5c (33). Despite the presence of two CDS encoding the adhesion subunit precursors XF0078 and XF0080 in J1a12, we have observed that this strain displays a much less aggregated phenotype in vitro than 9a5c cells, as shown in Fig. 3. One possible explanation for this phenotype is that the presence of the adhesin encoded by XF0077 is important for the adhesion of X. fastidiosa cells.
![]() View larger version (54K): [in a new window] |
FIG. 3. X. fastidiosa strain J1a12 has lost the ability to aggregate. Optical microscopy of X. fastidiosa strains 9a5c (A) and J1a12 (B). Magnification, x1,000.
|
XF1646, classified as absent or highly divergent in J1a12, was annotated as a putative UDP-3-O-(R-3-hydroxymyristoyl)-glucosamine N-acyltransferase, which is similar to the lpxD gene from Rickettsia rickettsii. This enzyme catalyzes the third step in lipid A biosynthesis, a constituent of lipopolysaccharides from the outer membrane. In Escherichia coli, an lpxD mutant had its susceptibility to various antibiotics increased as high as 512-fold, indicating alterations in the outer membrane permeability barrier (36). An altered outer membrane structure due to the incomplete lipid A biosynthesis may render J1a12 more sensitive to antimicrobial agents eventually produced by the plant host. The increased permeability of its outer membrane may also explain why J1a12 is amenable to DNA transformation, whereas 9a5c is not (6, 17, 23).
The two highest DNA hybridization intensity ratios obtained when comparing 9a5c versus J1a12 were derived from the two CDS encoded by the mini plasmid pXF1.3 (XFb0001 and XFb0002). This result reflects the presence of multiple copies of this plasmid in strain 9a5c (32) and its absence in strain J1a12, as previously reported (6).
Final remarks. Among the 14 genes classified as absent or highly divergent, three CDS encoding a fimbrial adhesin precursor, an arginase, and a UDP-3-O-(R-3-hydroxymyristoyl)-glucosamine N-acyltransferase are conspicuously absent in the nonpathogenic strain J1a12. Due to their putative role in bacterial survival in infected hosts, they emerge here as important players in Xylella pathogenicity. The observation that several other genes are missing in J1a12 gives support to the hypothesis that bacterial pathogenesis is a multifactorial process and that each of these factors may contribute somewhat quantitatively to the development of disease. In fact, inactivation of a single fimbrial adhesin gene (PD0058) or a single fimbriae protein gene (PD0062) in X. fastidiosa grapevine strain Temecula was not sufficient to decrease bacterial pathogenicity, causing only a slight reduction in the bacterial population (9).
Recent microarray expression studies comparing X. fastidiosa 9a5c cells freshly isolated from citrus with bacteria attenuated after several passages in axenic culture have shown that most genes found to be induced in the freshly isolated condition were associated with adhesion and with possible adaptation to the host environment (8). However, the set of genes observed in that study is different from the genes found to be absent in the nonpathogenic strain analyzed in the present report, reinforcing the multifactorial hypothesis of bacterial pathogenesis. Furthermore, our results obtained with J1a12 are independent of the number of passages in culture, since the microarray experiments were performed with DNA from bacterial cells obtained after either 14 or 24 passages and no differences were observed (data not shown).
Our plant colonization assays showing that strain J1a12 is unable to induce CVC symptoms or even sustain itself in host plants raise the question of how strain J1a12 was originally isolated from a citrus tree. A recent report about the diversity of the endophytic bacterial community in citrus trees (2) can shed light upon this intriguing question. Possibly, the presence of strain J1a12 in symptomatic plants is dependent on other microorganisms and/or other X. fastidiosa pathogenic strains eventually present in the biofilms formed by the aggregated cells and clogging the xylem vessels of infected plant hosts (13, 19).
Among the CDS found to be absent in J1a12 and proposed here to play a role in disease development, four CDS do have orthologs in the three other pathogenic Xylella strains that have been sequenced (see Table S5 in the supplemental material). Thus, despite the diversity of hosts, geographical location, and disease symptoms, different Xylella strains may present similar mechanisms of pathogenesis. Our results suggest that common strategies could be undertaken to control the diseases that are caused by different X. fastidiosa strains infecting various host plants. Given the importance of the set of genes found here, further functional characterization is warranted. Towards this end, we are currently trying to complement strain J1a12 with the CDS shown to be absent from its genome. Different from 9a5c, the nonpathogenic J1a12 strain is amenable to DNA transformation with plasmid vectors and the transposome system (6, 17, 23). Thus, we believe that with complementation studies it will be possible to evaluate the role of these CDS in bacterial virulence and the ability to colonize xylem vessels.
We are greatly indebted to Hugo A. Armelin for coordinating the Cooperation for Analysis of Gene Expression (CAGE) Project and for strongly supporting this work. We thank João Carlos Setubal and João Kitajima for providing information about reannotation of the Xylella genomic sequence and Apuã C. M. Paquola, Milton Y. Nishiyama, Jr., and Abimael A. Machado for providing bioinformatic tools. We thank Jesus Ferro for coordinating our PCR amplification data bank and for the Xylella cosmid library. We also thank Sanvai R. P. Rocha, Mateus de Almeida Santos, and Anelise G. Mariano for help in pathogenicity tests and Helder Nakaya and Ari J. S. Ferreira for valuable help in the beginning of this work.
A.M.d.S., H.E.-D., S.L.G., and S.V.-A. were partially supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico, (CNPq). T.K., L.M.M., R.Z.N.V., and P.A.Z. are FAPESP doctoral fellows.
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
These authors contributed equally to this work. ![]()
Present address: Alellyx Applied Genomics, Campinas, SP, Brasil. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»