Previous Article | Next Article ![]()
Journal of Bacteriology, May 2006, p. 3424-3428, Vol. 188, No. 9
0021-9193/06/$08.00+0 doi:10.1128/JB.188.9.3424-3428.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Division of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657, Japan,1 Division of Integrated Biosciences, The University of Tokyo, 202 Bioscience Bldg., 5-1-5 Kashiwanoha, Chiba 277-8562, Japan2
Received 18 November 2005/ Accepted 17 February 2006
|
|
|---|
|
|
|---|
A comparison of synonymous and nonsynonymous nucleotide substitution rates is an effective approach for studying the mechanisms of DNA sequence evolution (21). The numbers of nonsynonymous and synonymous substitutions per site (dN and dS, respectively) (18) and their ratio (
= dN/dS) are important indicators of the selective pressure at the protein level, such that
values of <1, 1, and >1 imply stabilizing selection, neutral mutations, and diversifying positive selection, respectively. Many examples of positive selection at the molecular sequence level have been reported previously (1, 3, 5-7, 19, 27, 29). Most of these proteins play very important roles in their respective organisms and probably influence their fitness directly. Positive selection on a protein may indicate that it has an important role in the evolution of an organism (4). Analyzing the positively selected proteins from an organism should contribute to our understanding of the evolution of these organisms and the proteins.
To examine the positive selection acting on the phytoplasma antigenic membrane protein (Amp), we used 14 phytoplasmas: three strains of onion yellows phytoplasmas (OY-W, OY-M, and OY-NIM) (15, 23) and, porcelain vine witches' broom phytoplasma (PvWB) (8) were previously described; plant tissues infected with mulberry dwarf phytoplasma (MD), eggplant dwarf phytoplasma (ED), tomato yellows phytoplasma (TY), lettuce yellows phytoplasma (LeY), Iceland poppy yellows phytoplasma (IPY), marguerite yellows phytoplasma (MarY), sumac witches' broom phytoplasma (SWB), and paulownia witches' broom phytoplasma (PaWB) were kindly provided by T. Shiomi, T. Usugi, N. Nishimura, and T. Tsuchizaki (20); DNA samples extracted from Bermuda grass white leaf phytoplasma (AYBG) and potato purple top phytoplasma (PPT) were kindly provided by A. Bertaccini (24). Total DNA was extracted as described previously (17) and used for PCR templates. The primers ES-1 (5'-TTG AGC TCG CGG CCG CAT GAA ACA AAA AAC AAT TAT C-3') and Nad-2 (5'-TTT GCC TGA TTG AAA CCA TCT-3') were used to amplify a 3.6-kb DNA fragment harboring the amp gene. The PCR conditions were as described previously (9), except for the use of LA-Taq (Takara). The PCR-amplified fragments were purified using ExoSAP-IT (Amersham Bioscience) and then sequenced directly by using 11 primers: ES-1, ES-2 (5'-CAA TAG CAG GTG AAT CAT AGG C-3'), EL-1 (5'-AGA ATT CCA TAT GAG TAA AAA AAT ACT TTA TGG CAA-3'), EL-2 (5'-AAA TTA GTA CTG TGC AAG AAA-3'), EL-3 (5'-TAT CAA GAA TTA AAA GAT ACT-3'), Amp-N1 (5'-AAG AAT TCC ATA TGC AAA ATC AAA AAA CTC A-3'), Amp-3 (5'-CTG TTA AAG CTG TAG ATG GTA-3'), Amp-C1 (5'-AAG AGC TCG AGT TAT TTA TTG TTT TTG TTT TTT TTA AC-3'), Nad-1 (5'-TAC TCG AGG CGG CCG CCA TTT TGC ATA CTA TAA GCA T-3'), Nad-3 (5'-AAT AAT GAA TAA CTA AGG AAG C-3'), and Nad-2. Each 3.6-kb DNA fragment of each of the 14 phytoplasmas contained four genes in the order groES, groEL, amp, and nadE (Fig. 1). A previously reported 1.2-kb DNA fragment containing the amp gene of clover phyllody phytoplasma (GenBank accession number AF244541) (2) was completely identical to that from PPT.
|
View larger version (4K): [in a new window] |
FIG. 1. Physical map of the 3.6-kb DNA fragment from OY-W. It contains three complete genes encoding molecular chaperonin GroES (groES), GroEL (groEL), and antigenic membrane protein (amp) and an incomplete gene encoding NAD synthase (nadE).
|
![]() View larger version (9K): [in a new window] |
FIG. 2. A maximum-parsimony tree for the 3.6-kb nucleotide sequence containing groES, groEL, amp, and partial nadE genes. Numbers on the branches are bootstrap values obtained for 100 replicates (only values above 70% are shown).
|
was calculated as an average over all of the codon sites in each gene using the Nei-Gojobori method by MEGA version 2.1 (12) with Jukes-Cantor model or a maximum likelihood (ML) pairwise comparison (model M-2 in CODEML of PAML) (30). The significant difference test of
was done according to a previously described procedure (14). These results are partly shown in Table 1. For 70 of the 91 pairs,
was >1, ranging from 1.819 (OY-M versus MerY) to 7.607 (PvWB versus MD). The values for the other 21 pairs could not be calculated because the values for dS were 0. The
values for 61 pairs were significantly greater than 1. Very similar results were obtained with the ML pairwise comparison (data not shown). For the other three genes (groES, groEL, and nadE),
was <1 for all pairs (data not shown). |
View this table: [in a new window] |
TABLE 1. Number of synonymous differences, number of nonsynonymous differences, and dN/dS ( ) of amp estimated by the Nei-Gojobori methoda
|
is assumed to follow a certain probability distribution among codon sites in the sequence. All ML analyses were performed by using the MP tree, using the CODEML program in the PAML package version 3.13 (30). Table 2 shows the likelihood scores, the parameter estimates, and the results of likelihood ratio test calculated with the MP tree. Models M2, M3, M5, and M8 estimated that
was >>1 and were significantly favored over model M0, M1, or M7 (P was <0.001 in all pairs), which is indicative of strong positive selection. The posterior probabilities of site classes along the amp gene are presented in Fig. 3 for the best-supported M3 model with the MP tree. One site, 83A, appears to be subjected to extraordinarily strong positive selection, indicated by
values of 120.028. Similar ML analyses of the selection pressures were also performed using other tree topologies (e.g., the neighbor-joining tree, trees constructed only by groEL or amp sequences, or a "star" tree), and almost the same or more extreme results were obtained (data not shown). These analyses were repeated for the groES, groEL, and nadE genes. In all three genes and in all tree topologies, no models incorporating positive selection were significantly supported over models that did not (data not shown). |
View this table: [in a new window] |
TABLE 2. Log-likelihood scores and parameters estimated for the amp gene from phytoplasmas
|
![]() View larger version (16K): [in a new window] |
FIG. 3. Posterior probabilities of site classes along the amp gene. Model M3 (discrete) with the maximum-parsimony tree topology was used; this assumes three classes of sites in the gene. The estimated frequencies and ratios for the three classes were as follows: p0, 0.788; p1, 0.191; p2, 0.021; 0, 0.979; 1, 18.209; 2, 120.222 (Table 2). In each site, the total of posterior probabilities of the three classes is 1.000; e.g., the first codon site has posterior probabilities of 0.965, 0.035, and 0.000 with class 1 (white), 2 (gray), and 3 (black), respectively. The horizontal dashed line indicates a posterior probability of 0.95. Segments corresponding to two transmembrane regions (TM) and a coiled-coil structure (CC) are shown. The positions corresponding to sites with posterior probabilities of >0.95 are indicated by arrowheads.
|
values estimated using both the Nei-Gojobori method (average
over all codon sites) and the ML analysis (heterogeneous
) provided strong evidence for positive selection acting on Amp, whereas there was no significant evidence of positive selection on the genes upstream or downstream of the amp gene. The criterion that
is >1 in the pairwise method is a very stringent one for detecting positive selection (25), so few genes were reported as having an
of >1 by the Nei-Gojobori method (6, 27). Here, Amp shows high
values in both methods, and Amp appears to have one of the highest
ratios of previously reported proteins (1, 3, 7, 19, 29). The high
values probably result from determining the
ratio over a short evolutionary time, as closely related and biologically distinct phytoplasmas were compared. As shown in Table 1, it was observed that the actual number of nonsynonymous substitutions was greater than that of synonymous substitutions in most pairs (e.g., 62.75 and 6.25 in PaWB and MD, respectively). These data indicate that many nonsynonymous substitutions were accumulated within the time of accumulating few synonymous substitutions. The Amp protein possesses a central hydrophilic region (amino acid positions 33 to 203) that is probably located on the phytoplasma exterior and two hydrophobic transmembrane regions (2, 10). In this study, the best-supported M3 model estimated 19 positively selected sites, and 18 of these 19 sites were located within the central hydrophilic region (Fig. 2). These results were consistent with the previous reports that the central region of amp is variable (2, 10), suggesting that there is strong selective pressure on the central region of Amp. It is likely that the positive selection on Amp is caused by the interaction between the phytoplasma and its extracellular environment, the host cytoplasm. However, it remains unclear whether the positive pressure was derived from either insects or plants. The detailed analysis of the source of positive pressure would be a further problem.
This study strongly suggests that the Amp protein plays a biologically important role in the phytoplasma-host interaction. Molecular evolutionary analysis might enable the identification of biologically important genes through the comparison of nucleotide sequences. Identification of positively selected proteins offers a good approach for understanding organisms, such as phytoplasmas, in which transformation or production of mutants is difficult. A search for other positively selected proteins of phytoplasmas would also be useful. Moreover, a search for the host proteins that interact with Amp should help to elucidate its role in host-phytoplasma interactions and the source of the positive pressure. These analyses will further our knowledge of the adaptation of phytoplasmas, contributing to our understanding of their biology.
Nucleotide sequence accession numbers. The sequences of the phytoplasmas have been deposited in GenBank with accession numbers AB124806 through AB124811, AB242231 through AB242237, and AB167357.
This work was supported by grants-in-aid from the Ministry of Education, Culture, Sports, Science, and Technology of Japan (09460155).
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»