Previous Article | Next Article ![]()
Journal of Bacteriology, August 2006, p. 5393-5399, Vol. 188, No. 15
0021-9193/06/$08.00+0 doi:10.1128/JB.01819-05
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Department of Veterinary Science, The University of Melbourne, Parkville, Victoria 3010, Australia
Received 29 November 2005/ Accepted 14 May 2006
|
|
|---|
|
|
|---|
Sequencing of the Mycoplasma pneumoniae genome has revealed that it harbors 46 lipoprotein genes in six multigene families, all of unknown function (4, 7). These multigene families include genes that encode proteins with sequence similarity to the lipoproteins but that lack the characteristic amino-terminal prolipoprotein signal sequence. In contrast, the closest relative of M. pneumoniae, M. genitalium, has only 21 putative lipoprotein genes, with 1 to 3 genes in each family.
No studies have focused specifically on the expression of the lipoprotein multigene families in M. pneumoniae. In this study, we employed reverse transcriptase PCR (RT-PCR) to investigate the expression of all the lipoprotein genes in the six lipoprotein multigene families in this organism. We also examined the transcriptional organization of lipoprotein genes to find possible operon structures. Furthermore, we analyzed the phylogenetics of the lipoprotein genes and explored the relationships between the genes within each of the gene families.
|
|
|---|
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Members
of lipoprotein multigene families in M. pneumoniae, their
closest M. pneumoniae and M. genitalium
homologs, and the expression of lipoprotein genes as detected
by RT-PCR and proteomic methods
|
![]() View larger version (19K): [in a new window] |
FIG. 1. Genomic
location of the six lipoprotein multigene families. The truncated ORFs
in family 1 could be grouped together with other truncated ORFs to form
several full-length genes: MPN587 with MPN586, MPN590 with MPN589,
MPN585 with MPN584, and MPN581 with MPN580. MPN591 and MPN583 lacked
amino- or carboxyl-terminal sequences that were encoded by adjacent
sequences that had been classified as noncoding sequences in the gene
map. Each of the truncated ORFs in family 2 could be linked with an
adjacent truncated ORF, MPN281 with MPN282, MPN146 with MPN147, MPN364
with MPN363, and MPN097 with MPN098. The full-length sequences
generated from each of these pairs had a significant amino acid
sequence identity (61%, 55%, 61%, and 56%, respectively) with
full-length gene products. In family 3 truncated ORFs MPN439, MPN438,
and MPN437 could be linked together, as could MPN442, the MPN436-like
ORF, and MPN440, and they shared high sequence
identity (65% and 70%, respectively) with the full-length ORF MPN436.
There was also sequence similarity with MPN436 in the noncoding regions
between the truncated ORFs, indicating that these truncated ORFs
probably resulted from several mutations in two originally full-length
ORFs. The three overlapping truncated ORFs in family 4, MPN459, MPN458,
and MPN457, could be linked together to form a sequence with high
similarity (82%) to the single full-length ORF. In family 5 the
truncated ORFs MPN650 and MPN649 could be linked, as well as MPN467 and
MPN466, and these pairs had high levels of identity with full-length
genes (48% and 75%, respectively). Family 6 consisted of nine
full-length ORFs. No truncated ORFs were
found.
|
![]() View larger version (30K): [in a new window] |
FIG. 2. Genomic
locations of lipoprotein genes homologous to members of family 2 and of
homologous putative ABC transporter operons in M. pneumoniae
M129, M. genitalium G37, M. gallisepticum R, M.
pulmonis UAB CTIP, M. hyopneumoniae 232, and M.
synoviae 53. Open arrows, lipoprotein genes; solid arrows,
staphylococcal nuclease (sNc) gene homologs; crosshatched arrows, ABC
transporter
genes.
|
BLink was used to determine which Mycoplasma species possessed homologs of the M. pneumoniae lipoprotein families. Family 1 and family 2 homologs were common in many Mycoplasma species, whereas family 3 and 4 homologs were found only in M. genitalium, M. penetrans, and M. gallisepticum. Homologs of family 6 were found only in M. genitalium. Family 5 genes were found to be unique to M. pneumoniae.
Transcripts for the majority of ORFs in lipoprotein multigene families were detectable by RT-PCR. We used RT-PCR to comprehensively study the transcription of all 67 putative lipoprotein genes contained in the six multigene families. Every gene was tested at least twice with an independently isolated RNA template. We also looked for any operon structure within the lipoprotein genes by using primers targeted to bridge the sequence between truncated ORFs. In total, 58 genes were detected by RT-PCR (Table 1).
Most of the ORFs in family 1 were expressed, with the exception of MPN590, MPN589, and MPN587. We also found that the truncated ORFs MPN581 and MPN580 were located on a single mRNA, while MPN585 and MPN584 were expressed as separate mRNAs. Two transcripts, MPN591 and MPN583, had noncoding regions at their 5' ends, which, based on similarity searches, corresponded to the missing sections of the full-length ORFs.
In family 2, all full-length ORFs were detected by RT-PCR, with the exception of MPN506. Most of the paired truncated ORFs were transcribed on the same mRNAs, with the exception of MPN364 and MPN363, which were detected only on separate transcripts.
Of the three full-length ORFs in family 3, MPN444 and MPN436 were transcribed, while MPN489 was not detected by RT-PCR. Truncated ORFs MPN442, MPN436-like ORF, and MPN440 were transcribed polycistronically, whereas truncated ORFs MPN439, MPN438, MPN437, and MPN485 did not yield consistent products in RT-PCR, possibly as a consequence of low amounts of the respective mRNAs.
All four ORFs in family 4 were transcribed. Truncated ORFs MPN458 and MPN457 were located on a single transcript, while MPN459 was located on a separate transcript.
In family 5 all 13 ORFs were detected in RT-PCR. The paired truncated ORFs MPN650 and MPN649 were detected on a single transcript, as were MPN467 and MPN466. MPN148 and MPN054 were transcribed with predicted noncoding regions, containing amino-terminal and carboxyl-terminal sequences, respectively. Thus, these ORFs appear to have once been full-length ORFs that have since been truncated by a mutation in a start codon and by a frameshifting mutation, respectively.
In family 6, transcripts of all nine ORFs were detectable. The mRNAs for each gene were transcribed separately.
Phylogenetic relationships between lipoproteins within each family. To investigate orthologous and paralogous relationships of putative lipoprotein genes within a family, we conducted phylogenetic analyses of members of families 1, 2, 3, 5, and 6 and the corresponding M. genitalium homologs. Phylogenetic trees (Fig. 3) were constructed using the maximum parsimony method. For the analysis, truncated ORFs were linked together to form a full-length sequence.
![]() View larger version (13K): [in a new window] |
FIG. 3. Phylogenetic
trees of families 1, 2, 3, 5, and 6. Trees were generated using the
maximum parsimony method (MEGA 3.0). The nucleotide substitution rate
is shown as a bar under the tree, and bootstrap values (1,000
replicates) are given for each branch. Phylogenetic analysis of the
family 1 members and family 1 homologs in M. genitalium placed
them in at least four separate groups. Groups II, III, and IV had an
orthologous association with different M. genitalium genes.
Phylogenetic divergence between the genes in group IV mostly
corresponded to their order in the genome. Thus, an ORF was more
closely related to the ORFs closer to it on the gene map than to the
ORFs farther away from it. The genes in group I were closely related
and also lay together on the gene map. Further evidence supporting the
clusters of the tree was found in the different gene lengths between
the groups. MG068, containing 474 amino acids, is similar in size to
the other genes in group IV, which is shorter than MG067 and MG395,
which are 516 amino acids and 524 amino acids, respectively, and are in
groups II and III. There were at least two orthologous groups in family
2. Group I only had two members, MG185 and MPN199, while the other
group contained MG260 and the remaining 10 family 2 members. Family 3
formed three orthologous groups with an M. genitalium homolog
in each group. Although there were more genes in group III, only one
ORF was full-length. Family 5 members formed two groups with
significant bootstrap values (>70). As there were no homologs
in other species, it was not possible to further explore the origins of
these two groups. Truncated ORFs were usually closely related to an
intact member (e.g., MPN411 to MPN654 plus 111 bp of C-terminal
sequence, MPN012 to MPN054 plus 127 bp of C-terminal sequence, MPN011
to MPN650 plus MPN649, and MPN271 to MPN467 plus MPN466).Family 6 ORFs formed at least two orthologous groups in the
phylogenetic tree, and both groups had an M. genitalium
homolog.
|
|
|
|---|
Interestingly, many truncated lipoprotein genes were transcribed, often polycistronically, with other truncated ORFs. This was observed in all lipoprotein gene families that contained truncated genes. This indicates that these genes may not be pseudogenes and, furthermore, that the truncated gene products may retain function(s). The M. pneumoniae genome is reported to have few known transcriptional stop signals (5, 15). Therefore, even functionally unrelated genes could be transcribed polycistronically, as suggested by Benders et al. (1), although M. pneumoniae could employ mechanisms other than hairpin termination for transcriptional termination (15).
In studies of gene expression at the protein level, Regula et al. (12) and Ueberle et al. (14) were able to identify 305 proteins of the predicted 668 ORFs in M. pneumoniae using one-dimensional and two-dimensional gel electrophoresis and mass spectrometry, although they were unable to resolve alkaline proteins and membrane proteins using these methods. Jaffe et al. (9) used shotgun mass spectrometry combined with novel computational methods to build a proteogenomic map of M. pneumoniae strain FH based on the detection of expressed proteins alone. We detected transcription of 87% of the genomically predicted lipoprotein ORFs contained in multigene families in M. pneumoniae strain M129, compared to detection of 76% of the ORF products by Jaffe et al. in strain FH (9). The mass spectrometry and RT-PCR findings were in agreement for 45 of 67 ORFs. At the RNA level gene expression has been previously studied in microarrays (17). Weiner et al. (17) found that 606 of 688 ORFs were transcribed at 37°C, in agreement with our observation that most of the lipoprotein genes were transcribed.
Currently, the functions of most M. pneumoniae lipoproteins are not known, the exception being the lipoprotein that is a subunit of the FoF1-type ATPase (11), which is not a member of any of the six lipoprotein families. We observed that some lipoproteins were associated with ABC transporter genes (Fig. 2) or shared sequence similarity with ABC transporter genes (MPN436 and MPN459), suggesting that they may play a role in the transport of nutrients into the cell.
Our phylogenetic analyses suggest that gene duplication events have occurred frequently in all lipoprotein multigene families and that this duplication occurred prior to and after separation from M. genitalium. This is consistent with the suggestion by Himmelreich et al. (8) that the increased number of lipoprotein genes in the M. pneumoniae genome compared to that of M. genitalium was due to gene amplification. However, concurrent gene degradation has also occurred, as indicated by the many truncated ORFs in all of the families apart from family 6. Thus, the genomic repertoire for each group has resulted from an interaction between gene duplication and degradation. Each orthologous group in these families may be exposed to independent selective pressures, possibly related to their function, as indicated by the common retention of at least one full-length ORF in each group. Family 6 evolution differs from the other families. No truncated ORFs were found in this family, suggesting that the gene duplication events might have occurred recently and that the family may still be expanding.
In summary, we have performed a comprehensive analysis of lipoprotein multigene families in M. pneumoniae and investigated the transcription of all the genes in these families using RT-PCR. We found that most of the lipoprotein genes were transcribed in M. pneumoniae, in contrast to the limited numbers of lipoprotein genes expressed in many other mycoplasmas. Our examination of the transcription data and the genome sequence did not detect any evidence of phase variation. There is no evidence of local polynucleotide sequences upstream of the truncated ORFs that might be generating phase variable frameshift mutations, as has been seen in lipoproteins of some other mycoplasmas. However, we did detect some interesting associations between known genes, particularly ABC transporter genes, and the lipoprotein genes. The gene expression data and lipoprotein gene analysis presented in this paper lay a foundation for functional studies of lipoprotein multigene families in M. pneumoniae.
Supplemental
material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»