| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,
Department of Ecophysiology, Max Planck Institute for Terrestrial Microbiology, Karl-von-Frisch Str., 35043 Marburg, Germany
Received 18 September 2007/ Accepted 2 November 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
A typical TCS consists of an HPK and a cognate response regulator (RR) which are encoded in the same operon and architecturally organized in a simple linear 1:1 phosphotransfer signal transduction pathway (58). HPKs are multidomain proteins and generally contain a nonconserved sensor domain, which is responsible for detecting a particular stimulus, and a highly conserved kinase domain, which can be further subdivided into the HisKA domain, which is involved in dimerization and contains the conserved phosphorylatable histidine residue, and the HATPase_c domain. Typical RRs are either single-domain proteins consisting only of the conserved receiver domain, which contains the conserved aspartate residue that accepts the phosphoryl group from the histidine residue in the cognate HPK, or multidomain proteins consisting of a receiver domain and a variable output domain. HPKs autophosphorylate in a stimulus-dependent manner on the conserved histidine residue in the HisKA domain using ATP as a phosphodonor. Subsequently, the phosphoryl group is transferred to the conserved aspartate residue in the receiver domain of the cognate RR, resulting in activation of the RR and the generation of an appropriate output response. The phosphorelay systems constitute a structurally more complex type of TCS, with phosphotransfer occurring in three sequential steps (3). Generally, the architecture of these systems is similar to that of typical TCS and involves a linear phosphotransfer scheme. Specifically, the phosphoryl group is first transferred from the conserved histidine in the HPK to the conserved aspartate in a receiver domain, next to a conserved histidine in a phosphotransferase domain (Hpt), and finally to the conserved aspartate in a second RR. The domain organization of proteins in phosphorelays is highly modular. Thus, all four domains involved in phosphotransfer may reside in separate proteins (5), the kinase and first receiver may be present in the same protein (42), or the kinase, the first receiver, and the Hpt domain may be present in the same protein (64). It has been argued that the specific advantage of phosphorelays over typical TCS is that they allow the integration of several sensory inputs in one signal transduction pathway (9). In TCS, as well as in phosphorelays, the output response is determined by the phosphorylation status of the final RR and typically involves alterations in gene expression, in protein-protein interactions, or in enzyme activity. In addition to the linear 1:1 TCS and 1:1:1:1 phosphorelays, TCS and phosphorelays may also be organized as branched pathways. In the one-to-many pathways (54), one HPK has several cognate RRs, as exemplified by CheA (58) and ArcB (41) in Escherichia coli. This pathway connectivity would be suited for generating several responses to one input signal. In the many-to-one pathway (54), one RR has several cognate HPKs, as exemplified by Spo0F in Bacillus subtilis (5) and DosR in Mycobacterium tuberculosis (31). This pathway connectivity would be suited for integrating several input signals to generate one output response.
Here we have focused on an analysis of TCS in the gram-negative deltaproteobacterium Myxococcus xanthus. M. xanthus has a highly complex life cycle that involves social behavior (23). In the presence of nutrients, cells grow and divide, and if present on a solid surface, they form coordinately spreading colonies. In response to starvation, a developmental program is initiated that culminates in the formation of multicellular, spore-filled fruiting bodies. This developmental program involves two temporally and spatially coordinated morphogenetic events, aggregation of cells into fruiting bodies and sporulation of cells that have accumulated inside fruiting bodies. Cells that remain outside the fruiting bodies do not sporulate (43). Fruiting body formation is initiated by the RelA-dependent accumulation of the intracellular signaling molecule(s) guanosine penta- and tetraphosphate [(p)ppGpp] (16, 53). In addition, two intercellular signals, the A- and C-signals, are important for fruiting body formation. The A-signal consists of a mixture of amino acids and peptides (32) and is part of a system that monitors the density of starving cells (32, 33). The C-signal is a 17-kDa protein (27, 38) that induces and coordinates aggregation and sporulation (26, 30, 37). Fruiting body formation is accompanied by changes in gene expression, and several of the genes required for fruiting body formation are transcriptionally regulated in response to starvation (19, 29). The formation of spreading colonies, as well as fruiting body formation, depends on the functionality of the A- and S-gliding motility systems (18).
Forward- and reverse-genetics approaches have identified 35 TCS genes that are important for fruiting body formation (see Tables S3, S4, and S5 in the supplemental material for a list of these genes). These effects on fruiting body formation, in several cases, e.g., the RRs DigR (44) and AglZ (67), are likely to be indirect and caused by an effect on gliding motility. However, most other TCS mutants display only developmental defects, suggesting that the corresponding proteins are directly involved in development. More than half of the TCS genes required for fruiting body formation are encoded by orphan TCS genes or by TCS genes in complex gene clusters (See Materials and Methods for a definition of TCS genes), and the architecture of the corresponding signal transduction pathways remains poorly understood.
To begin to address the function of the remaining TCS genes in M. xanthus, we used bioinformatics in combination with functional analyses. We report that 71% of the TCS genes are organized in unusual manners as orphan genes or in complex gene clusters whereas the remaining 29% display the standard paired gene organization. Our bioinformatics analyses suggest that TCS proteins encoded by orphan genes and complex gene clusters are functionally distinct from TCS proteins encoded by paired genes and that the connectivity of the pathways made up of TCS proteins encoded by orphan genes and complex gene clusters is different from that of pathways involving TCS proteins encoded by paired genes. Experimentally, we found that orphan TCS genes are overrepresented among TCS genes that display altered transcription during fruiting body formation. The systematic analysis of 25 orphan HPKs that are transcriptionally up-regulated during development led to the identification of 2 HPKs that are likely essential for viability and 4 new HPKs that have important function in fruiting body formation or spore germination.
| MATERIALS AND METHODS |
|---|
|
|
|---|
To identify output domains in RR, HMMER searches were performed on all M. xanthus proteins bearing receiver domains using all matrices provided in the pfam database (http://pfam.sanger.ac.uk/) (10). Results from the searches were inspected, and based on the annotations provided for each pfam matrix, all hits from known output domains were noted. For those proteins for which the above search failed to identify an output domain, possible cryptic, conserved domains were searched for as follows: peptide sequences leading and trailing the receiver domains from these proteins were collected and subjected to a BLAST search of the NCBI nonredundant protein database. Results were manually analyzed, but no new domains were identified.
Transmembrane helices were predicted using the TMHMM 2.0c software package with the provided default model and options (http://www.cbs.dtu.dk/services/TMHMM) (56). If a protein was predicted to contain only a single transmembrane helix, the location of the transmembrane helix was inspected. If the helix was located within the first 20 amino acid residues and would thus overlap with the signal peptide, the HPK was categorized as cytoplasmic.
Classification of TCS proteins in M. xanthus based on genetic organization. Based on the genetic organization of TCS genes, a set of criteria was developed to classify these genes into three groups (Fig. 1), as follows.
|
Paired genes. Paired genes were defined as two adjacent genes encoding an HPK or an HPK-like protein and an RR and transcribed in the same direction.
Orphan genes. All other gene organizations were considered orphan genes. Three che gene clusters are flanked by three orphan genes (MXAN6953, MXAN6955, and MXAN6966) encoding HPKs and one gene (MXAN2687) encoding an HPK-like protein that are not CheA-like, and these genes are classified as orphan.
Cell growth and development for DNA microarray analysis. M. xanthus DK1622 wild-type cells were grown in liquid 1% CTT medium (18) at 32°C to a density of 5 x 108 cells/ml, harvested, and resuspended in prewarmed (32°C) MC7 (10 mM morpholinepropanesulfonic acid [pH 7.0], 1 mM CaCl2) to a calculated density of 5 x 109 cells/ml. The cell suspension was diluted 1:8 with MC7 and 35 ml of the resulting suspension transferred to a sterile 145-mm culture dish (Greiner Bio-One) and incubated at 32°C. After 0, 2, 4, 6, 9, 12, 15, 18, or 24 h of development, cells were harvested, and immediately frozen in liquid nitrogen and stored at –80°C. For the preparation of total RNA from exponentially growing cells, cells were grown as described above, harvested at a density of 5 x 108 cells/ml, and immediately frozen in liquid nitrogen and stored at –80°C. Total RNA from these cells served as "reference RNA." Three biological replicate time course experiments were performed.
Isolation of total RNA and DNase I treatment. Total RNA was isolated from cell pellets using the hot-phenol method (44). One hundred micrograms total RNA was treated with 20 U RNase-free DNase I (Fermentas) for 60 min at 37°C. RNA was purified using the RNeasy minikit (Qiagen). The absence of DNA was verified by PCR. RNA quality was checked using agarose gel electrophoresis (50).
cDNA synthesis, fluorescent labeling, and hybridization. cDNA synthesis was carried out using 25 µg of DNA-free total RNA for each experimental sample (RNA from developing cells) and 25 µg of the DNA-free reference sample as described previously (20, 44) with the following modifications. cDNA synthesis was carried out in the presence of 0.5 mM (each) dATP, dCTP, and dGTP, 0.1 mM dTTP, and 0.4 mM aminoallyl-dUTP in a total volume of 30 µl. Subsequently, RNA was hydrolyzed by addition of 10 µl of 1 M NaOH and 10 µl of 0.5 M EDTA and incubated at 65°C for 15 min, followed by addition of 10 µl of 1 M HCl. cDNA was purified using a Zymo kit (Zymo Research), vacuum dried, and resuspended in 13 µl of fresh 100 mM sodium bicarbonate, pH 9. cDNA of the reference probe was labeled with Cy5 and cDNA of the developmental probe with Cy3 as described previously (20). The M. xanthus DNA microarrays cover 88% of all protein-coding genes in the M. xanthus genome and were printed at the Stanford DNA microarray printing facility and postprocessed as described previously (8, 44). Briefly, each open reading frame (ORF) is represented on the microarray as a 275- to 325-bp PCR fragment. Hybridizations were carried out as described previously (44).
Data acquisition and analysis.
Microarrays were scanned simultaneously at two wavelengths (for Cy3, 532 nm; for Cy5, 635 nm) using a GenePix 4000B microarray scanner (Axon Instruments, Inc.). Image analysis and processing was performed using the GenePix Pro 6.0 software package (Axon Instruments, Inc.). The ratio-normalized data set (mean ratio of medians = 1) containing median signal intensity and median signal background from each channel was further analyzed using Aquity 4.0 software (Axon Instruments, Inc.) and the Significance Analysis of Microarrays (SAM) software (version 2.0), which assigns a score to each feature on a microarray on the basis of changes in gene expression relative to the standard deviation of repeated measurements (63). A filtered subset of all features printed on the array was selected based on the following criteria: (i) found by the Genepix Pro 6.0 spot-finding algorithm ("Flags"
0) and (ii) signal-to-noise ratio of either the Cy3 (532 nm) or Cy5 (635 nm) channel was greater than 2. For statistical significance analysis of the selected data points, SAM was used to calculate a t-like statistic (d) based on the estimated variance of the data. The cutoff value of the SAM analysis was chosen as the value where the median false discovery rate became 0%. For the selected features, ratios were averaged and subjected to a 1.5-fold cutoff criterion (at least in one time point during development) and were further selected for the ones where data points were present for all developmental time points and the expression value at 0 h was between +1.5 and –1.5 (not regulated).
Quantitative real-time PCR. Verification of gene expression data obtained by DNA microarray analysis was carried out using quantitative real-time PCR (qRT-PCR) as described for M. xanthus (44). Briefly, cDNA was synthesized with the cDNA Archive kit (ABI) from 1.0 µg of DNA-free total RNA from six different developmental time points (0, 6, 12, 18, and 24 h) from one biological experiment from the DNA microarray analyses. Primers for qRT-PCR (see Table S1 in the supplemental material) were designed with Primer Express 2.0 (ABI) to give fragments with sizes of 60 to 150 bp. The qRT-PCRs were carried out in triplicate in a total volume of 25 µl containing 12.5 µl SYBR green PCR master mix (ABI), 1 µl of each primer (10 µM), 0.1 µl cDNA, and 11.9 µl H2O. qRT-PCRs were performed on an AB 7300 real-time PCR detection system using standard conditions. Expression ratios were calculated as the absolute expression level in developing cells over the absolute expression level in vegetative cells. The efficiency of each primer pair was determined using four different concentrations of DK1622 chromosomal DNA (10 ng/µl, 1.0 ng/µl, 0.1 ng/µl, and 0.01 ng/µl) as a template in qRT-PCRs.
Construction of mutants in genes coding for HPKs.
In-frame deletion mutants were constructed by two-step homologous recombination (Fig. 2). Briefly, approximately 1,100-bp PCR products containing the in-frame deletions were cloned in the plasmid pBJ114 (22), which contains the galK gene for counterselection. Primers used for the constructions are listed in Table S2 in the supplemental material. Four primers (A, B, C, and D) were designed to amplify the 1,100-bp fragment carrying an hpk in-frame deletion by PCR with M. xanthus chromosomal DNA as a template. Briefly, primers A and B were used to amplify the upstream flanking region of the hpk gene. Primer A contained a restriction site for cloning in pBJ114, and primer B contained either a restriction site for cloning or a region complementary to the downstream flanking PCR fragment. Primers C and D were used to amplify the downstream flanking fragment of the hpk gene. Primer C contained either a restriction site for cloning or a region complementary to the upstream flanking PCR fragment, and primer D contained a restriction site for cloning in pBJ114. The fragments AB and CD were used to generate the full-length in-frame deletion fragment either by direct cloning or in a second PCR with primers A and D and the two flanking PCR fragments as templates. The hpk in-frame fragments were cloned in the plasmid pBJ114 and transformed into E. coli Top 10 (Invitrogen) (F– mcrA
(mrr-hsdRMS-mcrBC)
80lacZ
M15
lacX74 recA1 ara
139
(ara-leu)7697 galU galK rpsL endA1 nupG) and checked by sequencing. Correct plasmids were introduced into the M. xanthus wild-type strain DK1622 by electroporation (25). The insertion of plasmids after the first homologous recombination was confirmed by PCRs with three primer pair combinations (primers are listed in Table S2 in the supplemental material): primers E (binds upstream of primer A) and F (binds downstream of primer D), primers E and M13-forward (hybridizes to pBJ114), and primers F and M13-reverse (hybridizes to pBJ114). For each in-frame construct, at least one clone with the insertion of the plasmid in the upstream flanking region of the hpk gene and one clone with the insertion in the downstream flanking region of the hpk gene were chosen for the second homologous recombination. To isolate clones containing the in-frame deletion, cells were plated on CTT plates with 1% or 2% galactose (Sigma) for counterselection. Galactose-resistant and kanamycin-sensitive colonies were screened out and checked by two PCRs with the primers E and F and the primers G and H, which bind to the deleted part of the hpk gene, to verify the in-frame deletion. For MXAN3036 and MXAN4988, no in-frame deletions were obtained with the 1,100-bp in-frame deletion constructs using the primers labeled "short" in Table S2 in the supplemental material, i.e., all galactose-resistant clones contained in the intact genes. Therefore, for both genes in-frame deletion constructs on 1,400-bp fragments were generated using the primers labeled "long" in Table S2 in the supplemental material. Also, with these constructs, no in-frame deletions were isolated. In the case of MXAN0060, primers A and B were used to generate an internal fragment of MXAN0060, which was cloned into pBGS18 (57) to generate pXS011, which was used to generate an insertion mutation in MXAN0060. M. xanthus strains are listed in Table 1.
|
|
To test for motility defects, cells were grown as described above and plated on 0.5% CTT medium supplemented with 0.5% or 1.5% agar as described previously (52).
Microarray data accession number. The microarray data discussed in this publication have been deposited in the NCBIs Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) and are accessible through Gene Expression Omnibus Series accession number GSE9477.
| RESULTS |
|---|
|
|
|---|
|
Genetic organization of two-component genes in M. xanthus. During the analysis of M. xanthus TCS genes, we noticed that many of these genes are not organized as pairs as is typically reported. To analyze the genetic organization of TCS genes, we developed a set of criteria according to which TCS genes were divided into three categories: complex gene clusters, orphan genes, and paired genes (Fig. 1) (see Materials and Methods). The genetic organization of TCS genes in M. xanthus diverges significantly from the standard paired organization (Table 2; Fig. 3, 4, and 5): 55% (138 genes out of 251 total) of TCS genes are orphans, 16% (39 out of 251) of TCS genes are located in complex gene clusters, and only 29% (74 out of 251) are found as paired genes.
|
|
|
HPKs are generally described as being integral membrane proteins (58). To determine whether HPKs in M. xanthus conform to this general description, we analyzed HPKs for the presence or absence of transmembrane helices. Among the 118 HPKs, 45 (38% of the total number of HPKs) are likely to be integral membrane proteins and 73 are likely to be cytoplasmic (Fig. 3) (see Table S3 in the supplemental material for detailed results). Also in this analysis, we found a biased distribution of the two types of HPKs. Thus, 11 of the predicted cytoplasmic HPKs are encoded by complex gene clusters (corresponding to 61% of all HPKs encoded by these genes), 54 of the predicted cytoplasmic HPKs are encoded by orphan genes (corresponding to 79% of all HPKs encoded by these genes), and only 8 of the predicted cytoplasmic HPKs are encoded by paired genes (corresponding to 25% of all HPKs encoded by these genes).
RRs and output domains. As a second step in understanding the connectivity of TCS in M. xanthus, we divided the RRs into single-domain RRs, which consist only of the receiver domain, and multidomain RRs, which in addition to the receiver domain contain an output domain. The 119 RRs can be divided into 38 without (32% of the total number of RRs) and 81 with output domains (Table 3; Fig. 4) (see Table S4 in the supplemental material for the detailed domain organization). All che gene clusters contain either a CheAY hybrid kinase or CheY homologs (see Tables S3 and S4 in the supplemental material for the detailed description of proteins encoded by che gene clusters). Thus, the remaining single-domain RRs are not likely to be CheY paralogs.
|
Transcriptional regulation of TCS genes during development. Thirty-five TCS genes have been identified as important for fruiting body formation and sporulation (see Table S3, Table S4, and Table S5 in the supplemental material for the specific genes). Several of these genes are developmentally regulated at the transcriptional level during development. Thus, we reasoned that one approach to identify TCS proteins that may have a function in fruiting body formation would be to analyze the expression profiles of TCS genes during development.
For these experiments, we used an M. xanthus DNA microarray covering 88% of the 7,380 ORFs on the M. xanthus genome (see Materials and Methods). The detailed analysis of the experiments will be described elsewhere (N. Hamann, S. Wegener-Feldbrügge, L. Søgaard-Andersen, and R. Hedderich, data not shown). Total RNA was isolated from mid-exponentially growing wild-type cells (DK1622) and from different time points during development (0, 2, 4, 6, 9, 12, 15, 18, and 24 h), and cDNA was prepared, labeled with Cy3 (samples from development) and Cy5 (reference), and competitively hybridized to the microarray. As a reference, the RNA isolated from mid-exponentially growing wild-type cells was used. A total of three biological experiments were performed. Thus, data analysis (Materials and Methods) was carried out on three experimental values for each gene. The ratio-normalized data set was analyzed using Acuity 4.0 software (Axon Instruments) and SAM software, version 2.0, which assigns a score to each feature on a microarray on the basis of changes in gene expression relative to the standard deviation of repeated measurements (63). Genes called to be significantly regulated during development were selected by a delta value of the SAM analysis where the false discovery rate became 0% in combination with a 1.5-fold cutoff and data points for all time points.
Among the 200 TCS genes present on the array, 50 displayed altered expression during development (Fig. 3, 4, and 5; see Table S3, Table S4, and Table S5 in the supplemental material for details on the expression of individual genes). Developmentally regulated genes exhibited expression ratios in the range of 10.9-fold up-regulation to 6.6-fold down-regulation. The changes in gene expression during development were asymmetric: 46 genes were up-regulated, and 4 genes were down-regulated. To validate the significance of the expression data obtained from the DNA microarrays, qRT-PCR was applied to 11 genes (10 genes up-regulated during development and 1 gene not showing regulation). The transcriptional differences determined in the microarray experiments were confirmed by the qRT-PCR analysis (see Table S3, Table S4, and Table S5 in the supplemental material for details of qRT-PCR data).
To generate a complete data set on the expression profile for an entire category of TCS genes, we tested by qRT-PCR the expression during development of the 13 orphan HPK genes for which no expression data were available from the DNA microarray experiments. Among these genes, we found that six were up-regulated and seven were down-regulated during development (see Table S3 in the supplemental material for details of the qRT-PCR data).
In total, we identified 63 TCS genes out of 213 TCS genes tested either in microarray analyses or by qRT-PCRC that were transcriptionally regulated during development (Fig. 3, Fig. 4 and Fig. 5) (see Tables S3, S4, and S5 in the supplemental material for the expression of individual genes). Fifty-two genes were up-regulated and 11 genes down-regulated. Thirty-six percent and 13% of the tested orphan and complex genes, respectively, were transcriptionally regulated during development, whereas only 12% of the tested paired genes were transcriptionally regulated during development.
Genetic analysis of transcriptionally up-regulated, orphan histidine protein kinase genes. The gene expression profiling experiments indicate that orphan TCS may have important functions in fruiting body formation. To test this hypothesis and to potentially identify novel TCS genes required for development, we focused on the 25 orphan HPK genes that are transcriptionally up-regulated during development (Fig. 3) (Table 4 contains a list of all 25 genes, including expression data). These 25 genes include espA (7), sdeK (13), espC (34), asgD (6), and mokA (28), which have previously been suggested to be important for development. To analyze the importance of the remaining 20 genes for fruiting body formation, we sought to generate in-frame deletions in 19 of these genes using a two-step recombination procedure (Fig. 2); for MXAN0060, an insertion mutant was constructed. In-frame deletions were preferred over insertion mutants to avoid polar effects on downstream genes. In addition, we generated or obtained in-frame deletions of the five previously identified orphan HPKs important for development in order to systematically compare developmental defects. For each of the genes MXAN3036 and MXAN4988, more than 200 galactose-resistant, kanamycin-sensitive clones were tested (see Materials and Methods). For both genes, all colonies tested contained the intact HPK gene. These observations suggest that MXAN3036 and MXAN4988 are essential genes for viability. For the remaining 22 genes, we obtained stable in-frame deletion mutants. For MXAN0060, we obtained a stable insertion mutant.
|
MXAN1014 (=
sdek) was unable to aggregate to form fruiting bodies and was strongly reduced in sporulation. Also as previously reported, a mutant carrying
MXAN0931 (=
espA) (SA2139) displayed early aggregation with the formation of many small and irregularly shaped fruiting bodies; also, SA2139 displayed early sporulation with many spores localized outside the fruiting bodies (Fig. 6B). SA2144 carrying
MXAN6996 (=
asgD) displayed delayed aggregation but normal levels of sporulation. This is in disagreement with a previous report, in which an insertion in asgD was reported to cause aggregation as well as sporulation defects (6). We attribute these differences to differences in strain backgrounds and mutations being analyzed in the previous report and the data presented here. SA2112 carrying
MXAN3290 displayed delayed aggregation and formation of abnormally shaped fruiting bodies. SA2112 sporulated at wild-type levels, however, many of the spores were localized outside fruiting bodies. SA2134 carrying
MXAN4465 displayed normal timing of aggregation and sporulation. However, this mutant formed 1.7-fold ± 0.4-fold more fruiting bodies than the wild type and individual fruiting bodies were smaller than those formed by the wild type, covering an area of only 60% ± 7% of that of a wild-type fruiting body. Moreover, the fruiting bodies formed by SA2134 in submerged culture were less condensed than those formed by the wild type (Fig. 6B). SA2138 carrying
MXAN0712 was unable to aggregate to form fruiting bodies and was strongly reduced in sporulation. Finally, SA2118 carrying
MXAN0736 displayed normal aggregation and fruiting body formation and sporulated at wild-type levels. However, these spores germinated at a level threefold lower than that observed with the wild type. It should be noted that the two mutants carrying in-frame deletions of espC (PH1017) and mokA (SA2143) developed in a manner indistinguishable from that of the DK1622 wild type under all conditions tested and sporulated at wild-type levels and with the formation of germination-proficient spores (data not shown). Inactivation of these two genes has previously been reported to cause developmental defects (28, 34). We attribute the difference between published results and our results to differences in strain backgrounds used and to different mutations being analyzed.
|
|
| DISCUSSION |
|---|
|
|
|---|
The M. xanthus TCS genes could be divided into three classes based on their genetic organization. Fifty-five percent and 16% of all the TCS genes are organized as orphan genes or in complex gene clusters, respectively, and only 29% are found as paired genes. We compared this genetic organization of TCS genes to that in other bacteria and found that of the 59 TCS genes in E. coli, 20% are orphan, 8% are in complex gene clusters, and 72% are paired; of the 63 TCS genes in B. subtilis, 14% are orphan, 0% in complex gene clusters, and 86% paired; of the 86 TCS genes in Caulobacter crescentus, 63% are orphan, 7% in complex gene clusters, and 30% paired; of the 126 TCS genes in Pseudomonas aeruginosa, 35% are orphan, 8% in complex gene clusters, and 57% paired; and of the 186 TCS genes in Streptomyces coelicolor, 25% are orphan, 9% in complex gene clusters, and 66% paired (S. Huntley and L. Søgaard-Andersen, unpublished data). This comparison shows that the genetic organization of TCS genes shows large interspecies variations. Moreover, it is evident that the three categories of TCS genes are not peculiar to the M. xanthus genome.
The genome size of M. xanthus is 9.14 Mb, and it has been suggested that lineage-specific gene family expansions (LSE) were major contributors to the genomic expansion (14). Our preliminary analyses suggest that a large fraction of TCS genes in M. xanthus may have arisen by LSE (Huntley and Søgaard-Andersen, unpublished). Alm et al. (1) reported that deltaproteobacteria have a propensity for LSE of TCS genes. In agreement with this, we find that the complements of TCS genes in the three Myxococcales species Sorangium cellulosum (genome sequence provided by Rolf Müller, University of Saarland, Saarbrücken, Germany), Stigmatella aurantiaca (genome at http://cmr.tigr.org/tigr-scripts/CMR/CmrHomePage.cgi), and Anaeromyxobacter dehalogenans (genome at http://genome.jgi-psf.org/finished_microbes/anade/anade.home.html) also contain a large fraction of TCS genes that likely arose by LSE (Huntley and Søgaard-Andersen, unpublished). Interestingly, however, the complements of TCS genes that have expanded by LSE in the four Myxococcales species appear to be different, suggesting that for each species the particular genes amplified provide that species with some selective benefits. A detailed description of LSE of TCS genes in Myxococcales will be presented elsewhere (Huntley and Søgaard-Andersen, unpublished).
We identified three groups of structurally remarkable TCS proteins in M. xanthus. One group consists of 14 HPK-like proteins, which contain only a HisKA domain or a HATPase_c domain (see Table S5 in the supplemental material). Four lines of evidence suggest that these genes are not pseudogenes but code for functional proteins. First, three of these genes (MXAN0461 [= redE] [17], MXAN2670 [= asgA] [45], and MXAN5123 [= mrpA] [59, 60]) are required for development. Second, at least nine of the genes were found to be expressed in global transcriptional profiling experiments. Third, two of the genes are transcriptionally up-regulated during development. Finally, the HPK-like protein YojN, which functions in a phosphorelay with RcsB and RcsC (61) in Escherichia coli, provides evidence that HPK-like proteins may be functional. A second group of proteins with interesting structural features consists of proteins that have organizations of signal transduction domains, which have not been reported previously and which raise interesting questions in terms of how they function in phosphotransfer reactions. For instance, the orphan HPKs MXAN2606 and MXAN2317 are predicted to have the domain structures HisKA-HATPase_c-RR-HisKA-HATPase_c and HisKA-HATPase_c-RR-RR-Hpt, respectively, and the RR MXAN7362, which is encoded by a complex gene cluster, is predicted to have the domain structure RR-Hpt-RR-RR-GGDEF. The third group of TCS proteins with interesting structural features consists of 14 RR with output DUF. These RRs are overrepresented among RRs encoded by complex gene clusters and orphan genes, i.e., 10 and 2 of these domains are found in RRs encoded by orphan and complex genes, respectively. Interestingly, four of these proteins are orphan RRs involved in regulating gliding motility (MXAN2991 [= aglZ] [40, 67], MXAN4149 [= frzS] [39, 66], MXAN4461 [= romR] [35], and MXAN6627 [= sgnC] [68]).
The analysis of M. xanthus TCS proteins revealed structural features that have functional implications. First, 73 out of the 118 HPKs are predicted to be cytoplasmic, suggesting that the many HPKs in M. xanthus may be involved not in monitoring external stimuli or intercellular signals but rather in monitoring cytoplasmic stimuli. Alternatively, they could indirectly be involved in monitoring external stimuli or intercellular signals by interacting with membrane proteins. Second, the analysis of output domains in RRs suggests that the output responses from TCS systems in M. xanthus center on three types, regulation of gene expression, regulation of cyclic-di-GMP metabolism, and unknown functions.
Interestingly, we found strongly biased distributions of different types of TCS proteins encoded by paired genes and orphan genes and in complex gene clusters. These biased distributions have several functional implications, as discussed below. For paired TCS genes, the main implication is that a large fraction of the corresponding proteins are part of simple 1:1 TCS with an integral membrane HPK and a cognate RR that is involved in regulation of gene expression. The predicted membrane localization of the paired HPKs suggests that they are primarily involved in monitoring external stimuli. Moreover, the underrepresentation of these genes among transcriptionally regulated genes during development indicates that they may be functionally active in vegetative cells. Clearly, the latter implication does not preclude a function during fruiting body formation of these proteins. Consistently, 15 paired TCS genes have been identified which are important for development (see Tables S3, S4, and S5 in the supplemental material for the identities of these genes).
For TCS proteins encoded by orphan genes or genes in complex gene clusters, the biased distribution of protein characteristics and expression profiles suggests that the corresponding HPKs are primarily involved in monitoring cytoplasmic stimuli (due to the overrepresentation of HPKs predicted to be cytoplasmic) and that the main output responses from the corresponding pathways are regulation of gene expression, regulation of cyclic-di-GMP metabolism, and unknown functions (as indicated by the overrepresentation of RRs with DUF output domains). Moreover, the overrepresentation of these genes among those that are transcriptionally regulated during development suggests that many of these genes encode TCS proteins with a function only during development. Consistently, 16 orphan TCS genes (including the 4 identified in this report) and 6 TCS genes encoded in complex gene clusters have been shown to be important for development or spore germination (see Tables S3, S4, and S5 in the supplemental material for the identities of these genes). It should be noted that transcriptional regulation during development does not preclude a function in vegetative cells.
A question that remains to be addressed focuses on the connectivity of the TCS proteins in M. xanthus. As mentioned for the paired TCS genes, the almost complete absence of hybrid HPKs and single-domain RRs in the corresponding proteins suggests that the paired genes encode proteins that make up simple, linear 1:1 pathways. For TCS proteins encoded by orphan genes and in complex gene clusters, the connectivity has been analyzed experimentally only for the RedCDEF proteins (= MXAN0459 to MXAN0462), and the data suggest that these four proteins may constitute a complex phosphorelay (17). Since the connectivity of TCS proteins cannot be predicted based on sequence conservation alone (54), this question, therefore, remains open for most of the TCS proteins encoded by orphan genes and in complex gene clusters. The close to 1:1 numerical ratio of HPKs and RRs encoded by these genes could lead to the notion that they could be organized in 1:1 pathways. However, two observations argue against this notion. First, hybrid HPKs are overrepresented among these proteins. Second, many of the RRs encoded by these genes are single-domain RRs. The overrepresentation of hybrid HPKs and RRs without output domains among the proteins encoded by complex gene clusters and orphan genes strongly suggests that the signal transduction pathways encoded by these genes are structured as phosphorelays and/or are branched. Phosphorelays would likely depend on the presence of Hpt domain-containing proteins. In addition to CheA kinases, we identified only two proteins containing Hpt domains, the hybrid orphan HPK MXAN2317 and the RR MXAN7362, which is encoded in a complex gene cluster. It has been argues that Hpt domains are difficult to identify due to the low level of sequence conservation (4); thus, M. xanthus may indeed encode more proteins containing Hpt domains. Clearly, experimental analyses are needed to address the question of the connectivity of the M. xanthus TCS proteins.
We directly tested genetically the hypothesis that orphan developmentally up-regulated genes could be important for development by focusing on the 25 orphan HPK genes that are up-regulated at the transcriptional level during development. Among these genes, we found two (MXAN3036 and MXAN4988) that are likely to be essential for viability and seven that are important for development without having vegetative defects. These seven genes include MXAN0931 (= espA) (7), MXAN1014 (= sdeK) (13, 46), and MXAN6996 (= asgD) (6), which have previously been shown to be important for development. In addition, we identified MXAN0712, MXAN0736, MXAN3290, and MXAN4465 as important for development or spore germination. Finally, inactivation of MXAN6855 (= espC) (34) and MXAN7206 (= mokA) (28), which have previously been reported to be important for development, did not display developmental defects under our conditions. How these seven proteins function in development remains to be determined. Clearly, the lack of developmental defects in the remaining 16 mutants could be caused by functional redundancy among HPKs. Nevertheless, our data have two implications: first, the transcriptional up-regulation of a TCS gene does not necessarily mean that this gene has an essential function during development (at least not under the three conditions tested here). This notion is supported by the observation that several other genes that are transcriptionally up-regulated during development also do not have essential functions during development (29). Second, even though TCS proteins clearly have important functions in development, the large number of TCS genes in M. xanthus may not have evolved solely to regulate fruiting body formation.
| ACKNOWLEDGMENTS |
|---|
The International Max Planck Research School for Environmental, Cellular and Molecular Microbiology and the Max Planck Society supported this work.
| FOOTNOTES |
|---|
Published ahead of print on 9 November 2007. ![]()
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
| REFERENCES |
|---|
|
|
|---|
54 enhancer binding proteins and Myxococcus xanthus fruiting body development. J. Bacteriol. 186:4361-4368.