Previous Article | Next Article ![]()
Journal of Bacteriology, April 2006, p. 3073-3087, Vol. 188, No. 8
0021-9193/06/$08.00+0 doi:10.1128/JB.188.8.3073-3087.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Groupe de Génétique des Biofilms-CNRS URA 2172, Institut Pasteur, 25 rue du Dr. Roux, 75724 Paris CEDEX 15, France
Received 20 December 2005/ Accepted 30 January 2006
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
The biofilm matrix is a complex hydrated milieu that contains proteins, DNA, RNA, ions, and polysaccharidic polymers (9, 61, 62). These polymers are very diverse and include components such as ß-1,6-N-acetyl-D-glucosamine polymer (PIA in Staphylococcus spp. or PGA in Escherichia coli), colanic acid (E. coli), and alginate-, glucose-, and mannose-rich components (Pseudomonas aeruginosa and Bacillus subtilis), as well as cellulose (Salmonella, E. coli, and Pseudomonas fluorescens) (7, 8, 19, 20, 27, 44, 58, 59, 64, 68). Polysaccharides are often involved in the establishment of productive cell-to-cell contacts that contribute to the formation of biofilms at liquid/solid interfaces, of pellicles at air/liquid interfaces, of cell aggregates and clumps in liquid cultures, and of wrinkled colony morphology on agar plates. Evidence for a structural role of some of these matrix polysaccharides is accumulating, and the regulation of the production of these exopolysaccharides is being actively investigated for different bacteria (9, 33, 55).
Cellulose production is a widespread phenomenon in Enterobacteriaceae, including Salmonella enterica serovar Typhimurium, Salmonella enterica subsp. enterica serovar Enteritidis, E. coli, Citrobacter spp., and Enterobacter spp. (49, 58, 67, 68). In association with the production of curli fibers, cellulose synthesis in Salmonella has been shown to be a primary cause of biofilm formation and to lead to a distinctive phenotype on agar plates, the red, dry, and rough (rdar) morphotype in LB medium at 28°C (58, 68). The genetic dissection of the rdar morphotype in Salmonella showed that both curli fibers and cellulose production are positively regulated by CsgD (AgfD in Salmonella), a transcriptional regulator belonging to the LuxR family (47, 51). The roles of CsgD in these two pathways can be monitored by the absorption of the Congo red dye (CR; indicative of curli and cellulose production) or the fixation of calcofluor (CF; indicative of cellulose production). In Salmonella serovar Typhimurium and Salmonella serovar Enteritidis, the ability to bind CF depends on the expression of two divergent cellulose synthesis operons, bcsABZC and bcsEFG, that are constitutively expressed (58, 68). The positive regulation of cellulose production by CsgD is mediated through the transcriptional regulation of adrA. AdrA, in turn, activates cellulose synthesis at a posttranscriptional level by controlling the synthesis of cyclic diguanylate (c-di-GMP). In Gluconacetobacter xylinus, it has been suggested that c-di-GMP binds to BcsB, promoting an allosteric change of the protein conformation that leads to its activation (36). Recently, a PilZ domain proposed to be part of the c-di-GMP binding protein has been identified in several bacterial cellulose synthases, including BscA, although direct evidence for c-di-GMP binding is still missing (2). In addition to being subject to several global regulators, such as OmpR, CpxR, H-NS, and IHF, depending on environmental conditions (25, 31), CsgD expression is also controlled by RpoS through the positive action of the transcriptional regulator MlrA (see Fig. 6A) (11, 46).
|
Here, we have investigated cellulose and biofilm formation in a collection of E. coli isolates. We show that cellulose synthesis is the primary cause of biofilm formation and of the expression of multicellular behavior (rdar morphotype) in the commensal E. coli strain 1094. In this strain, cellulose synthesis does not require CsgD or AdrA, which is indicative of an alternative CsgD-independent cellulose regulatory pathway. We identified the genetic determinant involved in this pathway and provided evidence of the existence of alternative cellulose regulatory networks in E. coli. These findings indicate that the regulation of cellulose production in E. coli is more complex than previously recognized.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
-red linear DNA gene inactivation method using the three-step PCR procedure as described previously (12, 17; for details, see http://www.pasteur.fr/recherche/unites/Ggb/3SPCRprotocol.html). In the pyedQ-uidA fusion mutant in E. coli 1094, yedQ was replaced by uidA. The primers used to delete the genes presented in this study are listed in Table S2 in the supplemental material. All constructs were checked by PCR with specific primers (see Table S2 in the supplemental material). Molecular biology procedures. Standard techniques were used for cloning, DNA analysis, PCR, electroporation, and conjugation, as described previously (54). As the genome sequence for 1094 is currently unavailable, E. coli 1094 genomic sequences were determined from PCR products amplified from the bacterial genome with proofreading polymerase. All DNA sequencing was performed by Genome Express SA, France.
Congo red and calcofluor phenotype assays. Two microliters of an overnight (o/n) culture grown at 37°C in LB medium (with added antibiotics when needed) was spotted onto LB plates (without NaCl) containing 0.004% CR and 0.002% brilliant blue (referred to as CR plates) or onto LB or M63B1-0.4% glucose (M63B1-glu) plates containing 0.02% CF (Sigma reference F-3543) and 1 mM HEPES (referred to as CF-LB or CF-M63 plates, respectively). The spotted drops were allowed to dry, and the plates were incubated for 24 to 48 h at 30 or 37°C. Red or pink colonies on CR plates indicated the binding of CR. Fluorescence of a colony under UV light indicated the binding of CF on CF-LB or CF-M63B1 plates.
Plasmid construction. pZE12-yedQ and pZE12-adrA, which encode YedQ and AdrA, respectively, placed under the control of a synthetic lacp promoter (34), were constructed as follows. The yedQ and adrA open reading frames were amplified from the E. coli 1094 chromosome and cloned into pZE12-gfp by replacing the gfp gene with yedQ or adrA. A similar procedure was used to clone the promoter region and the entire adrA gene from the E. coli 1094 chromosome into pZE12-gfp. The resulting plasmid, pZE1Pr-adrA, encodes adrA placed under the control of its own promoter. The second glycine and first glutamic acid from the GGEEF motif of YedQ were both mutated to alanine, leading to a GAAEF motif. First, we amplified from pZE12-yedQ the yedQ sequence between two FspI sites using the primer FspI-GAEEF-5, which contains two mismatches compared to the wild-type sequence of yedQ from E. coli 1094 and FspI-GAEEF-3. The amplified PCR products were then used to replace the wild-type GGEEF motif of YedQ with a GAAEF motif in pZE12-yedQ, leading to plasmid pZE12-yedQgaaef. DNA fragments corresponding to 247 bp preceding the ATG start codon of yedQ or adrA of E. coli strain 1094 were amplified, and the PCR products were cloned into pRS145 to create pRSpyedQ::lacZ and pRSpadrA::lacZ plasmids, which carried a transcriptional fusion between the promoterless reporter gene lacZ and the yedQ or adrA promoter, respectively. The integrity of all the cloned fragments was verified by sequencing. The sequences of the primers used for plasmid construction can be found in Table S2 in the supplemental material.
Biofilm formation assay. All experiments were performed at least in triplicate in M63B1 minimal medium with 0.4% glucose at 37°C. Sixty-milliliter microfermentors containing a removable glass slide were configured as continuous-flow culture bioreactors with a 40-ml h1 flow rate, which minimizes planktonic growth of the bacteria as described previously (26; http://www.pasteur.fr/recherche/unites/Ggb/matmet.html). The equivalent of 1 optical density at 600 nm (OD600) unit from o/n bacterial cultures grown in 0.4% glucose M63B1 minimal medium supplemented with the appropriate antibiotics was used to inoculate the microfermentors. The bacteria were then cultivated for 24 h. Pictures of the microfermentors were taken before resuspension of the bacterial biofilm. The biomass was estimated by measuring the OD600 of the biofilm after resuspension in the microfermentor by vigorous shaking (the biofilm is weakly attached and easily detached from the surface) and further vortexing once the resuspended biofilm was transferred in a 15-ml tube.
Mutagenesis.
CF-negative mutants were screened after mutagenesis with the ori6K plasmid pSC189 carrying the kanamycin-resistant Mariner transposon described previously (13). pSC189 was conjugated from strain S17-1
pir (pSC189) into strain 1094csgD. The loss of CF-binding ability was screened under UV light at 30°C on CF-LB plates containing kanamycin (selection of the transposon insertion) and spectinomycin (selection for 1094csgD).
Multiplex PCR. Two PCRs were performed for each mutant: one used the primers pairs yhjL.ext5/yhjL.500-3 and cell.2-3/yhjO.ext5, leading to the amplification in the wild-type strain 1094 of 4,266-bp and 3,161-bp fragments, respectively, while the second PCR used the primers pairs cell. 1-3/cell.1-5, yhjO.int3/cell.3-5, and cell.4-3/cell.4-5, leading to the amplification of 3,180-bp, 1,692-bp, and 3,456-bp PCR products, respectively. The primer sequences are listed in Table S2 in the supplemental material.
Arbitrary PCR. To determine the transposon insertion sites in the studied mutants, an arbitrary PCR was used as described previously (14, 18). This method involves a first round of PCR using a primer specific for the right end of the transposon (IR2) and an arbitrary primer (ARB1 or ARB6). A second PCR is then performed on the product from the first PCR, using a primer specific for the rightmost end of the transposon (IR2-60-5) and a primer that is identical to the 5' end of the arbitrary primer (ARB2). Arbitrary PCR primer sequences are listed in Table S2 in the supplemental material.
ß-Galactosidase assays. To determine the ß-galactosidase enzyme activities of strains carrying transcriptional lacZ fusions, pRSpyedQ-lacZ and pRSpadrA-lacZ, cultures were grown o/n (stationary-phase samples) or until the OD600 reached 0.3 to 0.5 (exponential-phase samples) in LB or M63B1-glu medium at 37°C or 30°C. To avoid formation of clumps in the cultures grown in M63B1 medium, 3 µg ml1 of purified endo-ß-1,4-glucanase from Clostridium thermocellum (30) was added to the medium. Biofilms were grown for 24 h in M63B1- glu plus 50 µg ml1 ampicillin medium. The enzyme activity was assayed at least in triplicate for each strain under each condition as described previously (37).
Sequence analysis. Amino acid sequence comparisons were performed using the Clustal program package (28). Homology searches were performed using BLAST 2.0 (http://www.ncbi.nlm.nih.gov/sutils/genom_table.cgi?). DNA sequence manipulations were done with DNA Strider 1.3 (35).
| RESULTS |
|---|
|
|
|---|
These results indicate that the ability to bind CF is a widespread phenotype among natural E. coli isolates. However, this phenotype does not strictly correlate with the capacity to form a biofilm under our experimental conditions.
Cellulose synthesis is necessary for E. coli 1094 biofilm formation. Although E. coli K-12 does not bind CF, it has the homologous genes of the two divergent operons, bcsABZC (formerly yhjONML) and bcsEFG (formerly yhjSTU), which are involved in the synthesis of cellulose in Salmonella (see Fig. 6) (58, 68). These genes could also be amplified by PCR in E. coli strain 1094, an E. coli commensal strain that exhibited strong fluorescence on CF agar plates, a typical rdar morphotype on CR plates, and a strong biofilm phenotype in a microfermentor under all tested conditions (LB; M63B1-glu; 30 or 37°C) (Fig. 1A and B). To test the roles of these genes in the different phenotypes displayed by strain 1094, we deleted bcsC, bcsA-C, or bcsE-G genes in E. coli 1094. These deletions abolished CF binding and led to smooth and red (sar) colonies on CR plates under all tested conditions (Fig. 1A and data not shown). Furthermore, the 1094bcsC mutant lost its ability to form biofilm in microfermentors (Fig. 1B).
|
Taken together, these results indicate that the extracellular material produced by E. coli 1094 is cellulose and that it depends on the expression of the bcs operons. Furthermore, cellulose production is absolutely required for the expression of mature biofilm and multicellular behavior in E. coli 1094.
CsgD does not regulate cellulose synthesis in E. coli 1094. In Salmonella, cellulose synthesis generally occurs at 28°C in LB medium and is regulated by CsgD via the MlrA-CsgD-AdrA pathway (hereafter referred to as the CsgD-dependent pathway) (see Fig. 6A). In E. coli 1094, however, cellulose production (determined by CF binding on agar plates) occurs under all tested conditions (30 and 37°C; LB; M63B1-glu), suggesting that the regulation of cellulose synthesis may be different in Salmonella and E. coli 1094. In order to test the contribution of CsgD to cellulose regulation in E. coli 1094, a 1094csgD mutant was grown on CF and CR agar plates. Surprisingly, 1094csgD displayed a wild-type fluorescent phenotype on both LB and minimal-medium agar plates containing CF at both 30°C and 37°C (Fig. 1A), suggesting that CsgD is not involved in the regulation of cellulose synthesis in E. coli 1094. Consistent with this, in E. coli 1094, the deletion of regulatory genes acting either upstream or downstream of csgD (mlrA/yehV and adrA/yaiC, respectively) led to the same phenotype as a csgD mutant on CF plates (Fig. 1A). Moreover, all three mutants exhibited a rough and dry morphotype on CR plates and an almost wild-type biofilm phenotype in microfermentors (Fig. 1A and B). As expected from the loss of curli fiber (which bind CR) production in both 1094csgD and 1094mlrA strains, colonies formed by these strains appeared pink instead of red on CR plates compared to the wild-type 1094 or the 1094adrA strain (Fig. 1A). This phenotype, comparable to the one displayed by a 1094csgA strain (Fig. 1A), is indicative of a reduced ability to bind CR. It also indicates that both the mlrA and csgD genes are expressed and functional in E. coli 1094.
These results demonstrate that mlrA, csgD, and adrA do not regulate cellulose production in E. coli 1094, suggesting the existence of an alternative and CsgD-independent cellulose regulation pathway in this strain.
RpoS participates in both CsgD-dependent and CsgD-independent cellulose pathways. In Salmonella, RpoS, the main sigma factor of the stationary growth phase, is involved in the CsgD-dependent cellulose regulation pathway via the transcriptional control of MlrA (11). Although neither mlrA, csgD, nor adrA mutants have any impact on cellulose production in E. coli 1094, the deletion of the rpoS gene in E. coli 1094 led to a CF-negative phenotype on LB medium at both 30 and 37°C, as well as a smooth and pink (sap) morphotype on CR plates (Fig. 1A). In minimal medium, E. coli 1094rpoS still displayed a low and inconsistent CF-positive phenotype associated with a reduced but significant biofilm formation phenotype (Fig. 1B). The introduction of a wild-type rpoS allele on the plasmid pSTC14 complemented both the CF-negative phenotype and the sap morphotype (data not shown). Since the pink phenotype is indicative of a lack of curli expression, these results indicate that, in LB medium, RpoS both regulates curli expression and contributes to the CsgD-independent cellulose synthesis regulation pathway. Therefore, although cellulose synthesis in E. coli 1094 is independent of CsgD, it still depends on RpoS.
Screening for E. coli 1094 mutants impaired in cellulose production. In order to identify the genetic determinants of the production of cellulose in E. coli 1094, we performed a Mariner transposon mutagenesis on 1094csgD. We screened for CF-negative mutants on CF-LB agar plates at 30°C and identified 17 independent nonfluorescent mutants out of 7,000 clones.
To directly identify expected CF-negative mutants due to the insertion of the transposon in the cellulose genes, we performed a multiplex PCR on the two divergent bcsABZC and bcsEFG cellulose operons. A transposon mutant exhibiting a non-wild-type multiplex PCR profile indicates that, in this mutant, the transposon was inserted in one of the cellulose genes. We found that 11 out of 17 independent mutations were indeed located in the bcsABZC operon and 1 in the bcsEFG operon. These mutants displayed a CF-negative phenotype and a white and smooth phenotype on Congo red plates (data not shown). The natures of the five remaining CF-negative mutants carrying a transposon insertion located outside of the cellulose operons were determined by sequencing the regions adjacent to the transposon insertion site. The five mutants corresponded to insertions in four different loci belonging to three functional classes and are presented in Table 2.
|
Finally, one insertion was found in the yedQ gene. This uncharacterized gene (1,695 bp) is predicted to encode a putative protein of 564 amino acids (aa) (64.11 kDa) with an unknown function. Domain analysis using the protein family (Pfam) database indicated the presence of a signal peptide (aa 1 to 45), two potential transmembrane domains (aa 20 to 42 and 358 to 380), and, most interestingly, a carboxy-terminal GGDEF domain (aa 394 to 557), shared by proteins that display proven or suspected c-di-GMP synthase activity, such as AdrA in Salmonella (Fig. 2A) or PleD of Caulobacter crescentus (42, 56). Interestingly, yedQ is highly similar (64.7% amino acid identity) to GcpA (STM1987), a regulator of cellulose synthesis in minimal media at 37°C identified recently in Salmonella serovar Typhimurium (23).
|
|
Altogether, these results indicate that YedQ is a new GGDEF regulator of cellulose synthesis in E. coli.
RpoS does not regulate yedQ expression in E. coli 1094. Because of the functional similarity between YedQ and AdrA, we tested whether they could share an RpoS transcriptional control and if the CF- and CR-negative phenotypes displayed by 1094rpoS in LB could be due to the lack of yedQ expression. Neither CF nor CR binding could be restored in 1094rpoS(pZE12yedQ) in LB medium, and the introduction of yedQ in a 1094rpoS strain led to irregular levels of CF binding in minimal medium (data not shown).
To further study the regulation of yedQ transcription, the activity of a plasmid-borne yedQ-lacZ transcriptional fusion (pRSpyedQ::lacZ) introduced in either E. coli 1094 or 1094rpoS was tested under various conditions, including 30°C and 37°C and exponential and stationary phases in liquid rich and minimal media, as well as biofilm. These experiments showed that the transcription of yedQ is only moderately affected by the tested environmental conditions (twofold increase in minimal medium at stationary phase compared to LB) (Fig. 4A; see also Fig. S2A and B in the supplemental material). Moreover, the rpoS mutation did not significantly affect the expression of the yedQ-lacZ fusion (Fig. 4A; see also Fig. S2A and B in the supplemental material). Therefore, in contrast to Salmonella adrA expression, neither RpoS nor CsgD directly regulates the expression of yedQ. In an attempt to identify regulators of yedQ activity, we also mutagenized the strain 1094
yedQ::uidA. This strain carries a chromosomal pyedQ-uidA (ß-glucuronidase) transcriptional fusion and is pale blue to white on LB plates containing 5-bromo-4-chloro-3-indolyl-beta-D-glucuronic acid (a chromogenic substrate for UidA) using the plasmid pSC189 bearing a Mariner-based transposon. Since the extremities of the transposon have strong constitutive promoter-out activity (A. Roux and J. M. Ghigo, unpublished data), this mutagenesis allowed us to screen both for the inactivation of yedQ repressor and for the promoter-out-mediated activation of yedQ activators by looking for blue colonies due to pyedQ derepression or activation on LB-5-bromo-4-chloro-3-indolyl-beta-D-glucuronic acid at 30°C. This new screen failed to identify any regulators of yedQ expression. Altogether, these results suggest that yedQ is constitutively expressed under all tested conditions in E. coli 1094.
|
|
We then compared the expression levels of yedQ and adrA in E. coli 1094 using a plasmid-borne transcriptional lacZ fusion to either the yedQ or the adrA promoter (pRSpyedQ::lacZ and pRSpadrA::lacZ, respectively). These comparisons were performed in minimal or rich medium at 30°C or 37°C in exponential- and stationary-phase cultures or biofilm. In strain 1094, adrA is 5- to 16-fold less expressed than yedQ, depending on the environmental conditions (Fig. 4B; see also Fig. S2C and D in the supplemental material).
We did not observe a decrease in the expression of the padrA-lacZ fusion in the 1094csgD mutant, suggesting that under these conditions, adrA expression does not depend on CsgD (Fig. 4D). Since csgD is expressed and functional in E. coli 1094 (see above), the low level of adrA1094 expression could result from the inability of its promoter to be recognized and activated by CsgD. The comparison of the adrA promoter sequences of E. coli strains MG1655 and 1094 showed only a few differences (Fig. 4C). To assess if these differences could affect the capacity of the adrA1094 promoter to be activated by CsgD, we introduced the adrA-lacZ fusion in the strains MG1655 and MG1655ompR234 (which up-regulates csgD) and their respective csgD mutants (10). As shown in Fig. 4D, adrA expression was reduced in an MG1655csgD mutant but was significantly increased by the ompR234 mutation in MG1655 (Fig. 4D). The adrA1094-lacZ fusion activity was also severely reduced in the double ompR234-csgD mutant (Fig. 4D), showing that the adrA promoter of strain 1094 can be recognized and activated by the CsgD protein of MG1655.
Taken together, these results demonstrated that AdrA is potentially functional in strain 1094. Moreover, although it can be activated by CsgD in E. coli MG1655, the adrA1094 promoter is not regulated by CsgD in E. coli 1094, and its expression level is low compared to YedQ expression.
CsgD-independent cellulose production is found in several natural E. coli isolates. To investigate whether E. coli isolates other than 1094 may also display a CsgD-independent cellulose regulation pathway, we studied three other CF-positive E. coli strains from our collection: strains 1125, DSM6601, and 55989 (Table 1; see also Table S1 in the supplemental material). These three strains appear to produce cellulose, as a deletion of the bcsC gene led to a CF-negative phenotype on CF agar plates and a sar phenotype on CR plates (Fig. 5).
|
Interestingly, 1125yedQ remained CF positive and exhibited an rdar morphotype on CR-LB plates (Fig. 5C). To exclude the possibility that AdrA could regulate cellulose synthesis independently of CsgD in this strain, the adrA mutation was introduced in 1125. This mutant displayed wild-type phenotypes on CF and CR plates (Fig. 5C). Finally, in order to check that the CF-positive phenotype exhibited by either the 1125adrA or 1125yedQ mutant did not result from a cross-complementation by yedQ or adrA, respectively, the double mutant 1125adrA-yedQ was created. It exhibited the same phenotypes as the wild type (Fig. 5C).
These results demonstrate that CsgD-independent and YedQ-dependent cellulose synthesis is not restricted to the E. coli 1094 strain and also operates in DSM6601. Moreover, since neither YedQ, CsgD, nor AdrA is required to bind CF or CR in E. coli 1125, this suggests that the regulation of the cellulose production in this strain uses a different genetic pathway, possibly involving one of the 17 other GGDEF proteins encoded by genes identified in the E. coli genome (21).
| DISCUSSION |
|---|
|
|
|---|
Although laboratory E. coli K-12 derivative strains do not produce cellulose, the polymer has been detected in a few natural E. coli isolates (68). Our analysis of a collection of 87 E. coli strains that did not undergo significant laboratory subcultivation showed that cellulose production is a common phenotype in E. coli. This trait was not, however, strictly associated with strong biofilm formation capacities, suggesting that cellulose synthesis is a biofilm determinant of some but not all E. coli strains. Nevertheless, as CF is also a nonspecific indicator of chitin and succinoglycan polymers, we cannot exclude the possibility that some of these biofilm-negative but CF-positive strains do not produce bona fide cellulose, a hypothesis that could be tested by the systematic introduction of mutations in the cellulose operons of these strains. We also observed that half of the non-CF-binding strains that were assessed for biofilm formation were indeed able to form a biofilm. This suggests that, in these strains, other proteinaceous components (such as pili or curli) or other exopolysaccharides, such as colanic acid or poly-1,6-GlcNAc (PGA), could be involved in biofilm formation (4, 16, 44, 64).
To investigate the link between cellulose production and biofilm formation in E. coli, the commensal E. coli strain 1094, which exhibits both rdar and strong CF-binding phenotypes and forms a biofilm, was studied in more detail. As observed with Salmonella (23, 48, 58, 68), cellulose is required for biofilm formation in E. coli strain 1094. However, unlike in Salmonella, curli production is not necessary for the expression of a mature biofilm and multicellular behavior. Indeed, whereas the deletion of the cellulose operons abolished these phenotypes, the 1094csgD or 1094csgA mutant behaved like the wild-type strain.
Considering the conservation of the genes involved in cellulose production and regulation in Salmonella and E. coli, it was reasonable to assume that CsgD and AdrA could play roles in cellulose production in E. coli (46). Indeed, during the course of this work, a study of cellulose and curli expression in E. coli showed that a representative fecal isolate of E. coli regulates cellulose synthesis via CsgD (5). However, the analysis of the genetic basis of cellulose synthesis in the commensal E. coli strain 1094 allowed us to show that neither CsgD, AdrA, nor MlrA is involved in this process in E. coli 1094.
Using a genetic screen based on the ability of the cellulose-producing strain 1094csgD to bind the fluorescent dye CF, we identified five mutants defective for CF binding located outside of the cellulose operons. Three of these transposition mutants corresponded to insertions in the glucose-1-P-uridylyltransferase (galU) and phosphoglucose isomerase (pgi) genes. Because these enzymes play a direct or indirect role in the synthesis of UDP-glucose, the sugar subunit of cellulose, both galU and pgi are likely to have an effect on cellulose synthesis per se, but not on its regulation. galU has been shown to be necessary for biofilm formation and biosynthesis of the exopolysaccharide of a phage-resistant rugose variant of Vibrio cholerae and to affect E. coli early adhesion (24, 38). A pgi mutant has been isolated from a mutagenesis screen performed on a CF-positive Salmonella serovar Enteritidis strain, along with other genes involved in different sugar metabolic pathways (58).
One of the five CF-negative mutants identified in our screen corresponds to an insertion in the yedQ gene. The protein with an unknown function, YedQ, is predicted to be a membrane-associated protein that contains a GGDEF (DUF1) domain. Proteins displaying a GGDEF domain are widespread, are present in most bacterial genomes, and form a large family of proteins that have been associated with diguanylate cyclase activity and c-di-GMP production (22, 29). The discovery of c-di-GMP and the control of its intracellular level by proteins with diguanylate cyclase (GGDEF domain proteins) and/or phosphodiesterase (EAL domain proteins) activities originated from work on cellulose production in G. xylinus (52, 63). In Salmonella, the GGDEF domain protein AdrA has been shown to be involved in cellulose synthesis regulation and its c-di-GMP synthase activity has been recently demonstrated (45, 56). Although most of the GGDEF domain proteins have not yet been experimentally characterized, it is now considered that c-di-GMP, in addition to its role as an allosteric activator of cellulose synthesis, is a second messenger involved in a number of cellular and behavioral functions, such as the transition from sessility to motility, cell cycle-dependent localization of proteins, and cellulose and biofilm formation (3, 6, 22, 23, 29, 33, 42, 45, 50, 51, 56, 58).
The construction of a null yedQ mutant in strain 1094 and its plasmid complementation confirmed that, under all tested conditions, YedQ is required for cellulose synthesis and biofilm formation in this strain. A site-directed mutagenesis in the GGEEF motif of YedQ showed that this motif is essential for cellulose synthesis in E. coli 1094, as well as for YedQ1094 to complement an AdrA defect in Salmonella serovar Enteritidis. Altogether, these results suggest that, like AdrA in Salmonella, YedQ contains a functional GGDEF domain and that we have identified a new cellulose regulatory protein in E. coli 1094 that is probably involved in the regulation of the level of intracellular c-di-GMP through its GGDEF domain activity (56). During the course of this study, gcpA (STM1987), a gene encoding a GGDEF protein sharing 64.7% identity with YedQ, was shown to be critical for cellulose and biofilm formation in Salmonella serovar Enteritidis (23). However, unlike YedQ, which is required for E. coli 1094 cellulose synthesis in rich and minimal media at both 30 and 37°C, in Salmonella serovar Enteritidis, GcpA regulates cellulose production under specific culture conditions in a CsgD-independent manner. Indeed, while in Salmonella serovar Enteritidis cellulose production depends on CsgD and AdrA in LB medium at 28°C, it depends only on GcpA in ATM at 37°C (23). These differences suggest that yedQ and gcpA may not be similarly regulated in E. coli 1094 and Salmonella serovar Enteritidis, as suggested by the differences observed between the two promoters. Therefore, while the roles of YedQ/GcpA could be different in E. coli and Salmonella serovar Enteritidis, they are both involved in CsgD- and AdrA-independent regulation of cellulose production.
Our finding that an rpoS insertion mutant leads to a CF-negative phenotype in E. coli 1094 suggested that the CsgD-dependent and -independent pathways, depicted in Fig. 6, may share a common root, i.e., that like adrA, yedQ expression could also be regulated by RpoS. However, the expression of an inducible overexpressed copy of yedQ in a 1094rpoS mutant did not reverse the CF-negative phenotype of a 1094rpoS mutant on LB-CF plates (data not shown). Moreover, a pyedQ-lacZ transcriptional fusion was still expressed in a 1094rpoS background, showing that RpoS is not required for the expression of yedQ (Fig. 4; see also Fig. S2 in the supplemental material). Although we did not obtain evidence that yedQ could be subjected to any significant transcriptional regulation under our experimental conditions, we cannot exclude yedQ regulation at either a transcriptional or a posttranscriptional level. Nevertheless, RpoS is required for the CF- and CR-binding abilities of strain 1094, suggesting that in this strain, RpoS could regulate the transcription of the cellulose bcs operons or a metabolic pathway important for cellulose biosynthesis, such as UDP-glucose synthesis, depending on environmental conditions (Fig. 6).
The functional specificity of the multiple GGDEF domain proteins producing freely diffusible c-di-GMP and present in a single bacterium is an important and currently unresolved question (29, 50). Different mechanisms have been proposed, involving either the temporal and environmental transcriptional regulation of GGDEF and EAL domain protein expression, the spatial control of the production of c-di-GMP near its cellular target proteins, or the fine tuning of the activity of GGDEF and EAL domain proteins by a specific ligand or by c-di-GMP itself (1, 29, 33, 42, 50, 59, 60). In E. coli 1094, whereas complementation experiments showed that AdrA is functional when overexpressed, transcription analysis using multicopy padrA-lacZ and pyedQ-lacZ fusions showed that adrA is weakly expressed compared to yedQ. These observations support the hypothesis that the specific use of the yedQ over the adrA cellulose pathway is not due to a defect in AdrA protein activity but may be due in part to substantial differences in gene expression. Indeed the physiological level of expression of AdrA was not sufficient to circumvent yedQ defects in E. coli 1094, whereas its expression from a high-copy-number plasmid could complement 1094yedQ. The level of protein expression has been shown to be important for the complementation of the Salmonella serovar Typhimurium adrA mutant by the Yersinia pestis GGDEF domain protein HmsT (55).
The alignment of the sequences of the adrA promoters from E. coli strains 1094 and MG1655 shows very few differences (97.7% identity) (Fig. 4C). It is not yet clear if the identified differences in the promoter sequence could explain the lack of CsgD activity on the adrA1094 promoter, thus leading to a low level of expression of adrA in E. coli 1094. Interestingly, however, CsgD can activate the adrA1094 promoter in E. coli MG1655 (Fig. 4D), suggesting that the adrA1094 promoter is functional but is independent of CsgD in E. coli 1094. It was previously suggested that CsgD could activate the csgB (curli expression) and adrA promoters by different mechanisms, since the location of the putative CsgD-binding sites are different in the two promoters (10). Our results raise the possibility that in E. coli 1094, CsgD has diverged compared to CsgDMG1655 and has lost its ability to regulate adrA while still regulating the csgB promoter. This hypothesis is under investigation.
The regulation of the level of adrA expression does not, however, formally exclude the existence of functional specificities of YedQ and AdrA, possibly related to structural differences outside of their GGDEF domains. Indeed, although YedQ and AdrA are predicted to be membrane proteins and share a GGDEF domain (with 32.9% identity and 48.8% similarity), they have distinct N-terminal domains (Fig. 2B). The N-terminal domain of YedQ is predicted by Phobius software analysis to contain two transmembrane domains and to be mainly periplasmic (32). It does not match any conserved domains in the Pfam database. On the other hand, the N-terminal domain of AdrA is a MASE2 domain, which is predicted to be an integral membrane sensory domain found in many diguanylate cyclases (39, 66). These structural differences in the N-terminal domains of the YedQ and AdrA proteins could lead to distinct localizations and interactions with potential partners and diverse abilities to sense signals for protein activation or repression, all leading to synthesis of c-di-GMP at a specific place and time.
Based on nucleotide sequence homologies, it has been proposed that the cellulose biosynthesis genes could constitute a functional module acquired by horizontal gene transfer under some environmental conditions in which the expression of the rdar morphotype could constitute a selective advantage (53). The acquisition of the capacity to activate cellulose biosynthesis through the production of c-di-GMP would have been acquired after the establishment of the cellulose genes in the genome (46). If so, owing to the large number of proteins potentially involved in the process (11 proteins with a GGDEF domain in Salmonella serovar Typhimurium and 19 in E. coli), it is possible that, depending on the strain, a GGDEF protein different from AdrA, and possibly more highly expressed due to simple activating mutations, might have fulfilled this regulatory function. In agreement with this hypothesis, E. coli strain 1094 displays a CsgD- and AdrA-independent CF-positive phenotype, which involves YedQ. Furthermore, we found that such a CsgD-independent, YedQ-dependent cellulose pathway could also be found in another commensal E. coli strain, DSM6601. Additionally, in the case of E. coli 1125, an as-yet-uncharacterized GGDEF protein, distinct from both AdrA and YedQ, may have acquired this function, as the adrA or yedQ mutants, as well as the adrA/yedQ double mutant in strain 1125, retain their CF-binding abilities (Fig. 5C). Interestingly, experimental data that do not fit the cellulose regulatory pathway schematized in Fig. 6A have been reported (see the introduction and references 23, 48, and 58). This suggests that alternative CsgD-independent pathways may also exist in these Salmonella strains and possibly in other Enterobacteriaceae. Further studies are under way to test this hypothesis. Owing to the roles of CsgD in different biological processes subject to a complex network of regulatory proteins, the appearance of CsgD-independent cellulose regulation pathways provides a way to uncouple CsgD-regulated processes (e.g., curli regulation) from cellulose synthesis. Such an uncoupling may be favored when the expression of cellulose would confer a selective advantage under conditions in which adrA is usually not expressed, such as the high temperature in the human gastrointestinal tract. This may provide a selective pressure leading to the emergence of CsgD-independent cellulose pathways involving regulated diguanylate cyclases, such as YedQ.
In conclusion, the regulation of enterobacterial cellulose production is more complex than previously recognized and, in E. coli, can depend on alternative, and possibly widespread, cellulose regulatory pathways. The characterization of an alternative pathway for the regulation of cellulose production in E. coli illustrates the diversity of the regulatory arsenal used by bacteria to control the synthesis of polysaccharidic components of the extracellular matrix.
| ACKNOWLEDGMENTS |
|---|
J.-M.G. is supported by Institut Pasteur, CNRS URA 2172, and Fondation BNP PARIBAS grants. S.D.R. is supported by Sanofi-Pasteur.
| FOOTNOTES |
|---|
Supplemental material for this article may be found at http://jb.asm.org. ![]()
| REFERENCES |
|---|
|
|
|---|