| MINIREVIEW |
Department of Biology and Biochemistry, University of Houston, Houston, Texas 77204,1 Center for Infectious Diseases, Department of Molecular Genetics and Microbiology, Stony Brook University, Stony Brook, New York 11794-51202
| INTRODUCTION |
|---|
|
|
|---|
|
|
In this review, we summarize the current knowledge of OM protein secretion in terms of the structure of the exporters and the mechanism of secretion and examine similarities and differences among the different export pathways (Table 2). Since many virulence factors are secreted proteins, a more complete understanding of the mechanisms underlying bacterial protein secretion is crucial to the development of new vaccines and treatments for a wide range of human pathogens.
|
| AT SECRETION |
|---|
ATs are synthesized as large multidomain precursors consisting of an N-terminal Sec-dependent signal sequence; an internal passenger domain, or
-domain; and a C-terminal ß-domain. Following transport across the IM and cleavage of the signal sequence, the C-terminal domain inserts into the OM, driving export of the passenger domain to the cell surface. Thereafter, the passenger domain may either remain attached to the bacterial cell or be released to the external milieu (33, 46).
Most of the currently proposed models for AT secretion suggest that upon insertion into the OM, the C-terminal translocator domain folds into a ß-barrel with a central hydrophilic channel (33, 44, 46). An
-helical region located at the N terminus of the translocator domain, called the "linker" region, is believed to adopt a hairpin structure that inserts into the pore, pulling the passenger domain to the cell surface (45). Recently, the crystal structure of the translocator domain of the NalP AT from Neisseria meningitidis was solved, revealing a monomeric 12-stranded ß-barrel that encloses a pore of 10 by 12.5 Å. The ß-barrel exhibits all the features of other OM porin-type structures, and most remarkably, it has both termini located on the periplasmic side of the membrane (52).
A different structural configuration was predicted for the translocator domain of the Hia AT from Haemophilus influenzae, YadA from Yersinia enterocolitica, and NhhA from N. meningitidis. Studies with these ATs supported the formation of a trimeric 12-stranded ß-barrel in which each monomer contributes four transmembrane ß-strands (54, 56, 66). All these proteins are characterized by a very short translocator domain
70 residues in length (33, 56, 66).
A third structural motif for the translocator domain has been proposed for intimin and invasin, which are believed to be putative ATs (46, 75). In this case, the ß-barrel is formed by the N-terminal part of the protein, which inserts into the OM, possibly driving export of the C-terminal domain to the cell surface (46). Studies with intimin suggested formation of a dimer (75).
Based on the current knowledge of the AT translocator domain, various models have been predicted for transport of the passenger domain across the OM. The monomeric nature of the NalP translocator domain supports secretion of the mature AT through a monomeric ß-barrel pore. According to this model, the linker region inserts into the channel, pulling the passenger domain to the bacterial surface and plugging the pore after secretion. This is in accordance with the pore activity of NalP (52). Similarly, blockage of the pore was reported for the recombinant ß-domain of the immunoglobulin A (IgA) protease (76) and the AIDA ß-domain (45). However, the pore dimensions measured for NalP reveal a rather narrow channel that could host only two fully extended polypeptides at a time. This suggests that the linker region can adopt its
-helical conformation only after the passenger domain has exited the pore completely, and thus translocation and folding may be coupled processes (52).
An alternative model of AT secretion has been proposed by Veiga et al. (76), whose studies with a recombinant C-terminal domain of the IgA protease from N. gonorrhoeae revealed the formation of an oligomeric ß-barrel of at least six monomers, with a central pore of
2 nm, that can accommodate transport of folded or partially folded proteins (76).
Following export to the cell surface, the passenger domain adopts its native conformation (33, 46), either with the help of an intramolecular chaperone (51) or by an unknown mechanism. Thereafter, the folded protein may remain attached to the bacterial surface (33, 46) or be cleaved and released to the external environment (30, 33, 46, 65).
| TPS |
|---|
Proteins secreted by the TPS pathway, called TpsA proteins, transverse the IM with the help of a Sec-dependent N-terminal signal peptide (33, 34). A 110-residue conserved "TPS domain" located in their N termini is essential for secretion across the OM (33), a process that involves an integral OM channel-forming protein, TpsB (34). During OM transport, TpsA adopts its native conformation (33, 34), and in the case of cytolysins, this step is coupled to protein activation (62). Thereafter, TpsA may either remain attached to the bacterial cell or be released to the external medium (33).
No TpsB transporter has been crystallized to date, and thus information on their structure comes from sequence analyses and experimental data, derived mainly from studies of three members of the family, Serratia marcescens ShlB, Bordetella pertussis FhaC, and H. influenzae HMW1/2B. TpsB transporters share only a few conserved residues, and their homology is generally limited to structural motifs (34, 80). They are rich in beta structure and are predicted to fold into a ß-barrel (33, 46, 80), which appears to be restricted to their C-terminal portion (27). A highly conserved region with the consensus sequence D/EXAHyXAHyGGXBS/THyRGY/F, where X can be any residue and Hy can be any hydrophobic residue, was found in most TpsB transporters. This motif localizes in a predicted surface loop (80), and studies with FhaC revealed that it might possess an important role in secretion (27). Recent studies with HMW1B demonstrated formation of a tetrameric pore with an estimated size of
2.7 nm (67). Similar pore dimensions were estimated for FhaC as well (32).
In the process of secretion, the TpsA proteins interact physically with the corresponding TpsB transporter on the periplasmic side of the OM, possibly through secretion signals present in the TPS domain (34). Such a direct interaction was recently demonstrated between HMW1B and the N terminus of the HMW1 adhesin (67). The fact that each transporter is responsible for secretion of a specific exoprotein (34) has led to the suggestion that interactions between an exoprotein and its transporter may occur via variant rather than invariant residues of the TPS domain (14). The exoproteins are believed to cross the membrane in an extended conformation from the N to the C terminus and to fold progressively as they reach the cell surface. It is speculated that the TPS domain is involved in nucleating folding (14). Once on the surface, the TpsA protein usually remains noncovalently attached to the membrane, but in some cases it may be released to the external environment (33). FhaC appears to undergo a conformational change during FHA secretion, suggesting that TpsB transporters may be gated by their substrates (27). Thus, interaction with the exoprotein may lead to opening of the transporter's channel, which closes after TpsA has reached the bacterial surface and remains in a closed state until it interacts with another exoprotein. This is consistent with the hypothesis that each transporter can secrete several of its substrate exoproteins and is in agreement with the substoichiometric amounts of FhaC produced and FHA secreted (33).
| CU SECRETION |
|---|
The molecular basis for many aspects of fiber assembly has been revealed by crystal structures, and the CU pathway is one of the best-understood protein secretion pathways. Following translocation across the IM as unfolded polypeptides via the Sec system, subunits must interact with the periplasmic chaperone. The chaperone acts by a mechanism called donor strand complementation to allow proper folding of the subunits while simultaneously preventing premature subunit-subunit interactions (12, 59). Chaperone-subunit complexes next target the OM usher, which serves as a platform for fiber assembly and secretion to the cell surface. Fiber assembly is thought to occur at the periplasmic face of the usher, concomitant with secretion of the fiber through the usher to the cell surface (70). Pili are assembled in a top-down fashion, with the adhesin incorporated first. The usher facilitates this by differentially recognizing chaperone-subunit complexes according to their final positions in the pilus (22, 61). At the usher, chaperone-subunit interactions are exchanged for subunit-subunit interactions in a process called donor strand exchange, which couples chaperone dissociation with fiber assembly (60, 81). A significant finding from recent crystal structures is that the chaperone maintains subunits in a high-energy conformation (60, 81). Subunit-subunit interactions and donor strand exchange then allow subunits to undergo a topological transition to a more compact, lower-energy state. This topological transition presumably provides the driving force for fiber formation and secretion at the usher.
The molecular details of how the usher coordinates fiber assembly and secretion across the OM are less well understood, but structural and functional analyses are beginning to elucidate this process for pilus biogenesis. Chaperone-subunit complexes initially target an N-terminal region of the usher consisting of the first 124 to 139 residues of the mature N terminus (47, 48). Each usher specifically interacts with its own chaperone-subunit complexes, and the N-terminal targeting domain may confer specificity on the usher (47, 72). Binding of a chaperone-adhesin complex may induce a conformational change in the usher, priming the usher for subsequent assembly events (61). Subsequent events include stable interaction of chaperone-subunit complexes with the usher C terminus, which is required for subunit assembly into pili and secretion to the cell surface (61, 72). The P pilus usher PapC forms channels 2 to 3 nm in diameter, which are large enough to allow secretion of folded pilus subunits (71). A recent high-resolution electron microscopy study revealed that PapC assembles as a dimeric complex, with apparent channels located at the center of each monomer (41). Thus, the usher assembles in the OM as a twin-pore secretion complex. Interestingly, a similar twin-pore structure has been identified for the unrelated TOM protein translocase of the mitochondrial OM (1). Exactly how the twin-pore usher complex functions in the biogenesis of surface structures remains to be determined. The dimeric usher presumably would allow the simultaneous binding of two chaperone-subunit complexes, potentially positioning the complexes to favor donor strand exchange between the subunits. The energy generated from donor strand exchange would then be used to drive fiber assembly and secretion through the usher to the cell surface.
| T2S |
|---|
Substrates of the T2S system cross the IM via the Sec translocase and fold in the periplasm with the help of the enzymes present there (58, 64). Transport across the OM involves the products of 12 to 15 genes, depending on the species (58). One of the most intriguing features of the T2S pathway is that it contains only one integral OM protein, protein D, which is believed to form the translocation channel (49, 53, 69). The rest of the proteins that comprise the secretion apparatus seem to be associated with the IM and periplasm, with the exception of the lipoprotein S, which when present, is located on the periplasmic side of the OM (49, 57, 58, 64).
Protein D is a member of the secretin family (58, 64, 69, 80). Secretins range from 50 to 70 kDa in mass (69). In addition to T2S systems, secretins are also essential components of the T3S system and the system for filamentous phage export and assembly (7, 50, 69, 80). As a typical secretin, protein D forms very stable multimeric complexes in the OM, with a highly conserved C-terminal half and a more diverse N-terminal domain (58, 69, 80). The N-terminal protein domain extends into the periplasm and seems to be important for substrate recognition, interaction with the cytoplasmic components of the secretion apparatus, protein multimerization, and possibly gating of the formed channel (49). The C-terminal domain of protein D is thought to be embedded in the membrane, forming the pore through which the substrate is exported to the cell surface (7, 49, 50, 69). This secretin domain is predicted to fold into a ß-barrel (69). In addition, some D proteins contain a small C-terminal domain which binds a secretin-specific lipoprotein S, responsible for proper targeting and insertion of the secretin into the OM (58, 64).
Studies with the D-S complex from Klebsiella oxytoca (PulDS) have provided insights into the structure of the channel. Scanning transmission electron microscopy analyses of the PulDS complexes revealed a ring-like structure of 12 PulD subunits surrounded by 12 radial spokes of PulS (49). The enclosed central cavity has an estimated internal diameter of 7.6 nm, sufficient for passage of folded proteins (49). Side views of the PulDS complex indicate two ring-shaped structures stacked one on top of the other. The broader ring is probably integrated in the OM, whereas the smaller ring seems to protrude into the periplasm as far as the IM, folding back inside the channel, and thus this structure most likely corresponds to the N-terminal domain of PulD (50). This is in agreement with the high-density region observed within the ring structures of both PulD and the Pseudomonas secretin XcpQ (7, 49). Recently, the structure of the secretin PilQ was determined by cryoelectron microscopy to 12-Å resolution (15), revealing a continuous channel, sealed at both the top and bottom, with a diameter of 87 Å at its widest part. These structural features suggested more than a passive role for the secretins and have important implications for their function in OM secretion (15).
Several models have been proposed for secretion across the OM through the T2S pathway. One of them suggests secretion through a central channel formed by the outer surfaces of the ß-barrels of 12 protein D monomers. Alternatively, each D monomer may contribute several ß-strands, leading to formation of a single ß-barrel with a large central pore (69). Formation of macromolecular superstructures has also been proposed, in which several D oligomers of 12 subunits each form higher-ordered pore structures. In this model, secretion may occur either through the central pore formed by the outer surfaces of the oligomers or through the individual pores formed by each D dodecamer of the superstructure (58). This model is further supported by the structural analysis of the secretin PilQ, which suggests formation of a tetramer of PilQ trimers (15). It is speculated that during secretion, the pseudopilin components of the T2S apparatus may act as a piston, pushing the exported protein through the secretin pore (49).
| T1S |
|---|
-hemolysin (HlyA) of pathogenic Escherichia coli (37). Proteins secreted by the T1S systems do not contain a cleavable N-terminal signal peptide; instead, they possess a noncleavable C-terminal signal sequence that targets them to the secretion apparatus (5, 6). The latter consists of three polypeptides: the two components of the IM translocase, which are a member of the ABC superfamily (e.g., HlyB) (5) and a member of the membrane fusion protein family (e.g., HlyD), and the integral OM protein that forms a ß-barrel with a central hydrophilic pore (e.g., TolC) (36). In the case of the E. coli HlyA secretion system, upon interaction with the substrate, the IM translocase recruits TolC via the adaptor HlyD, forming a continuous channel that spans the cell envelope (36). A 38-residue charged N-terminal cytosolic domain of HlyD is essential for TolC recruitment and T1S (3, 36).
The crystal structure of TolC, obtained at 2.1-Å resolution, revealed a homotrimer of three 471-residue monomers that form a single channel (38). The most striking characteristic of TolC is its 100-Å-long periplasmic domain. The TolC channel can be divided into three units: (i) a 12-stranded right-handed ß-barrel OM domain (the channel domain), (ii) a left-handed 12-stranded
-helical barrel domain that protrudes deep into the periplasm (the tunnel domain), and (iii) a mixed
/ß equatorial domain that forms a strap around the midsection of the
-helical barrel. In the TolC trimer, each monomer contributes four antiparallel ß-strands, four antiparallel
-helices, and shorter helices and strands that form one-third of the equatorial domain (36, 38). On the extracellular side of the OM, the TolC channel is fully open and does not have any inward-folding loop to constrict the formed ß-barrel or any plug domain to control channel permeability. At the periplasmic end, on the other hand, the helices of the TolC tunnel are arranged into pairs of coiled coils, which taper to close the pore (36). As a result, the periplasmic entrance of TolC is nearly closed, with an effective diameter of
3.9 Å, and thus must undergo conformational change to allow substrate passage. The periplasmic entrance of TolC is the only constriction of the TolC pore (36).
Transition of TolC from a closed resting state to an open state is a crucial step in protein export (36). Opening of the TolC periplasmic entrance occurs in a substrate-dependent manner and is thought to involve an iris-like rearrangement of the periplasmic tunnel entrance helices (36). This model was further supported by in vivo experiments with E. coli, where the TolC entrance was fixed by introduced disulfide bonds. When the TolC coiled coils were locked into the narrowest entrance configuration, HlyA was no longer exported. The locked TolC, however, was found to be recruited properly by the IM translocase, indicating that the untwisting of the TolC entrance helices is essential for proper TolC-dependent export, as it opens the entrance pore to allow passage of the substrate (23). Once the substrate is exported, TolC reverts to its resting closed state (36).
| T3S |
|---|
T3S is often associated with virulence plasmids and pathogenicity islands and consists of three types of proteins: (i) structural components of the injection apparatus, (ii) effector (secreted) proteins, and (iii) regulatory factors controlling expression of structural and effector proteins (25). The injection apparatus consists of at least 20 different proteins (74) that assemble into a needle complex, which allows protein secretion upon contact with the eukaryotic target cell; hence the term "contact-dependent pathway" that is often used to describe T3S (25). Targeting of the substrates to the T3S apparatus is believed to occur via N-terminal signal peptides, although it has also been proposed that secretion signals might reside in the 5' ends of the corresponding mRNAs (16).
As many as 11 proteins are conserved in the T3S system (31). The OM component of the needle apparatus is one of these proteins, and it is exemplified by YscC from Y. enterocolitica. YscC belongs to the secretin family, and it is the only component of the Yop system that is clearly found in the OM (40). Homologues of YscC include the secretins InvG of Salmonella enterica serovar Typhimurium, EscC of enteropathogenic E. coli, MxiD of Shigella flexneri, and HrcC of the plant pathogen Pseudomonas syringae (20, 26, 40, 63). Being a secretin, YscC multimerizes and forms very stable ring-shaped structures (40) with a molecular mass of
1 MDa (9). Electron microscopy demonstrated that YscC exhibits a 13-fold angular order, which is indicative of an oligomer of 13 subunits (9). Analyses of the formed channels demonstrated a stain-accessible central pore (9, 40) with an external diameter of
14 nm and an inner diameter of
4 nm (9). YscC has an estimated total length of 11 to 12 nm and is composed of two stacked rings and a conical domain. These findings suggest that YscC probably corresponds to the upper two rings of the basal body of the needle complex (9).
Both YscC and InvG require an OM lipoprotein (YscW and InvH, respectively) for proper OM insertion (17, 40). Absence of YscW from Y. enterocolitica results in a lower yield of the YscC secretin and reduced substrate secretion via the T3S pathway (40). Interestingly, YscW is not a stable component of the oligomeric structure (9), as is the case for the PulDS proteins of the T2S pathway. The lipoprotein binding site has been identified in the C termini of various secretins. However, studies with a C terminally truncated YscC revealed that the C terminus is not essential for OM localization and function of YscC in the secretion process or for interaction with the YscW lipoprotein (8). These findings suggest that the C terminus of YscC may render the secretin lipoprotein dependent for OM insertion, which may in turn increase the efficiency of the secretion pathway (8). In addition, YscW promotes YscC oligomerization, which indicates that interaction with YscC occurs prior to the formation of the secretin multimer (8). Studies with the EscC and MxiD secretins have indicated that correct insertion and function of these two proteins in the OM depends on the presence of other T3S apparatus components (26, 63).
Secretion through the T3S pathway is generally believed to occur in one step without the involvement of periplasmic intermediates. In this model, the secreted proteins are exported directly across the two membranes via a continuous channel that spans the cell envelope. The formation of such a continuous conduit by the T3S apparatus has been supported by both biochemical studies and electron microscopic images (16, 26).
| T4S |
|---|
The T-DNA transfer system of A. tumefaciens has provided the basis of what is known about T4S (11, 13). Conjugative T4S systems are composed of three distinct substructures: the CP hexamer, which targets the substrate to the transenvelope apparatus; the transenvelope protein complex for transport of the substrate across the IM and OM; and the conjugative T pilus, for substrate delivery to the target cell. The last two substructures consist of mating-pair formation (Mpf) proteins, which in the case of A. tumefaciens are the VirB1-VirB11 proteins (11, 42). Recent studies have proposed that the proteins VirB2, VirB6, VirB8, VirB9, and VirB11 correspond to channel subunits of the T4S apparatus, whereas VirB4, VirB5, VirB7, and VirB10 interact indirectly with the secreted substrate through their direct association with the channel subunits (10). The rest of the Mpf proteins do not interact with the transfer intermediate (10).
VirB7 and VirB9 are the OM constituents of the translocation pore. Both proteins are predicted to possess large periplasmic domains (19, 24). VirB9 is a 292-residue secretin-like protein that interacts through its C-terminal third with the 55-amino-acid residue OM lipoprotein VirB7 to form a disulfide-linked heterodimeric protein complex (2). This complex is anchored to the OM by a covalent interaction of VirB7 with this lipid bilayer (18, 24) and requires the IM protein VirB6 for proper formation (35). In addition, VirB9 can form homodimers and has also been shown to form intermolecular complexes with the IM proteins VirB8 and VirB10. Formation of VirB9-VirB8 and VirB9-VirB10 complexes involves the N-terminal third of VirB9, whereas self-interacting domains lie in both the N and C termini of VirB9 (18). VirB7 has also been shown to form homodimers that interact with subunits of the T pilus (35). VirB7 and VirB9 homologues have been detected in Helicobacter pylori as components of its surface filamentous structures, indicating that the H. pylori T4S apparatus is a filamentous macromolecular complex that protrudes from the cell envelope (68). The VirB2 pilin has been proposed to act in concert with VirB9 for translocation across the OM. Deletion of either of these two proteins arrests OM transport (10).
Two working models have been proposed for secretion via the T4S system. The "channel model" supports the idea that substrates are transported across the OM through the lumen of a pilus-like structure encased by the OM components VirB7 and VirB9. The "piston model," on the other hand, involves transport through a retractile pilus that extends and pushes the substrate through the OM pore (11). The piston model has been shown to be correct in the case of the F-plasmid T4S machinery and possibly in the case of the B. pertussis Ptl system (11, 42). Future studies are expected to provide a better understanding of the molecular mechanisms that govern substrate export through T4S systems.
| ENP |
|---|
| CONCLUDING REMARKS |
|---|
Two high-resolution crystal structures of OM transporters have been determined to date, the structure of an AT ß-domain (NalP) and that of the OM component of a T1S system (TolC). Information regarding the structures and functions of the other OM transporters (Table 1) has been based primarily on molecular, biochemical, and bioinformatics studies. Comparison of these six protein families indicates that (i) most protein transporters appear to be oligomeric or multimeric, perhaps with the exception of the AT ß-domain; (ii) internal channel dimensions range between 10 and 100 Å, which is significantly larger than those of the general OM porins; and (iii) at least three different protein families, the ushers, the secretins, and the TpsB proteins, fold into two distinct protein domains. The N-terminal domain of the secretins and TpsB proteins appears to be localized in the periplasm and is required for substrate recognition, whereas the C-terminal domain folds into a ß-barrel, providing the pore channel for substrate secretion across the membrane. In the case of the usher, recent analyses indicate that the N terminus, which contains the substrate recognition site, forms part of the ß-barrel, whereas the C terminus folds as a separate periplasmic domain tightly associated with the N-terminal ß-barrel (29).
Despite the vast knowledge acquired over the past years regarding protein transport across the bacterial OM, how these transporters insert and assemble in the membrane remains vague. For many years, it was thought that insertion of proteins into the OM occurs spontaneously without the requirement for a proteinaceous machinery. However, a recent study challenges this model. Voulhoux and colleagues have reported that a highly conserved OM protein, named Omp85 in N. meningitidis, plays an essential role in OM biogenesis (78). Omp85 was found to be directly involved in the biogenesis of general porins, a secretin of a T2S system, OM enzymes, and an AT protein. Although the details remain to be revealed, it is now proposed that biogenesis of OM proteins follows a "periplasmic chaperone (Skp)/usher (Omp85)-like pathway" (79). Future studies will no doubt be directed toward the molecular characterization of this mechanism and the unraveling of the architecture of the Omp85 secretion machinery.
Many more questions still remain to be addressed regarding protein export across the bacterial OM. What is the folding state of the secreted proteins before and during their transport across the OM? How are unfolded or partially folded proteins protected against proteolytic degradation? How do secreted proteins and macromolecular secretion machines get through the peptidoglycan layer? If energy sources are not present in the periplasm and the OM, how do molecules as large as the secreted proteins overcome the unfavorable entropy constraints and pass through the hydrophobic membrane environment? Are any chaperones universally required? What is the fate of an OM transporter after secretion? How do the OM pores prevent leakage of small periplasmic molecules? What mechanisms are involved in channel gating? How is specificity of secreted proteins assured in general? What is the architecture of the T4S and T3S machineries? What are the structures of the components of the different secretion apparatuses? Answers to these questions will provide a more complete picture of the secretion process and will lead to more efficient means of protection against bacterial infections.
| ACKNOWLEDGMENTS |
|---|
Our research is supported by grants from the USDA (94-37204-1091) and the Welch Foundation (E-1548) to C.S, and from the NIH (GM62987) to D.G.T.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |