Previous Article | Next Article ![]()
Journal of Bacteriology, March 2004, p. 1311-1319, Vol. 186, No. 5
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.5.1311-1319.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Repressor Fusions
Jonathan L. Minor, Nicola Reading,
and James C. Hu*
Department of Biochemistry and Biophysics and Center for Advanced Biomolecular Research, Texas A&M University, College Station, Texas 77843-2128
Received 18 August 2003/ Accepted 17 November 2003
|
|
|---|
repressor fusion vectors were subjected to selection for immunity to infection by phage
. Survivors were identified by sequencing the ends of the inserts, and the fused protein sequence was inferred from the known genomic sequence. Four hundred sixty-three nonredundant open reading frame-encoded interacting sequence tags (ISTs) were recovered from sequencing 2,089 candidates. These ISTs, which range from 16 to 794 amino acids in length, were clustered into families of overlapping fragments, identifying potential homotypic interactions encoded by 232 E. coli genes. Repressor fusions identified ISTs from genes in every protein-based functional category, but membrane proteins were underrepresented. The IST-containing genes were enriched for regulatory proteins and for proteins that form higher-order oligomers. Forty-eight (20.7%) homotypic proteins identified by ISTs are predicted to contain coiled coils. Although most of the IST-containing genes are identifiably related to proteins in other bacterial genomes, more than half of the ISTs do not have identifiable homologs in the Protein Data Bank, suggesting that they may include many novel structures. The data are available online at http://oligomers.tamu.edu/. |
|
|---|
Proteins use a wide variety of quaternary structures to assemble multisubunit complexes. Genome-wide identification of protein interactions by use of genetic (21, 36, 46, 57, 58) or biochemical (15, 19, 41) screens has provided a wealth of insight into the diversity of structures used for self-assembly. In the annotation of predicted open reading frames (ORFs), assembly interactions are an important feature that provides insights into structure and function. In addition, the involvement of a gene product in a multimeric complex suggests strategies for the generation of assembly-based inhibitors for functional studies (18). The possibility that protein interactions represent a large and largely underexploited target for drug discovery has also been discussed (6, 55)
The study of the protein interactome has focused on heterotypic interactions, as these can provide links between proteins of unknown function and proteins of known function. However, homotypic interactions, which are found in both homomultimeric proteins and as subcomplexes of heteromultimeric proteins, may be the most common way to form protein complexes in nature (31). Although by definition self-interaction does not link a protein's function to that of another protein, homotypic interactions are important in the study of protein structure, function, and evolution and should be just as useful as heterotypic interactions as potential targets for disruption in functional studies or drug development.
Homotypic interactions are poorly recovered by both two-hybrid and biochemical interaction screens. It has been shown that a modified version of a one-hybrid system based on fusion proteins to bacteriophage
repressor can be used to identify homomultimerization domains from the Saccharomyces cerevisiae genome (36). Our general strategy is to sample genomes for self-assembling domains from libraries of genomic DNA fragments cloned downstream of the
repressor DNA-binding domain. Clones that confer immunity to infection by phage
identify self-assembling proteins and protein fragments. Here, we describe a more extensive study to identify and partially localize homotypic interaction domains encoded by the Escherichia coli genome.
|
|
|---|
(ara-leu)7697
(lac)X74 galE15 galK16 rpsL(Strr) hsdR2 mcrA mcrB1] (20). The repressor fusion libraries were transformed into JH787 [AG1688 (
80 Su-3)]. The screen for insert dependence was done with LM58 [JH787 (
LM58)] and LM59 [AG1688 (
LM58)].
LM58 is a
imm21 specialized transducing phage that carries a pL-cat reporter. Repressor fusion libraries were constructed in pLM99 (GenBank accession no. AF308739), pLM100 (GenBank accession no. AF308740), and pLM101 (GenBank accession no. AF308741) (35-37). These vectors contain an amber mutation at codon 103 of the cI segment, between the DNA-binding domain and the DNA insert, which is used for screening for insert dependence (see below). Expression of the fusion proteins is driven by the P7107 promoter (59), a weak constitutive promoter derived from an operatorless PlacUV5. The P7107 promoter contains multiple mutations relative to PlacUV5 and has the sequences TTTATG and TACATT, respectively, at the -35 and -10 hexamers. While we do not know precisely how strong this promoter is, expression is below the basal levels observed in lacIq1 strains from multicopy expression vectors with the lacUV5 promoter lacking the lacO2 and lacO3 operators. Luria broth (LB) and LB agar were prepared from premixed powders (Difco). 2XYT broth was prepared as described by Miller (38).
Repressor fusion library construction. E. coli K-12 MG1655 (kindly provided by Debby Siegele) was used to prepare genomic DNA. Fifteen high-complexity libraries were generated by using either a multienzyme approach (22) or physical DNA shearing (43) to generate inserts used for the repressor fusion libraries. Enzymes were purchased from New England Biolabs (Beverly, Mass.) unless indicated otherwise.
For the multienzyme approach, a combination of restriction enzymes was used to partially digest E. coli genomic DNA and generate ends that are compatible with the cloning sites present in pLM99, pLM100, and pLM101. Ten micrograms of genomic DNA was partially digested for 1 h at the temperature recommended by the manufacturer. Equal amounts of separate CviTI (Megabase Research, Lincoln, Neb.) (8 U), BstUI (5 U), RsaI (2.5 U), and HpyCH4V (2.5 U) partial digests were pooled and cloned into the SmaI site of pLM99, pLM100, and pLM101 to generate three libraries (EB099, EB100, and EB101) in different reading frames. For the ET099, ET100, and ET101 libraries, partial digests of E. coli DNA with TaqI (0.2 U) were ligated into repressor fusion vectors digested with BstBI. The EN099, EN100, and EN101 libraries were generated from partially NlaIII (2.5 U)-digested E. coli DNA ligated into repressor fusion vectors digested with SphI. The ES099, ES100, and ES101 libraries were generated by using partially Sau3AI (2.5 U)-digested E. coli DNA ligated into repressor fusion vectors digested with BglII. In all cases, the digested vector DNA was treated with calf intestinal alkaline phosphatase (Roche) prior to use in ligations.
For mechanical shearing, the DNA was fragmented with a HydroShear apparatus (GeneMachines, San Carlos, Calif.) according to the manufacturer's instructions. Five micrograms of genomic DNA was subjected to 20 cycles at speed code 5. The average size of the resultant DNA fragments was about 2 kb. After shearing, the ends were converted to blunt ends by adding 4.5 U of T4 DNA polymerase and 21 U of Klenow fragment supplemented with 250 µM deoxynucleoside triphosphates in 1x EcoPol buffer (10 mM Tris-HCl [pH 7.5], 5 mM MgCl2, 7.5 mM dithiothreitol), the reaction mixture was incubated for 40 min at 25°C. The blunt-ended DNA fragments were repurified with a Qiagen QIAquick PCR purification kit. The EH099, EH100, and EH101 libraries were generated by cloning sheared and DNA polymerase-treated DNA into the three repressor fusion vectors digested with SmaI and treated with alkaline phosphatase as described above.
The complexity of the libraries was estimated from the number of transformants obtained in the absence of phage selection. We estimate that each library contains on the order of 106 independent inserts. To estimate the fraction of the clones that contained inserts, primers flanking the multiple cloning site (cI-up, 5'-AGTATGCAGCCGTCACTTAG-3', and LM3-R, 5'-GGGGTTATGCTAGTTATTGC-3') were used to PCR amplify 60 randomly chosen clones from each library. We estimate that 95% of the clones contained a genomic insert from each of the libraries, with a typical insert size of 1,000 ± 500 bp. Amplification reactions were done by PCR with Taq DNA polymerase (Promega).
Selection and screening procedure.
Detailed procedures for selection and screening have been described previously (37). Briefly,
107 JH787 transformants containing unamplified fusion libraries were plated on LB-ampicillin-kanamycin plates seeded with 108 PFU of
KH54 and
KH54h80/plate. The KH54 mutation is a deletion of cI, which prevents the selection phage from forming lysogens, which would be immune to
.
KH54 and
KH54h80 use different receptors to infect E. coli; using both phages simultaneously reduces the background of receptor mutants that would be seen with only one of the two phages. The plates were incubated at 37°C overnight, and survivors were picked into 96-well plates for further analysis. We performed insert dependence tests as described previously (35-37) for the EB099, EB100, and EB101 libraries and found that all of the survivors were insert dependent, as judged by the dependence of repressor function on an amber suppressor that allows translation of the insert. Therefore, the other libraries were not evaluated for insert dependence.
Identification of interacting fragments. Cultures of isolated colonies were grown overnight in 1.5 ml of 2XYT+Amp broth (200 µg/ml) for plasmid preps in 2-ml-deep well plates (Whatman). Plasmid DNA was extracted from the positive clones by the Promega MagnaSil method on a BioMek 2000 laboratory automation workstation. Plasmid preps were stored at 4°C until sequencing reactions were performed.
Inserts were identified by automated dye terminator DNA sequencing from the cI-up and LM3-R primers. DNA sequencing reactions were done with the ABI Big Dye terminator kit (Applied Biosystems), and sequences were obtained at the Laboratory for Plant Genome Technologies at Texas A&M University. Sequence trace files were processed with Phred (12) or Sequencher (Gene Codes Corp., Ann Arbor, Mich.). The sequences from each end of the inserts were identified by BLAST (1) searches against the E. coli protein database (National Center for Biotechnology Information) located at ftp://ftp.ncbi.nlm.nih.gov/blast/db/ecoli.aa.Z, and the full sequences of the interacting sequence tags (ISTs) encoding self-assembling domains were inferred from the reference E. coli genome sequence. Annotations assigning gene names were from EcoGene (50). The DNA traces, FASTA files, and BLAST reports generated for the identification of ISTs were stored in the Doodle (Database of Oligomerization Domains from Lambda Experiments) database at http://oligomers.tamu.edu (L. Mariño-Ramírez, X. Tang, and J. C. Hu, unpublished data).
|
|
|---|
![]() View larger version (35K): [in a new window] |
FIG. 1. General scheme for selection and analysis of ISTs. Random fragments generated by partial restriction digests or mechanical shearing were cloned into a series of cloning vectors to generate fusions to the DNA-binding domain of cI repressor in all three reading frames. These libraries are subjected to selection for repressor function by plating on phage. Survivors are identified by sequencing the ends of the inserts and using BLAST to find the corresponding segment in the complete sequence of the E. coli K-12 MG1655 genome. Clones are saved as frozen cultures, and data about each clone are archived in the Doodle database at http://oligomers.tamu.edu. Details for each step are described in the text.
|
|
View this table: [in a new window] |
TABLE 1. Genes containing ISTs identified in this study
|
![]() View larger version (12K): [in a new window] |
FIG. 2. Functional annotation of genes containing ISTs. Functional classification categories are from Blattner et al. (4) and Riley (48). Filled and open bars show the percentages of the genome and proteins containing ISTs, respectively, assigned to each functional class. Although MG1655 does not contain any plasmids, extrachromosomal genes include prophage genes that are present in the original genomic DNA used to construct the libraries.
|
Genes from the regulation category are overrepresented among the IST-containing genes, consistent with the idea that oligomeric transcription factors are a major component of this functional category. ISTs were found in 62 proteins annotated as transcription factors. The most abundant family of transcriptional regulators in E. coli is the LysR family. Thirteen of the 46 LysR family members were identified. Nine of the 12 DeoR family members of transcriptional regulators were identified. Five of the 15 PurR family members of transcriptional regulators were also identified along with 3 of the 4 RpiR family members.
The diversity of proteins identified by ISTs is also reflected in the evolutionary families they represent. The clusters of orthologous groups of proteins (COGs) database classifies the proteins encoded by 43 sequenced genomes according to their homologous relationships (52). Of the 232 homo-oligomeric proteins identified by ISTs, 210 have a COG assignment, indicating that they are members of conserved protein families. These 210 proteins are distributed among 153 different COGs of 1,905 present in the E. coli genome (Table 1).
We are especially interested in ISTs that might identify new oligomerization domains or motifs. However, we expect that many of the ISTs will be from proteins whose structural basis for assembly has already been determined. To determine how the ISTs are distributed among known and unknown structures, we performed BLAST sequence similarity searches against the Protein Data Bank (PDB) (3) database. Using a cutoff E value of 10-6 and sequence identities of more than 70% to detect E. coli proteins or very close homologs, we found that 23 of the 232 proteins identified by ISTs have structures in the PDB. Twenty-one of the 23 structures found are annotated as homotypic oligomers in the Protein Quaternary Structure (PQS) database (17) with a variety of oligomerization states (Fig. 3a). Although repressor fusions are able to find homodimers, our selection appears to be biased towards recovering higher-order oligomers (Fig. 3b).
![]() View larger version (14K): [in a new window] |
FIG. 3. ISTs from proteins of known structure. (a) Fraction of E. coli ISTs that represent known or inferred structures, as described in the text; (b) distribution of oligomerization states for proteins of known or inferred structure for the complete genome (filled bars) and for E. coli ISTs (open bars).
|
Mapping assembly domains within ORFs. The position of an IST within a gene defines a region sufficient for forming a homotypic interaction. The sizes of the ISTs range from 16 aa for EmrE to 794 aa for YebT. Figure 4a shows the distribution of the lengths of the shortest IST found for each IST-containing gene as a fraction of the length of the complete ORF. Although a majority of the ISTs comprise >90% of the full-length gene product, in many cases the shortest ISTs suggest the presence of a distinct domain that is sufficient for oligomerization. Some genes are represented by single ISTs, whereas in other cases, several ISTs are found for the same gene. The ISTs from the multienzyme libraries generate more multiple hits to the same genes than the libraries made by random shearing. Multiple ISTs in a gene can be used to identify the minimal region or regions involved in a homotypic interaction. For example, we found eight ISTs from ParC, the A subunit of DNA topoisomerase IV; the overlap between these maps the oligomerization domain to aa 333 to 475 (Fig. 4b).
![]() View larger version (18K): [in a new window] |
FIG. 4. Mapping oligomerization domains within IST-containing proteins. (a) Distribution of ISTs as different fractions of the length of the complete ORF. (b) Multiple ISTs define an oligomerization domain in ParC.
|
|
View larger version (16K): [in a new window] |
FIG. 5. (a) Segments of MutS in ISTs compared to MutS crystal structures. The filled bar indicates full-length MutS; open bars show ISTs found in this study. Diagonally hatched bars indicate amino acids in crystal structures of MutS from E. coli (1NG9 and 1E3M) and Thermus aquaticus (1EWQ, 1FW6, and 1EWR). (b) Segments of ClpB in ISTs compared to characterized oligomerization domains (39, 53). Filled bar, full-length ClpB; open bars, ISTs found in this study; diagonally hatched bar, N-terminal domain fragment that is monomeric in vitro; vertically hatched bar, C-terminal domain that is oligomeric in vitro.
|
In four cases, the homotypic ISTs correspond to domains of known structure (SucB, Kch, ArgR, and DnaX). The crystal structure of an oligomeric domain of SucB was determined for a fragment located between aa 173 and 405 (PDB accession no. 1E2O). The minimal IST was found between aa 191 and 405, and the amino acids not present in the IST are unstructured in the crystal structure. The oligomerization domain of the Kch protein is located between aa 241 and 393 (PDB accession no. 1ID1), and its minimal IST was found between aa 229 and 405. The hexamerization domain of the arginine repressor (ArgR) is located between aa 80 and 156 (PDB accession no. 1XXA), and its minimal IST was found between aa 48 and 156. The minimal IST for DnaX was found in between aa 247 and 455, which overlaps with the amino acids seen in the oligomerization domain of the gamma subunits in the clamp loader complex (aa 1 to 373; PDB accession no. 1JR3). Translational frameshifting within dnaX generates two gene products, tau and gamma (5, 14, 56). The domain III fragment (aa 222 to 382) of both tau and gamma has been shown to form homotetramers in vitro (7, 16).
|
|
|---|
repressor fusions, we identified homotypic interactions for 232 proteins encoded by the E. coli genome. As with a similar study with yeast genomic fragments (37), there are several criteria to support the idea that the ISTs identified here represent bona fide oligomerization domains. First, the strong bias toward fusions from annotated ORFs and in the correct reading frame is consistent with a requirement for correct folding; at higher expression levels, peptides encoded by sequences that are not in frame with annotated ORFs are common (23, 60). Second, in several cases, structural or biochemical evidence in the literature supports the oligomerization state of specific ISTs. Third, in no case do we identify a fusion to a protein where we have been able to find evidence that the fused domain should be monomeric. Nevertheless, for many of the genes identified here, the ISTs should be viewed as strong but not definitive evidence for oligomerization. False positives, while rare, are expected in the repressor system under our conditions. For example, repressor fusions to E. coli dihydrofolate reductase, a well-characterized monomeric protein, are immune to phage infection and purify as a mixture of monomers and dimers (unpublished data). Among the genes identified by ISTs, we find oligomerization domains that have been previously identified and many that are novel. The proteins with ISTs that have entries or close homologs in the PDB not only serve as positive controls but also give an idea of the range of different homotypic molecular architectures that can be identified by use of repressor fusions. We used annotations from the PQS database (17) to evaluate the oligomerization of ISTs that correspond to known protein structures. PQS uses an automated algorithm to guess the oligomerization state of a protein by evaluating the surface area buried by protein-protein contacts in crystal structures. While PQS annotations are not perfect, they provide a best guess in cases where biochemical data are not available. In the few cases where PQS annotations do not mark IST-containing proteins as homotypic oligomers, there is good reason to believe that they are homo-oligomeric. GlnG (NtrC) and DnaX have structures in the PDB but are not part of the homotypic PQS subset. However, it is well established that NtrC forms homotypic oligomers (30, 45), and repressor fusions have been used to study the oligomerization determinants within NtrC (13). The dnaX gene encodes two proteins, the tau and gamma subunits of DNA polymerase III holoenzyme. Translational frameshifting occurs at residue 430, which is within the DnaX ISTs. Thus, the phage immunity of these constructs could be due either to cI fusions to segments of tau, which dimerizes to hold together the two catalytic subunits in the DNA polymerase III holoenzyme (29), or to cI fusions to segments of the gamma subunit, which forms a homotetramer in vitro (7) and is part of a heteropentameric subcomplex of the clamp loader in DNA polymerase III holoenzyme in vivo (25). For FruR, a member of the PurR family of transcriptional regulators, a nuclear magnetic resonance structure is available for the N-terminal part of the protein (PDB accession no. 1UXD). However, the ISTs for FruR include C-terminal domains that are not present in the nuclear magnetic resonance structure. Although the oligomeric state of FruR is unknown, the sequences corresponding to the FruR IST form homodimers or homotetramers in other members of the PurR family.
From analysis of E. coli proteins of known structure and their close relatives, it is likely that on the order of half of the proteins encoded in the genome are involved in homotypic assemblies or subassemblies of larger heterotypic complexes (L. Mariño-Ramírez and J. C. Hu, unpublished data). The 232 proteins identified by ISTs thus represent a sampling of the possible oligomerization domains encoded in the E. coli genome rather than an exhaustive enumeration.
We find oligomeric proteins in all functional categories. The ISTs are biased toward transcription factors and against membrane proteins. The bias toward transcription factors is likely to reflect the tendency of regulators to be active oligomers at very low expression levels, comparable to those used here to avoid false positives. In addition, the low expression levels of most transcription factors may be favorable for the recovery of ISTs, as abundant dimeric proteins could interfere with the activity of repressor fusions by titrating them into inactive heteromultimeric complexes. The recovery of ISTs will also be dependent on the topology of the repressor fusions; the repressor domains must be close enough to each other to bind the operator half-sites. This may prevent fusions to integral membrane proteins from properly localizing. A recent report of the use of repressor fusion vectors specifically tailored to detect transmembrane domains identifies nine proteins that were missed here (33). That study assayed for the activity of repressor fusion proteins at expression levels above those used here.
Among the 232 proteins, we found several proteins that have been previously identified by others by use of repressor fusions: IbpB (23), PspA (11), and NtrC (13). However, there are E. coli proteins that are known to form active repressor fusions that were not found in our screen, FtsZ (8), MalK (27), PhoB (13), Fur (51), BglG (2), and YigA (23), consistent with the idea that our screen was not saturated. In addition, there are many proteins we have not recovered as ISTs that we think should be recoverable in repressor fusions. These include LacZ, LacI, CAP, TrpB, and many other well-studied stable oligomers. In several cases, we obtained ISTs for some members of a conserved family of proteins but not from others that are likely to oligomerize similarly. The LysR family of transcriptional regulators (LTTRs) is the second largest family of proteins present in the E. coli genome. It is likely that all LTTRs assemble through similar homotypic interactions into dimers or tetramers, but we recovered ISTs for only 13 of the 43 members of the LTTR family found in E. coli. Similarly, we have identified ISTs for some, but not all, members of the PurR family of transcription factors.
Despite subjecting numbers of clones that should be sufficient to provide full coverage to phage selection, the ISTs recovered from both sheared and restriction enzyme libraries are still missing many oligomeric proteins, indicating that nonrandom factors remain. Although random shearing provided a dramatic improvement in coverage compared to previous studies which used partial restriction digests only (54), several factors may skew the recovery of ISTs even from sheared DNA. First, the shearing process itself may not be perfectly random. Second, different numbers of fusion junctions may be possible for different oligomeric proteins, so that some proteins would be overrepresented even in a perfectly random library. Third, genes that are adjacent to other genes that are toxic in high copy will be underrepresented. Fourth, our ability to recover oligomerization domains depends, of course, on the ability of a fusion protein to assemble enough oligomers to provide immunity to phage
infection. Some oligomeric proteins may simply have dissociation constants that are too high to support repressor activity at the expression levels provided by the weak constitutive promoter in our vectors (37). Based on the phenotypes of fusions to variant GCN4 leucine zippers (59, 61), we estimate that cells expressing dimeric repressor fusions with dissociation constants in the low micromolar range should be immune to
. However, these estimates are based on many extrapolations from in vitro to in vivo conditions and may not be applicable to other proteins. Note also that steady-state expression levels are not the only factor affecting whether a clone is recovered. Freshly transformed cells must express sufficient repressor activity from the plasmid to confer immunity before encountering a phage particle seeded on the plate. We know that the plating efficiency on phage plates after transformation with plasmids carrying different repressor fusions varies over several orders of magnitude under the conditions we used to prevent the recovery of transformation siblings (data not shown). The observed bias toward recovering higher-order oligomers may reflect their improved ability to bind cooperatively to adjacent operators within OR and OL or to form looped repressor-operator complexes between OR and OL (9, 47).
While the present screen has not reached saturation, the ISTs we identified already provide us with a wealth of information about specific E. coli proteins and about oligomeric proteins in general. The identification of an IST defines oligomerization as a biochemical property of the protein containing it and often maps the oligomerization domain within the protein coding sequence. This is often the only functional annotation for hypothetical ORFs and may provide an entry for further studies of biological function. For example, repressor fusions can be used to study how the activity of specific proteins is modulated by controlling oligomerization (2, 24).
In many cases, ISTs identify an oligomerization domain in a specific region of the protein. This suggests the existence of a contiguous and independent folding unit in the protein that drives oligomerization. In other cases, the ISTs found are very close to the full length of the protein. In some cases, the entire protein may be necessary to observe the homotypic interaction. For example, we recovered multiple clones encoding FolX fusions to aa 1 to 120, suggesting that the entire protein was required for a homotypic interaction. FolX forms an octameric ring-like structure where the entire protein appears to be required for proper folding (PDB accession no. 1B9L) (44). However, we cannot conclude that subdomains are not sufficient for the oligomerization of most proteins where the IST covers the entire ORF, as we may have simply failed to sample the appropriate fragments.
More than half of the proteins identified with repressor fusions do not have an identifiable homotypic homolog in the PDB and may represent new folds. The identification of oligomeric subdomains may be useful for structure determination. Multidomain proteins are often difficult to study, and the ISTs should define independently folding domains that may be more amenable to structure determination than the full-length protein.
IST data can be combined with evolutionary analysis to provide better domain mapping and functional assignments. For example, multiple-sequence alignments of the DeoR family of transcription factors suggest two conserved domains separated by a nonconserved linker. Sequence analysis identified a helix-turn-helix located towards the N terminus of these proteins. The best-characterized member of this family, the DeoR repressor, appears to be an octamer in solution (40), but the location of the oligomerization domain was not previously described. Nine of the members of the DeoR family of transcriptional regulators were identified by using repressor fusions (Fig. 6). The ISTs include various amounts of the C-terminal end of the ORF, assigning oligomerization function to the conserved C-terminal domain.
![]() View larger version (13K): [in a new window] |
FIG. 6. Minimal ISTs found for the deoR family of transcriptional regulators with homotypic interactions in E. coli. There are 12 members of this family (COG1349) in E. coli K-12, and 9 of them have been identified with repressor fusions. Bars indicate the lengths of the full-length ORFs, aligned based on the multiple-sequence alignment in the COGS database; spaces indicate gaps in the alignment. Filled regions indicate the smallest ISTs found for each gene. The regions involved in the homotypic interaction cluster at the C terminus.
|
We thank Patricia Klein and Eun-Gyu No for invaluable help with library construction and DNA sequencing, respectively. Rodolfo Aramayo, John Mullet, Debby Siegele, and members of the Hu lab provided useful advice and discussions. Additional technical assistance was provided at various stages of the project by Barbara Blum, Brian Hatten, and Svenja Simon-Marshall.
Present address: Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894. ![]()
Present address: DCMB, University of Texas Southwestern Graduate School, Dallas, TX 75390. ![]()
|
|
|---|
54-dependent transcription. J. Bacteriol. 182:311-319.
repressor fusions. J. Mol. Biol. 259:575-578.[CrossRef][Medline]
repressor DNA binding domain in E. coli cells. Methods Mol. Biol. 205:235-250.
repressor fusions to isolate and characterize self-assembling domains, p. 375-393. In E. Golemis and I. Serebriiskii (ed.), Protein-protein interactions: a laboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»