Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120
Received 12 November 2001/ Accepted 31 March 2002
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Another useful reaction is integration for the purpose of creating knockin and knockout alterations of the genome, such as those desirable in gene therapy, creation of transgenic organisms, and modification of cells in culture. For integration, a unidirectional recombinase such as a phage integrase is ideal, because there is no reverse reaction that could depress net integration frequency (9). Phage integrases mediate recombination between nonidentical phage attP and bacterial attB recognition sites (13). The well-studied lambda integrase is, like Cre and FLP, a member of the tyrosine-catalyzed recombinase family (17). However, the integrases from lambda phage and the closely related phage HK022 have cofactor requirements that hamper their use in eukaryotic cells (11, 15).
Some phage integrases are members of the unrelated serine-catalyzed family of recombinases (24) and are autonomous with no cofactor requirements, which makes them potentially ideal for use in foreign host environments, such as mammalian cells. The integrase encoded by phage
C31 of Streptomyces spp. (12, 20) requires no cofactors (27). We have shown that the
C31 integrase works well in human and mouse cells (9, 30), mediating efficient intramolecular integration in transfected plasmid DNA and intermolecular integration into the chromosomes. Based on these results, we examined the related integrase from Streptomyces phage R4 (16) and found that it, too, works in human cells (19).
Although not highly similar to each other, the
C31 and R4 integrases are members of an identifiable subgroup of particularly long serine recombinases (16, 27) that contains another distantly related integrase, that of phage TP901-1 (6). We carried out this study to test the hypothesis that this integrase might also possess properties useful in engineering higher eukaryotic genomes. TP901-1 is a temperate bacteriophage that infects Lactococcus lactis subsp. cremoris and can be induced by UV light (2). After infection, the bacteriophage is able to lysogenize by integrating its 38.4-kb genome site specifically into the bacterial chromosome (7). It has been shown that the phage integrase encoded by the orf1 gene, a 425-bp region immediately upstream of orf1, and the attP sequence just downstream of orf1 are sufficient to catalyze integration into the chromosomal attB site in L. lactis (6). The goal of this investigation was to test whether the TP901-1 integrase could function outside its native host, in Escherichia coli and in human cells. We also defined minimal attP and attB sites recognized by the integrase and performed in vitro studies with the enzyme. This work introduces the TP901-1 integrase as a valuable tool for engineering DNA in the context of living mammalian cells.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
|
C31 integrase gene under the control of the bacterial lacZ promoter. The
C31 integrase gene was removed by digestion with BamHI and SpeI and was replaced by a BamHI-SpeI fragment from pTA-TPInt containing the TP901-1 integrase, generating the bacterial expression plasmid pTPInt (Fig. 1B). Plasmid pCMVInt (9) expresses the
C31 integrase in mammalian cells. The
C31 integrase gene was removed by digestion with BamHI and SpeI and replaced with a BamHI-SpeI fragment containing the TP901-1 integrase gene. This ligation generated the mammalian expression plasmid pCMV-TPInt, in which the cytomegalovirus immediate early promoter drives expression of the TP901-1 integrase (Fig. 1C). The TP901-1 integrase gene plus 422 bp of upstream sequence, including orfA, was amplified from L. lactis subsp. cremoris strain 901-1 by using the primers 5'GCCATTAGACTAGTGATATTCGGCAAAAAGTTTACCG3' and 5'CGAGTTGGGATCCCTCGCAATTAAGCGAGTTGG3'. The PCR product was ligated into vector pCR2.1, generating plasmid pTA-TPInt+orfA. The TPInt+orfA BamHI-SpeI fragment was then ligated into pInt as described above for pTPInt to generate the bacterial expression plasmid pTPInt+orfA. Plasmids for production of TP901-1 integrase. The TP901-1 integrase gene was PCR amplified from pCMV-TPInt with the primers 5'GTCTAGAAATTAAGAAGGAGATAATGACTAAGAAAGTAGCAATCTAT3' and 5'TGGATCCCAATTAAGCGAGTTGGAATTT3'. The PCR product was ligated into pCR2.1 (Invitrogen) to create plasmid pTA-TP901. The integrase gene was removed from pTA-TP901 by XbaI and BamHI digestion and ligated into XbaI- and BamHI-digested pET-11a (Novagen, Madison, Wis.), generating plasmid pET11-TP901. This plasmid contains the TP901-1 integrase gene located 24 bp downstream of the T7 promoter.
Bacterial intramolecular integration assay. E. coli DH10B was transformed with pTPInt, grown under kanamycin selection conditions, and made electrocompetent by standard protocols. The resulting strain DH-TPInt cells were then used for the intramolecular integration bacterial assay. Twenty nanograms of assay plasmid DNA was electroporated into DH-TPInt cells, allowed to recover, and then plated on Luria-Bertani (LB) broth plates containing 25 µg of chloramphenicol per ml, 60 µg of kanamycin per ml, and 50 µg of 5-bromo-4-chloro-3-indolyl-ß-D-galactopyranoside (X-Gal) per ml and grown at 37°C. Intramolecular integration resulted in excision of lacZ from the plasmid, producing a white colony. The frequency of intramolecular integration was calculated by dividing the number of white colonies by the total number of colonies and multiplying the quotient by 100.
Mammalian intramolecular integration assay. Human 293 cells (8) were grown in Dulbecco's modified Eagle medium supplemented with 9% fetal bovine serum and 1% penicillin/streptomycin to 60 to 80% confluence on 60-mm-diameter dishes. The cells were then transfected with 250 µg of pBB-B304-P333 assay plasmid or its reduced-size derivatives, 4 µg of pCMV-TPInt or salmon sperm DNA, and 12.75 µl of Lipofectamine (Gibco BRL). At 24 h after transfection, DNase I was added to the medium at a concentration of 50 U/ml in order to reduce the background of untransfected DNA. Plasmid DNA was isolated by the Hirt method (10) 72 h after transfection. A fraction of this DNA was then transformed into electrocompetent E. coli DH10B and plated on LB medium plates containing X-Gal and chloramphenicol, which selected for the assay plasmid but not the integrase plasmid. The intramolecular integration frequency was calculated by dividing the number of white colonies by the total number of colonies and multiplying the quotient by 100.
TP901-1 integrase production. E. coli strain BL21-SI (Invitrogen) containing plasmid pET11-TP901 was grown at 30°C (to promote solubility) in 100 ml of LB broth without NaCl but with 100 µg of ampicillin per ml to an optical density at 600 nm of 0.727. The culture was induced with 0.3 M NaCl and 1 mM isopropyl ß-D-thiogalactopyranoside (IPTG) and grown for another 5 h. In this strain, salt and IPTG induce expression of the T7 polymerase, which in turn transcribes the TP901-1 integrase gene. The cells were resuspended in phosphate-buffered saline (pH 7.4) containing 10% glycerol and sonicated for 45 s on ice. The protein concentration was measured by using the Bio-Rad protein assay (Bio-Rad, Hercules, Calif.). The protein extract was tested for TP901-1 integrase function in vitro with the lacZ intramolecular integration assay.
In vitro assays with TP901-1 integrase protein extract. To test the integrase function of the protein extract, various amounts of the extract were incubated with 500 ng of pBB-B304-P333 in binding buffer (2 mM Tris-HCl [pH 7.5], 10 mM NaCl, 0.1% glycerol, 10 µM EDTA) (27) in 20-µl reaction mixtures at 30°C for 16 h. The reaction mixtures were heat killed at 65°C for 20 min, and the DNA was purified by using a QIAquick PCR purification kit (Qiagen, Valencia, Calif.). Each reaction mixture was transformed into E. coli DH10B, and each transformation mixture was plated on LB medium with 100 µg of chloramphenicol per ml. The plates were incubated overnight at 37°C, and 24 h later the colonies were counted and the recombination frequency was calculated by dividing the number of white colonies by the total number of colonies and multiplying the quotient by 100. In both the time course and temperature optimization experiments we used 20-µl reaction mixtures containing 500 ng of pBB-B304-P333 in binding buffer with 29.6 µg of TP901-1 integrase protein extract. The reaction mixtures were incubated at 37°C, and the cells were heat killed by incubation at 65°C for 20 min at zero time and after 1, 2, 4, 8, and 15 h for the time course studies. Reaction mixtures were incubated at various temperatures for 16 h before the cells were heat killed in order to obtain a temperature curve.
| RESULTS |
|---|
|
|
|---|
The only TP901-1 phage sequences required for integration into the L. lactis genome are attP, the orf1 gene, and 425 bp of sequence upstream of orf1 (6). orf1 encodes the 485-amino-acid TP901-1 integrase and is located just upstream of attP. The amino-terminal 150 to 180 amino acids of the integrase show
40% similarity to the amino-terminal catalytic domain of recombinases of the serine-catalyzed family and include the catalytic serine 12 residue (6). The extended carboxy-terminal region exhibits little identity with known proteins and presumably includes the DNA recognition domain. The 425 bp upstream of orf1 probably contains the native promoter for the gene but also encodes a 60-amino-acid orfA reading frame that is not likely to be expressed (6).
To test the ability of orf1 and orf1 plus orfA to mediate intramolecular integration in E. coli, we constructed pTPInt and pTPInt+orfA, bacterial expression plasmids in which the TP901-1 integrase gene is under control of the E. coli lacZ promoter. These expression plasmids are compatible with the pBB-B304-P333 assay plasmid. We established these integrase expression plasmids in E. coli DH10B, generating the TP901-1 integrase expression strains DH-TPInt and DH-TPInt+orfA, respectively. We assayed the frequency of intramolecular integration of the assay plasmid pBB-B304-P333 in the DH-TPInt+orfA and DH-TPInt bacterial strains, calculating the frequency of intramolecular integration by counting white bacterial colonies on X-Gal plates. In both strains, the integrative recombination frequency was >99% (Fig. 2). Restriction analysis of DNA purified from white bacterial colonies showed a digestion pattern consistent with a precise integration reaction between attB and attP, resulting in excision of the lacZ gene. This result indicated that the TP901-1 integrase was capable of functioning efficiently in E. coli and that the integrase protein was sufficient to carry out the integrative recombination reaction without Lactococcus-specific cofactors. The 425-bp upstream region was unnecessary, presumably because in our E. coli expression plasmid the lacZ promoter drove expression of the integrase and eliminated any dependence on the native promoter. Based on this result, the rest of our experiments were carried out with the TP901-1 integrase orf1 gene only, without the upstream sequences.
Minimal attB and attP sites required for efficient TP901-1 integrase activity. After demonstrating that the TP901-1 integrase functioned efficiently in E. coli, we wanted to determine the minimal att sites recognized by the enzyme. We first analyzed the ability of the TP901-1 integrase to catalyze integrative recombination between symmetrically shortened attB sites and a 333-bp attP. As shown in Fig. 2, TP901-1 integrase efficiently recombined shortened attB sites ranging in length from 53 bp down to just 31 bp. The 31-bp attB still resulted in an integrative recombination frequency of >99% when it was paired with the full-length 333-bp attP (Fig. 2), as well as with a 56-bp attP (data not shown). However, reduction of the length of attB to 30 bp resulted in some loss of activity, and further shortening of attB to 29 bp resulted in reduction of the integrative recombination frequency to 82.7%.
Shortened attP sequences were tested in combination with attB53. attP sites that were 56 bp long, either asymmetrically disposed around the core (5) or centered on the core, retained full activity (Fig. 2). attP shortened to 50 bp still showed full integrative recombination activity, but shortening the attP site to 42 bp reduced the integrase activity to
85%. We concluded that at least in the context of intramolecular recombination in E. coli, an attB that was 31 bp long sufficed, while an attP that was approximately 50 bp long was needed for full activity.
In vitro studies. A crude protein extract containing TP901-1 integrase was made by lysis of E. coli containing plasmid pET11-TP901 expressing the integrase from the T7 promoter. When samples of the culture taken at different times after induction of integrase gene expression were boiled in loading buffer and electrophoresed on a denaturing gel, a band at approximately the correct size, 55 kDa, appeared, and the intensity increased with time. The same band was also visible in the crude extract. The measured concentration of the extract was 7.4 mg/ml. The activity of the extract was tested in vitro by using the lacZ intramolecular integration assay, with scoring after transformation of in vitro-reacted DNA into E. coli lacking integrase. As expected, the integrative recombination frequency increased as the amount of protein extract increased. A maximum recombination frequency of 99% was obtained when 59.2 µg of protein was used. The postrecombination attR sites of four white colonies were sequenced to ensure that the TP901-1 integrase was completing the correct site-specific integration reaction, and all four colonies had the expected DNA sequence for a precise integrase-mediated event between attB and attP.
TP901-1 integrase showed good stability in vitro over time, with the integrative recombination efficiency increasing over 15 h (Fig. 3A). Between 2 and 4 h, the percentage of recombination in vitro increased from 6 to 36%; this was the interval with the largest recombination increase in the experiment. By 15 h, the percentage of in vitro recombination was 90%. The control reaction mixtures containing a crude protein extract made with the pET11 backbone in E. coli BL21-SI showed no integrative recombination in the in vitro assay.
|
TP901-1 integrase catalyzes intramolecular integration in human cells. The ability of TP901-1 integrase to function in E. coli and in vitro in the absence of added cofactors created the possibility that the enzyme could also function in mammalian cells. We constructed pCMV-TPInt (Fig. 1C) to place the integrase gene under expression of a promoter active in mammalian cells. To analyze TP901-1 integrase function in mammalian cells, the intramolecular integration assay plasmid pBB-B304-P333 (Fig. 1A) and plasmid pCMV-TPInt were cotransfected into human 293 cells. Seventy-two hours after transfection, plasmid DNA was extracted, transformed into E. coli DH10B lacking integrase activity, and spread on X-Gal indicator plates to determine whether intramolecular integration had occurred in the mammalian environment. The frequency of intramolecular integration in the mammalian cells was determined by counting white bacterial colonies.
As shown in Table 1, in the absence of the integrase plasmid, a background of 1.9% white colonies was obtained for a plasmid carrying full-length attB and attP sites. These colonies harbored plasmids that contained nonspecific deletions and other mutations engendered by the transfection process (14). Cotransfection of pBB-B304-P333 and the integrase expression plasmid yielded white colonies at a frequency of 23.2% (Table 1). PCR analysis of plasmid DNA extracted from 66 white colonies showed that >95% of the samples represented correct site-specific integration events, as shown by amplification of an expected 623-bp fragment containing attR. DNA sequencing of four junctions confirmed that the attB-attP recombination reaction was site specific and perfect to the base. Restriction analysis of the 66 plasmid samples showed that the 95% that underwent integrative recombination had no concurrent rearrangements. The remaining 5% of the events represented rearrangements of the assay plasmid, corresponding to the transfectional mutation rate observed when pBBBP-type plasmids were transfected without pCMV-TPInt. Controls were included to provide assurance that the integrative recombination events occurred in the human cells, not in E. coli. Direct transformation of pBB-B304-P333 into DH10B failed to produce white colonies. Likewise, transformation of pBB-B304-P333 plus pCMV-TPInt directly into DH10B produced negligible white colonies. In addition, a PCR was performed with the plasmid DNA extracted from the human cells before transformation into E. coli, and the 623-bp fragment diagnostic of site-specific integration was readily detected.
|
| DISCUSSION |
|---|
|
|
|---|
By using an intramolecular integration assay in E. coli, we characterized minimal recognition sites for the enzyme that were 50 bp long for attP and just 31 bp long for attB. These sites are somewhat smaller than the minimal 56-bp attP site (5) and 43-bp attB site (4) previously reported. In the previous cases, the minimal att sites were asymmetrically disposed about the 5-bp core, with most of the sequence upstream of the core, whereas the minimal att sites found in this study are centered on the 5-bp core. Numerous paired direct and inverted repeats have been identified within attP (5, 7) and attB (4) that may be involved in the integrase binding and recombination function. Our minimal attB sequence includes the B2 and B3 repeats (4), although only a single copy of each repeat is present (Fig. 2). Our minimal attP site includes the P1, R4, R5, and R6 repeats (5) (Fig. 2), although again only one copy of each repeat, not the pair, is present. These reduced sites function as well as full-length sites that are over 300 bp long in both E. coli (Fig. 2) and human cells (Table 1). The significance of the repeats is therefore unclear. The second copy present in the full-size att sites might provide enhancement of recombination that is undetectable with our assay because it is fully saturated.
The minimal attB31 sequence contains the C17 and A25 bases that have been shown to both reduce in vitro binding of the integrase and reduce recombination in E. coli (4). The TP901-1 integrase binds with similar affinity in vitro to attB and attP (4), as is also the case for the
C31 integrase (28), another member of the extended serine recombinase group. By mutating each base in the attB 5-bp core, it was revealed that only the first two bases are important (4). This result suggested that there is a 2-bp overlap region between attB and attP during recombination, as has been observed for other members of the serine recombinase family (24).
Our major interest in the TP901-1 integrase is in its potential as a tool for making directed genomic rearrangements in eukaryotic cells. Our in vitro studies with this enzyme reinforce the hypothesis that it possesses good reaction kinetics and stability and works well at 37°C (Fig. 3), making it eligible for use in mammalian cells. The small size of the att sites is consistent with a lack of cofactor requirements, which is also a highly desirable feature because it simplifies use of the enzyme in foreign hosts. The efficient function of the enzyme in vitro and in E. coli documented here and recently by other workers (4) and the positive results for human cells reported here suggest that the TP901-1 integrase should function in most cellular environments in a wide range of species. The enzyme joins the other two integrases of the extended serine recombinase family,
C31 (9, 26, 30) and R4 (19), in this respect. The intramolecular integration activity of the TP901-1 integrase in human cells is within twofold of the activities measured for the
C31 (9) and R4 (19) integrases in similar assays.
This study demonstrated efficient function of the enzyme in an intramolecular integration reaction in human cells (Table 1). This type of reaction is useful for creating chromosome rearrangements, such as deletions. We are currently investigating the ability of the enzyme to carry out such reactions in a mammalian chromosomal context. Another major area of interest is the enzyme's ability to carry out intermolecular integration reactions. The enzyme was designed to carry out such reactions in the normal phage life cycle, mediating integration between an incoming phage genome and the host bacterial genome. Plasmid integration vectors carrying attP mediate intermolecular integration into the native host attB site at a frequency of
20% (6). This integration reaction is largely unidirectional. A second phage protein, an excisionase encoded by orf7, is required to bring about an efficient reaction between attR and attL, which is required for phage excision (3). To maximize the integration reaction, the excisionase is simply left out.
It may be possible to use the TP901-1 integrase as an integration tool targeted to inserted wild-type att sites. We have obtained evidence for integration of a plasmid containing the TP901-1 attB site into a TP901-1 attP site that was placed into the genome of human 293 cells (Stoll and Calos, unpublished results). The TP901-1 integrase is therefore capable of intermolecular integration into chromosomes in the human cell environment. As well, the short length of the att sites makes feasible the idea of using native chromosomal sequences resembling the att sites as integration targets. Such pseudo-recombination binding sites exist in the case of the Cre recombinase (29) and the
C31 and R4 integrases (19, 30). The sequences of the att sites recognized by the TP901-1 integrase differ from those recognized by these other two integrases and offer additional points of entry into the genome, especially in AT-rich regions. It may be possible to enhance the reactivity of the enzyme with chromosomal sequences and to improve its affinity for particular native sequences by DNA shuffling (25) of the integrase gene, as has been shown for the
C31 integrase (23). If so, the TP901-1 integrase may become an important tool for engineering the genomes of higher living cells.
| ACKNOWLEDGMENTS |
|---|
S.M.S. and D.S.G. were supported by a graduate training grant from NIH. This work was supported by NIH grant DK58187 to M.P.C.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
C31. J. Mol. Biol. 222:897-908.[CrossRef][Medline]
C31 attachment site. Nucleic Acids Res. 19:5187-5189.
C31 site-specific recombination system. Mol. Genet. Genomics 265:1031-1038.[CrossRef][Medline]
C31. Mol. Microbiol. 38:232-241.[CrossRef][Medline]
C31 integrase. Mol. Cell. Biol. 21:3926-3934.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Appl. Environ. Microbiol. | Infect. Immun. | Eukaryot. Cell |
|---|---|---|
| Mol. Cell. Biol. | J. Virol. | Microbiol. Mol. Biol. Rev. |
| ALL ASM JOURNALS |