Previous Article | Next Article ![]()
Journal of Bacteriology, November 2004, p. 7745-7753, Vol. 186, No. 22
0021-9193/04/$08.00+0 DOI: 10.1128/JB.186.22.7745-7753.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Microbiology, University of Regensburg, Regensburg, Germany,1 Laboratory of Microbiology, Wageningen University, Wageningen, The Netherlands,2 Institut Pasteur, Paris, France3
Received 28 May 2004/ Accepted 16 August 2004
|
|
|---|
|
|
|---|
Apart from the distinct morphology, the genome organization of viruses of hyperthermophilic crenarchaeaota also appears to be unique. More than 90% of all open reading frames (ORFs) of the sequenced genomes still have not been assigned functions and share no homology to sequences in public databases (reviewed in reference 13).
Viruses of Crenarchaeaota are also exceptional with respect to virus-host interactions. The vast majority of dsDNA viruses of bacteria and euryarchaea eventually kill the host cell during release of progeny virions; in contrast, crenarchaeal viruses establish a productive infection without killing or lysing the cell, a so-called carrier state. In the case of the fusellovirus SSV1 of Sulfolobus, induction of virus production in the lysogenic strain has been observed (8, 20); however, again, this does not result in cell lysis. In all cases, the infected cells continue to replicate themselves as well as the virus, although cellular growth is slowed down, suggesting modification of some specific cell functions. Modification of cellular transcription as a result of infection is well studied in bacterial virus-host systems (9, 10).
Little is known about mechanisms and controls of viral gene expression in hyperthermophilic archaea. In several cases controls of gene expression have been observed but not elucidated (reviewed in reference 15). The only systematic studies on transcription were conducted on the virus SSV1 of Sulfolobus; however, these studies followed UV induction of virus production in the SSV1 lysogene rather than the infection cycle. By Northern analysis eight constitutive transcripts and one UV-inducible transcript were mapped and in nuclease S1 mapping were identified promoter regions (17, 18) and termination sites (19). The results enabled identification of archaeal promoter sequences. Their similarity to TATA box-containing promoters of the eukaryal RNA polymerase II confirmed original observations of the resemblance of archaeal transcription machinery with the eukaryal RNA polymerase II apparatus (16, 27). Subsequent biochemical studies provided strong arguments for this notion (reviewed in references 1, 22, and 26).
Here we present results of the first systematic study of transcription of viruses of crenarchaeota over the replication cycle. As a model were two closely related rudiviruses of Sulfolobus, SIRV1 (variant VIII) and SIRV2 (12). Linear dsDNA genomes of the two viruses, 32,312 and 35,502 bp long, have been sequenced and their nucleotide sequences are available in the EMBL and GenBank data libraries under accession numbers AJ344259 and AJ414696 (5, 11). The genomes are highly similar in their organization. They consist of blocks of well-conserved sequences, 74 to 94% identical, separated by sequences with low or no similarity. The genomes have covalently closed ends and carry inverted terminal repeats (ITRs) of 2,029 and 1,628 bp, respectively, which contain multiple direct repeats. G+C content of both viral genomes is extremely low, 25%, compared with 37% for the genome of Sulfolobus solfataricus (21).
|
|
|---|
DNA and oligonucleotides. Viral DNA was isolated from the particles by treating them for 30 min with 1.0% n-laurylsarcosinate at room temperature, followed by phenol chloroform extraction. Virus-specific dsDNA probes used in Northern hybridizations were generated by PCR. Sequences of the primer pairs used as well as of the oligonucleotides used as single-stranded probes are provided in the supplemental material (see Tables S1 and S2).
RNA isolation. Cells of S. islandicus LAL14/1 or REN2H1 were grown to an optical density at 600 nm of 0.2 (108 cells/ml) and infected with SIRV1 or SIRV2 at a multiplicity of infection (MOI) of 5. Following the time course of infection, 20-ml aliquots were taken at 0, 30, 60, 120, 180, and 240 min postinfection (p.i.). Cells were pelleted, and the RNA was extracted using the RNeasy system from QIAGEN. Thereby required complete homogenization of the samples was reached by passing them through a Qiashredder column (QIAGEN). To remove DNA contaminations, the RNA was treated with DNase I (QIAGEN) during the purification procedure directly on the column. Concentration and purity of the RNA were determined by the absorbances at 260 and 280 nm.
Northern blotting and hybridizations. RNA samples were analyzed by denaturing electrophoresis through 1.2% agarose gels containing 0.66% formaldehyde. The running buffer contained 20 mM morpholinepropanesulfonic acid, 5 mM sodium acetate, and 1 mM EDTA. After electrophoresis for 2.5 h at 120 V, gels were soaked in 0.05 N sodium hydroxide for 20 min, rinsed in RNase-free water, soaked in 20x SSC (0.3 M Na3-citrate, 3 M NaCl; pH 7.0), and then transferred to Biodyne B membranes (Pall) by capillary transfer with 20x SSC. RNA was fixed to the membrane by baking for 30 min at 80°C. Hybridizations with dsDNA probes were done in hybridization buffer containing 50% formamide, 5x SSC, 5x Denhardt's reagent (50x Denhardt's reagent is 5 g of Ficoll type 400, 5 g of polyvinylpyrrolidone, 5 g of bovine serum albumin, and H2O to 500 ml), and denatured, fragmented salmon sperm DNA (100 µg/ml) at 42°C overnight. Hybridizations with oligonucleotides were done in hybridization buffer containing 5x SSC, 5x Denhardt's reagent, 0.05 M sodium phosphate (pH 6.5), 0.1% sodium dodecyl sulfate (SDS), and denatured, fragmented salmon sperm DNA (100 µg/ml) at 42°C overnight. Membranes were washed two times for 5 min in washing buffer (0.2x SSC, 0.1% SDS) at room temperature, two times for 15 min in washing buffer at 60°C, and finally rinsed in 2x SSC. For the oligonucleotides, washing of the membranes was done for 15 min in washing buffer (1x SSC, 0.1% SDS) at room temperature, two times for 20 min in washing buffer at 55°C, and finally rinsed in 2x SSC. Probes were 32P labeled with the Megaprime DNA labeling system (Amersham) (dsDNA probes) or with T4 polynucleotide kinase (oligonucleotides).
Primer extension analysis. Transcription start sites were determined by using the reverse transcription system of Promega according to a modified protocol. Primer annealing was done in a separate reaction for 10 min at 70°C and then slowly cooled down to room temperature. The components for the reverse transcription reaction were added, and the primer extension was exceeded for 30 min at 42°C. Reactions were terminated by adding 250 µl of 10 mM Tris, pH 8.0, 10 mM EDTA, 750 mM NaCl, and 1% SDS. Afterwards, 10 µg of glycogen was added and DNA was extracted with phenol-chloroform. The used oligonucleotides are listed in the supplemental material (see Table S3).
|
|
|---|
![]() View larger version (20K): [in a new window] |
FIG. 1. ORF and transcription maps of SIRV1 (variant VIII) and SIRV2. ORF maps were modified with permission from those of Peng et al. (11). The numbers used to identify ORFs refer to the number of codons. Specific ORFs for either SIRV1 (variant VIII) or SIRV2 are highlighted in pink. Genes with identified or putative functions are indicated as follows: cp, coat protein; Hjc, Holliday junction resolvase; dUTPase, deoxyuridine triphospatase; gtf, glycosyl transferase. dsDNA probes used in the Northern hybridization are represented by bars and are numbered according to the information in Table 1. Single-stranded DNA probes are represented by dots. The transcription maps are surrounded by a frame.
|
The same amount of RNA (5 µg) prepared from virus-infected cells at different time points p.i. was separated by agarose gel electrophoresis and blotted onto nylon membranes for Northern hybridization. In hybridization experiments with whole-genome probes, numerous virus-specific transcripts were detected which were difficult to interpret and assign to specific ORFs (data not shown). For more precise identification of transcripts in hybridization experiments we used fragments of the viral genomes, produced by PCR amplification. In total, 11 fragments of the SIRV1 genome and 12 fragments of the SIRV2 genome were produced. Coordinates of terminal nucleotides of these fragments in the complete nucleotide sequences of the genomes are presented in Table 1. In the same table, the results of 23 Northern hybridization experiments are summarized, indicating sizes of RNA detected with different DNA probes at different time intervals p.i. All results were evaluated, and putative ORFs, transcription of which should have caused detected signals, are also indicated in Table 1. In several cases we observed very weak signals from RNAs longer than 3 kilonucleotides (knt). Because in these cases no corresponding operons could be identified, we did not consider them in evaluation of the data. In some cases, there were ambiguities in interpretation of data. These were resolved by designing single-stranded ORF-specific oligonucleotide probes and using them in Northern hybridization experiments. In total, 27 hybridizations were conduced with ORF-specific single-stranded probes, the results of which are also summarized in Table 1.
|
View this table: [in a new window] |
TABLE 1. Transcripts from genomes of the viruses SIRV1 and SIRV2 identified by Northern hybridization with double-stranded and single-stranded DNA probesa
|
![]() View larger version (49K): [in a new window] |
FIG. 2. Northern blot hybridization of infected cell RNA to specific SIRV1 and SIRV2 genome probes. RNA was extracted from cells at various times after infection. Lane 1, uninfected cell RNA; lanes 2 to 6, 30 min, 1 h, 2 h, 3 h, and 4 h p.i., respectively. Double-stranded probes were following: SIRV1 probe 3, covering ORFs 306 and 119 (A); SIRV1 probe 1, covering ORFs 55a and 90a (B); SIRV1 probe 11, covering ORFs 90c and 98 (C); SIRV2 probe 5, covering ORFs 134 and 355 (D); hybridization with an oligonucleotide complementary to a portion of SIRV2 ORF 55 (E).
|
Transcripts were detected over the whole genomes, and all identified ORFs of both viruses, except those mentioned below, were transcribed. They included also the ORFs unique to one or the other of the highly similar genomes. However, four SIRV2-specific ORFs, ORF156, ORF90, ORF116, and ORF76, were apparently not transcribed. By using either the above-mentioned probes or ORF-specific oligonucleotides, no transcription was detected for SIRV1 and SIRV2 ORF119c, ORF59, and ORF 91 and SIRV1 ORF 101/SIRV2 ORF103c.
The earliest time point when any transcript could be clearly detected was 30 min p.i. However, at this time point most of the ORFs of both genomes appeared to be transcribed. Transcription started nearly simultaneously from 21 sites on the genome of SIRV1 and from 29 sites in the genome of SIRV2. No strand specificity was observed in early transcription: both strands were transcribed with nearly equal efficiency. Transcripts were often large enough to span several ORFs, suggesting many ORFs were clustered into operons. Relative abundance of each transcript could be estimated only approximately; moreover, it was estimated only in those cases when compared signals were revealed in the same hybridization experiment. Generally, within 30 min all transcripts were relatively weak and reached higher levels 1 h p.i. Later, decrease of signals from some transcripts was observed, e.g., from SIRV1 ORF56 and the two transcripts starting from the leftward ITR of SIRV1 (Fig. 2B). The longer of these transcripts could not be detected at later stages of the replication cycle.
Very few transcripts appeared later, in addition to those observed 30 min p.i. These included monocistronic transcripts of SIRV1/SIRV2 ORF399 and those starting from SIRV1 ORF90b in the rightward ITR. In the latter case the signal was relatively weak, and we cannot completely exclude the possibility that a small amount of the transcript was also present earlier in the infection cycle. Moreover, the transcript starting from the identical site on the leftward ITR appeared already at 30 min p.i. However, the monocistronic transcript of ORF134 (coat protein gene) was clearly absent among early transcripts of both viruses (shown in Fig. 2D for SIRV2). In the host of SIRV2 it appeared 2 h p.i., and in SIRV1 host cells it appeared at 3 h p.i., in both cases close to the eclipse period of the infection cycle. It is noteworthy that results of hybridization with corresponding probes (probe 5) suggested that the coat protein gene of both viruses was transcribed in the context of the operon already at 30 min p.i. (Table 1 and Fig. 2D; data not shown for SIRV1).
The only ORF which apparently was transcribed exclusively late in the infection cycle was SIRV2 ORF55/SIRV1 ORF55c. Appearance of a weak transcript of this ORF coincided with the late appearance of the monocistronic transcript of the coat protein gene (ORF134; Fig. 2E). It is noteworthy that transcription of ORF55/ORF55c starts immediately downstream of the coat protein gene and proceeds in the other direction. Thus, the RNA would be complementary to the intergenic region of the early polycistronic transcript of ORF134-ORF335.
Results of evaluation of all hybridization experiments are schematically represented as transcription maps of the two viruses in Fig. 1. The majority of identified transcripts, shown in Fig. 1, could be terminated at T-rich sequences which were found downstream of stop codons of translation of the last (or only) ORF in an operon. However, in extremely A+T-rich genomes of SIRV1 and SIRV2 (75% A+T), it is not justified to speculate about putative functions of T-rich segments.
Primer extension and sequence of promoters. To confirm transcription start sites and identify promoters, primer extension experiments were conducted. RNA isolated from SIRV1-infected cells 3 h p.i. was used in reverse transcription experiments. At this time, transcripts from all start sites were available in host cells. The 5' termini for 13 transcripts from the virus SIRV1 were precisely mapped by comparing the primer extension product with a sequencing ladder generated with the same primer. Some typical examples are shown in Fig. 3, and the results are summarized in Fig. 4. Transcripts were termed according to the first (or only) ORF which they encompass, and those for which 5' termini have been mapped are in italics. Comparison of the sequences around transcriptional initiation sites revealed the presence of three conserved sequence elements, as indicated in Fig. 4: (i) an AT-rich pentanucleotide sequence centered 20 to 25 nucleotides upstream of the transcription start site, presumably corresponding to the TATA box; (ii) two purines, presumably representing the transcription factor B-responsive element (BRE), immediately upstream of the potential TATA box; and (iii) a trinucleotide GTC or, more generally, a pentanucleotide A/TGTCA/T, immediately downstream of presumable TATA boxes in 70% of cases. In four out of five putative promoters of SIRV1 which have not been mapped, these three elements could all be identified (Fig. 4). Putative promoters of SIRV1 ORFs 179, 562, and 112 could not be identified by sequence analysis; due to a very high AT content (75%) in the regions upstream of these ORFs, and also the absence of the trinucleotide GTC, it is difficult to identify putative TATA boxes.
![]() View larger version (99K): [in a new window] |
FIG. 3. Mapping of the initiation sites of four transcripts of the SIRV1 virus. The first (or only) ORF of the operon is indicated. Primer extension analysis was carried out using sequence-specific primers (see Materials and Methods). Each of the runoff products (PE) was electrophoresed alongside its DNA sequencing reactions (A, T, C, and G), also carried out with the corresponding primer. The promoter sequences are written in letters next to the autoradiographs. The TATA box, BRE, and GTC feature are highlighted in boxes.
|
![]() View larger version (67K): [in a new window] |
FIG. 4. DNA sequences of the promoters of SIRV1and SIRV2. Promoters mapped by primer extension are shown in italics. Other promoters were determined by similarities to them. The respective positions of the TATA-box-, BRE-, and GTC elements are shown in pink, orange, and green, respectively. In case of mapped promoters the transcription initiation site is indicated by an asterisk.
|
|
|
|---|
The rather uniform pattern of transcription in the course of the whole replication cycle fits the characteristics of interactions of the rudiviruses SIRV1 and SIRV2 with their hosts and their presence in host cells in a stable carrier state. This apparently does not require as much transcription control as in, for example, the case of temperate bacteriophages.
The two sites where we presume operation of transcriptional controls are rather long intergenic regions between ORF56 and ORF131(b) and between ORF158b and ORF134. From here, in addition to polycistronic RNAs, later in the infection cycle single gene transcripts are initiated. Termination of transcription downstream of the coat protein gene could be linked with transcription of a small ORF55 (ORF55c) from the counter strand (Fig. 2E), which can function as an antisense RNA. Late appearance of the transcripts of ORF55 (ORF55c) and the coat protein gene coincides with an increased transcription of the latter, suggesting additional positive transcription regulation. The suggestion is supported by the presence of multiple inverted and direct repeats in the long noncoding region upstream of ORF134. It seems reasonable to induce expression of the coat protein at the final stage of the viral replication cycle close to the time of appearance of the first virions, the eclipse. Accumulation in the cell of the coat protein could induce the assembly of rather simply organized nonenveloped virions of rudiviruses, the body of which is just a nucleoprotein consisting of the linear dsDNA and subunits of the coat protein (12).
An apparent candidate for the role of transcription regulator is the putative protein encoded by ORF56. The transcript from this gene is abundant throughout the complete infection cycle (Fig. 2A). The small protein reveals a helix-turn-helix motif characteristic of many DNA binding proteins.
For translation initiation, members of Sulfolobales are known to use two different mechanisms. On distal cistrons of polycistronic mRNAs, they mostly operate via Shine-Dalgarno-dependent initiation resembling the system prevalent in extant bacteria, whereas on monocistronic mRNAs and on opening cistrons of polycistronic mRNA they use "leaderless" initiation, reminiscent of the eukaryotic pathway (3, 23). The picture is slightly different for genomes of the rudiviruses, at least of SIRV1, for which most transcription starts have been mapped. Potential Shine-Dalgarno sequences can be identified upstream of 50% of mapped or putative transcription start sites.
Transcription starts of 13 transcripts of the genome of SIRV1 have been determined by primer extension. This comprises 52% of all identified transcripts. In all cases, 20 to 25 nucleotides upstream of the transcription start potential TATA boxes have been identified, as well as an adjacent purine-rich region, which most likely corresponds to the BRE, the transcription factor B binding site identified in all well-characterized Sulfolobus promoters (2). Location and consensus sequence of these promoter elements in the genomes of the two rudiviruses are very similar to their counterparts in the Sulfolobus hosts (2). In 9 out of 13 mapped SIRV1 promoters (about 70%) and in 4 out of 5 putative promoters (80%), a pentanucleotide A/TGTCA/T is present immediately downstream of the TATA box (Fig. 4). This pentanucleotide is present also in most of the predicted promoters of SIRV2 (Fig. 4). Conservation of such a motif has never been reported for promoters of the Sulfolobus hosts. However, we have detected it in two of seven mapped promoters of the virus SSV1 of Sulfolobus (18, 26). We infer that the GTC motif is a rudivirus-specific cis-regulatory element. This suggestion is strongly supported by the presence of the same trinucleotide in most of the putative promoters of yet another rudivirus, ARV1, isolated recently from Italian hot springs (M. Häring, G. Westergaard, R. A. Garrett, and D. Prangishvili, unpublished data). The GTC-containing unusual promoters can be recognized by specific virus- or host-encoded factors. The presence of the trinucleotide GTC in the promoter apparently does not regulate transcription temporally: transcription started simultaneously from GTC-containing promoters and promoters without it (Fig. 1 and 4). Also, no correlation was observed between the presence of GTC in the promoter and the abundance of transcripts.
Several strategies known from prokaryotic and eukaryal viruses could explain the early transcription of genomes of the rudiviruses SIRV1 and SIRV2 and the specific nature of most of the viral promoters: (i) the viruses inject RNA polymerase and/or a transcription regulatory protein at the time of infection; (ii) viral promoters are recognized by cellular factors and as such modulate gene expression; (iii) viral infection is a general transcription stimulus, e.g., up-regulating the activity of the cellular RNA polymerase; (iv) viral genes are transcribed, at least in part, by the basal transcription system of the host, although expression of a viral regulator stimulates preferential transcription of certain viral genes. Understanding the strategies used by the rudiviruses will contribute to our knowledge of mechanisms of transcription-level gene regulation in archaea and will shed light on the evolution and diversity of transcriptional control in general.
The work was supported by grants from the Deutsche Forschungsgemeinschaft (PR 663/1-1 and PR 663/1-2).
Supplemental material for this article may be found at http://jb.asm.org/. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»