Previous Article | Next Article ![]()
Journal of Bacteriology, December 2007, p. 8914-8921, Vol. 189, No. 24
0021-9193/07/$08.00+0 doi:10.1128/JB.00108-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Anders F. Andersson,2,
Cecilia Jernberg,2
Katja Schirwitz,3
Cristofer Enroth,3,
Margareta Krabbe,4 and
Lars Engstrand1,2*
Department of Microbiology, Cell and Tumor Biology, Karolinska Institutet, 171 77 Stockholm, Sweden,1 Swedish Institute for Infectious Disease Control, 171 82 Solna, Sweden,2 EMBL Hamburg Outstation, Notkestrasse 85, 22303 Hamburg, Germany,3 Biology Education Center, Uppsala University, Box 592, 751 24 Uppsala, Sweden4
Received 19 January 2007/ Accepted 4 September 2007
|
|
|---|
|
|
|---|
![]() View larger version (15K): [in a new window] |
FIG. 1. Schematic representation of the genetic organization of the R.HpyAIV-M.HpyAIV genes in strains 26695, J99, and HPAG1. The M.HpyAIV genes are closely homologous in the three strains. The R.HpyAIV gene in HPAG1, HPAG_1299, is truncated, and the adjacent gene, HPAG_1298, is homologous to the 3' end of the genes in strains 26695 and J99. The lined area shows the nucleotide position where the adenine repeat was found to be variable in active and inactive strains (start at 601 bp).
|
|
|
|---|
DNA and RNA techniques. DNA was prepared using a DNeasy kit (Qiagen, Hilden, Germany) according to the manufacturer's protocol. Primers HP1352F, HP1352R, 1351RTF, and 1351RTR (Table 1) were used to detect the MTase and REase genes. PCR was performed using DyNAzyme Taq polymerase and the corresponding buffers (Finnzymes, Espoo, Finland) and was carried out in 30-µl reaction mixtures using the following cycling conditions: 94°C for 5 min followed by 25 cycles at 94°C for 1 min, 55°C for 1 min, and 72°C for 1 min, with a final extension period of 5 min at 72°C. To determine the GANTC modification status of the different H. pylori strains, digestion by HinfI, an R.HpyAIV isoenzyme (10 U; New England Biolabs, Beverly, MA), was performed overnight using 1 µg of isolated genomic DNA.
|
View this table: [in a new window] |
TABLE 1. Primers used in the present study
|
The qPCR reactions were performed with an ABI PRISM 7500 sequence detection system (ABI) using a SYBR green kit (Applied Biosystems). The primers for the reactions are listed in Table 1. Seven genes fulfilling the criteria of containing GANTC sites upstream of the ORFs and having GANTC sites present in both strain 26695 and strain J99 were chosen. The following genes were analyzed: cag13 (HP0534), cag16 (HP0537), cag21 (HP0542), katA, HP0835, HP0922, and HP1564. As an endogenous control, the 16S rRNA gene was used after verification that the expression of this gene was the same in the wild type as in the mutant strain. The amplification efficiencies for the primers were calculated from a standard curve using a 10-fold dilution series of cDNA. The qPCR reactions were performed according to standard protocols. Calculation of the relative gene expression was done using the Q-gene software (24) (Biotechniques software library, http://www.biotechniques.com).
Sequencing. Primers used for sequencing are shown in Table 1. PCR products amplified with primers targeting the flanking sequences of the M.HpyAIV gene and purified using GFX DNA and a gel band purification kit (Amersham, Buckinghamshire, United Kingdom) were used as templates in the sequencing reaction. Cycle sequencing was performed using BigDye Terminator v3.1 (Applied Biosystems, Foster City, CA), and the products were separated on an ABI Prism 3100 genetic analyzer. All sequences were analyzed and aligned using Vector NTI, suite 9.0.0 (Invitrogen).
Construction of M.HpyAIV gene insertion mutants.
PCR products of the M.HpyAIV gene were amplified from 26695 DNA by use of primers HP1352F and HP1352R. The PCR amplicons were purified using GFX DNA and a gel band purification kit (Amersham) and cloned into pGEM-T Easy vector (Promega). The M.HpyAIV gene insert was then cleaved out of the pGEM-T Easy vector with NotI and cloned into a new vector, pGEM-5zf(+), which was digested with NotI to generate one insertion site. To avoid self-ligation, the NotI-linearized pGEM-5zf(+) vector (Promega) was treated with alkaline phosphatase from calf intestinal mucosa (Promega). The new vector construction was transformed into competent Escherichia coli cells (DH5
) by heat shock, and the insertion of the correct fragment was confirmed by PCR. HindIII (New England Biolabs) was used to generate one restriction site in the M.HpyAIV gene situated in pGEM-5zf(+) and to cleave out the kanamycin cassette gene from pJMK30 (13). The marker gene was purified, cloned into the linearized vector, and transformed to competent E. coli cells (DH5
) by heat shock, and the transformants were selected on LB agar plates containing 30 µg/ml kanamycin. Plasmids were isolated with a Qiagen plasmid mini or Midi kit according to manufacturer's protocols. To obtain H. pylori insertion mutants, plasmids were electroporated to confluently grown bacteria by use of standard methods and selected on kanamycin-containing GC agar plates (30 µg/ml). PCR and sequencing were used to confirm that the M.HpyAIV::Km construct was inserted into the correct location.
Measurement of interleukin-8 induction in AGS cells. AGS cells, from a gastric cancer cell line, were infected with either H. pylori 26695 wild-type or M.HpyAIV gene mutant strains. Measurement of interleukin-8 secretion in the supernatant of H. pylori-infected AGS cells was performed as described earlier (25).
DNA-binding assay. The M.HpyAIV gene (HP1352) from strain 26695 was expressed and purified in E. coli as previously described (36). P32-labeled PCR fragments (0.32 pmol) containing one single GANTC site (109 bp, amplified from strain 26695 by use of primers EMSF and EMSR) (Table 1) were mixed with 1 µl NEB2 buffer (50 mM NaCl, 10 mM Tris-HCl, 10 mM MgCl2, 1 mM dithiothreitol, pH 7.9 [New England Biolabs], and 0, 10, 30, 80, 150, 200, or 500 nM purified M.HpyAIV protein). The reaction mixtures were incubated at room temperature for 30 min. The samples were analyzed using polyacrylamide gel electrophoresis (4% native polyacrylamide gel for 5 h, 200 V). The results were evaluated using a PhosphorImager (Molecular Dynamics Inc.).
Protection from restriction digestion by methylation. To investigate methylation and protection by the recombinant protein, we incubated 1 µg of a PCR fragment containing one GANTC site (778 bp, amplified with 1351GANTCF and 1351GANTCR) (Table 1) with NEB2 buffer, S-adenosylmethionine (New England BioLabs), and different M.HpyAIV concentrations (0, 200, 400, 800, and 1,200 nM) and performed incubation for 1 h at room temperature followed by protein inactivation at 95°C for 10 min. Samples were digested with HinfI overnight at 37°C. The quantities of digested and undigested PCR products were determined on a Bioanalyzer (Agilent, Palo Alto, CA) in duplicate.
Heat shock stress response. Bacteria were grown in liquid Brucella broth to an optical density at 600 nm of approximately 0.2. Aliquots of mutant and wild-type strains were transferred from cultures at 37 to 42°C. Viable counts were performed at different time points between 0 and 180 min after exposure. Samples from mutant and wild-type strains were analyzed in duplicate.
In silico genomic analysis. Genome sequences and gene annotations of strains 26695 and J99 were downloaded from NCBI. In-house-developed PERL scripts were used to search the intragenic, including start and stop codons, and intergenic regions for GANTC occurrences. For the construction of randomized 26695 genomes, the nucleotide order within intergenic regions was shuffled by performing 10,000 random pair-wise exchanges of nucleotides within the same intergenic region, whereas intragenic regions were randomized by performing 10,000 random pair-wise exchanges of codons encoding the same amino acid within the same gene, excluding start and stop codons.
The intergenic distribution of GANTC sites was analyzed by dividing each intergenic region containing (exactly) one GANTC site into 10-nucleotide (nt) nonoverlapping windows {w1, w2, ..., wn}, starting immediately upstream of the start codon. When a partial window remained, this was excluded, and if the GANTC was located within this, the intergenic region was excluded from the analysis. GANTC occurrences were then summarized for all windows {w1, w2, ..., wn}, over all intergenic regions, where n is the number of windows in the longest intergenic region. This distribution was then compared to those obtained after randomly positioning the GANTC within the same intergenic regions. Ten thousand such randomized distributions were created, and for each window a P value of GANTC enrichment was calculated as the proportion of the randomized distributions containing at least as many GANTC sites as the original distribution in that window. Functional classifications of different genes were obtained from the Pasteur Institute (http://genolist.pasteur.fr/PyloriGene/index.html).
|
|
|---|
To examine the methylation activity of M.HpyAIV, genomic DNA from the same strains mentioned above was digested with HinfI. Resistance or susceptibility to HinfI digestion indicates the presence or absence of an active MTase, respectively. The results showed that the M.HpyAIV gene was active in 42% (25/60) of the strains, while no activity was demonstrated for the remaining 58% (35/60). None of the strains lacking the M.HpyAIV gene as judged by the PCR test could be digested with HinfI, indicating a good correlation between the PCR test and the HinfI digestion test. The presence of the gene, however, did not directly correlate to activity, since 31% (11/35) of the strains had no M.HpyAIV activity, even though the M.HpyAIV gene was present. To identify possible genetic alterations that would explain the lack of methylation activity although the gene was present according to the PCR result, we sequenced the M.HpyAIV gene in 10 strains with inactive and 6 strains with active M.HpyAIV. In six of the inactive strains, one adenine residue was absent in a homopolymeric tract of adenine residues. In one of the inactive strains, an additional cytosine was present in a repeated poly(C) tract, as compared to the sequence of the gene in the strains with an active MTase. Both these genetic changes are predicted to result in the generation of premature translational stops, which in turn may result in truncated proteins. No genetic alterations which could explain the inactivity were found for 3 of the 10 analyzed strains.
The H. pylori strains used for PCR and activity tests originated from patients diagnosed with a range of gastroduodenal diseases (Table 2) . The presence of the active MTase could not with statistical significance be linked to a more severe disease development (Fisher's exact test; data not shown).
|
View this table: [in a new window] |
TABLE 2. Distribution of R.HpyAIV-M.HpyAIV genesa
|
Characterization of the M.HpyAIV gene knockout mutant. To investigate the function of M.HpyAIV, a knockout mutant was created from strain 26695. Isolated mRNA from wild-type and mutant bacteria grown to exponential phase was analyzed by RT-PCR. We were able to amplify M.HpyAIV and R.HpyAIV gene transcripts in the wild type but not in the M.HpyAIV gene mutant strain, which indicated a lack of transcription of both MTase and REase genes in the latter (data not shown). To determine the ability of M.HpyIVM to modify GANTC sites, HinfI was used for the digestion of genomic DNA isolated from the wild type and the M.HpyAIV gene mutant. As expected, HinfI was not able to cleave the genomic DNA of the wild type, but the M.HpyAIV gene mutant DNA was digested (data not shown). This showed that M.HpyAIV is active in H. pylori strain 26695 and is necessary for this site-specific methylation.
In vitro analysis of recombinant M.HpyAIV protein. To investigate the enzymatic activity of M.HpyAIV, the protein was purified from 26695 and analyzed in vitro. First, the DNA-binding capacity of the purified M.HpyAIV protein was studied in a gel shift assay (data not shown). At protein concentrations of 30 nM and above, gel retardation was obtained, indicating that the purified enzyme was bound to the DNA sample. To further assess the capability of M.HpyAIV to methylate GANTC sites, we analyzed the ability of the purified protein to protect DNA from digestion by HinfI in vitro (Fig. 2). Various concentrations of M.HpyAIV protein were incubated with a PCR fragment (778 bp) containing one GANTC site for 1 h followed by inactivation of the MTase and digestion with HinfI. Capillary electrophoresis using a Bioanalyzer (Agilent, Palo Alto, CA) was used to analyze the results. The increased M.HpyAIV concentration resulted in increased amounts of undigested PCR products, illustrating the capability of M.HpyAIV to protect GANTC sites from being digested in a concentration-dependent manner.
![]() View larger version (98K): [in a new window] |
FIG. 2. Purified M.HpyAIV protects a GANTC-containing DNA fragment from HinfI digestion. Increasing concentrations of M.HpyAIV protein incubated with a 778-bp PCR fragment containing one GANTC site and S-adenosylmethionine. HinfI digestion of the GANTC-containing DNA fragment resulted in two fragments of 540 bp and 238 bp. The increased amount of undigested PCR products as a consequence of an increased M.HpyAIV concentration illustrates the in vitro capability of M.HpyAIV to protect GANTC sites from digestion in a concentration-dependent manner. L, ladder (samples in duplicate with increasing amounts of M.HpyAIV added [0, 200, 400, 800, and 1,200 nM]); UC, uncut control.
|
We searched the distribution of GANTC sites, i.e., the potential regulatory sites in the two fully sequenced H. pylori genomes. In strain J99, 230 and 2,522 GANTC sites were located in inter- and intragenic regions, respectively. In strain 26995, the corresponding numbers were 231 and 2,508. This was compared with the numbers obtained in randomized 26695 genomes constructed by shuffling the nucleotide order within intergenic regions (thus preserving the nucleotide frequencies) and by performing pair-wise codon exchanges for randomly chosen codon pairs encoding the same amino acid within ORFs (thus preserving the protein sequences and codon frequencies). On average, 500 intergenic GANTC sites were found in each of the 100 randomized genomes (range, 446 to 558), and all of the genomes contained more sites than in the original 26695 strain (231 sites). The difference was less pronounced within genes, which showed 3,528 sites on average (range, 3,432 to 3,636), but also in this case, all genomes displayed more sites than the original (2,508).
We then investigated how GANTC sites were distributed within intergenic regions. For GANTC-containing intergenic regions upstream of a single gene (located between adjacent genes on the same strand), we found that the region proximal to the start codon was significantly enriched in GANTC (P < 0.005) (Fig. 3 and 4). Thus, GANTC sites seem to be avoided in intergenic regions, but when they do occur, they are situated close to the translational start codon more often than expected by chance. We also examined how many of the intergenic GANTC sites located upstream of ORFs were common for both J99 and 26695 sequenced genomes. Sixty of these genes were common for both strains, distributed between different functional groups (Table 3). The functional group with the highest frequency of GANTC sites in potential promoter regions (6% out of the total number of genes) was the group containing genes involved in cellular processes. Interestingly, in this group, 9 of 13 genes belonged to the cag pathogenicity island.
![]() View larger version (30K): [in a new window] |
FIG. 3. Intergenic regions containing one or more GANTC sites in H. pylori strain 26695. Gray bars indicate region lengths. Black bars indicate that the J99 ortholog has upstream GANTC sites, while dark gray bars indicate the absence of upstream GANTC sites. White bars indicate that an ortholog in J99 is missing. Arrows indicate the direction of ORFs.
|
![]() View larger version (13K): [in a new window] |
FIG. 4. GANTC site distribution along intergenic regions of strain 26695 for true and randomized data. Intergenic regions containing one GANTC site were divided into 10-nt nonoverlapping windows, and the occurrences of GANTC sites (start positions) in each window were summarized over all intergenic regions (dark gray bars). Randomized data were obtained by randomly positioning the GANTC sites within each intergenic region and counting occurrences of GANTCs as before. The procedure was repeated 10,000 times, and average counts were calculated (light gray bars). *, P < 0.005; 38 of 10,000 randomizations displayed at least as many occurrences as observed. Intergenic regions extending 260 nt are not shown.
|
|
View this table: [in a new window] |
TABLE 3. ORFs that contain GANTC sites upstream of translational start codons in strains 26695 and J99 and those common for both strainsa
|
|
|
|---|
Phase variation is described as a mechanism where simple repetitive DNA motifs lose or gain repeats during replication, which in turn may lead to premature translational stop codons and truncated proteins, and is proposed to help the bacteria to adapt to changes in environmental conditions and immune evasion (15, 34). In H. pylori, three major groups of genes with short sequence repeats that may serve as targets for phase variations have been identified: cell surface-associated genes, R-M systems, and genes involved in lipopolysaccharide biosynthesis (3, 9, 26, 35). We observed variability in intergenic homopolymeric tracts in strains that possessed the M.HpyAIV gene. A nucleotide insertion or deletion in these homopolymeric tracts results in an inactive M.HpyAIV, suggesting that this gene may phase vary by slipped-strand DNA mispairing.
From an in vivo-propagated population of H. pylori single-colony isolates, we found no evidence for frequent MTase inactivation due to the slipped-strand mispairing mechanism. It is likely that the number of repeated nucleotides present in a phase-variable region influences the rate of slipped-strand mispairing. In the investigated material, seven adenine residues were present in active strains, a rather short repetitive sequence. The region may therefore be too short for high-frequency slipped-strand mispairing. In single-cell colonies obtained from the same individual, we found that both the MTase and REase genes were either present or absent in these colonies, which may indicate reacquisition/deletion of the complete R-M system, as described by Takata et al. (41). In their study, 113-bp repeats flanking the R.HpyAIV-M.HpyAIV gene system upstream and downstream in strains 26695 and J99 are described. They showed that in related strains lacking the entire R-M system, a deletion PCR product using primers targeting flanking genes contained only one copy of the 113-bp repeat, which indicates that the R-M system is a mobile genetic element. For a type III R-M system, there is evidence of the coordinated transcription of the res and mod genes (restriction and modification subunits in type III R-M systems) (9). Coregulated transcription is probably common in R-M systems, since the lack of an active MTase in combination with a corresponding active REase could digest the self-DNA, which would be lethal for the strain. Usually, nonmethylated plasmid vectors are difficult to transform to H. pylori strains, due to the various numbers of R-M systems. It would be interesting in future studies to investigate if modifications of R-M systems increase the possibility of circumventing these problems in experimental strains.
According to RT-PCR results, both the MTase and the REase were transcribed in the wild type, but neither transcript was detected in the mutant strain. In the mutant strain, the lack of a transcript of the REase gene, situated downstream of the MTase, may indicate that a coregulatory mechanism inactivates the transcription of this gene, since R.HpyAIV in strain 26695 has been described as active elsewhere (19).
Although both R.HpyAIV-M.HpyAIV genes were present in most of our strains, the activity of the REase was not investigated, and other studies have shown that it is not unusual that strains have active MTases but inactive REases (18, 19, 43); this suggests that it might be beneficial for the bacteria to preserve specific MTase function or that there is no selective advantage to preserve the REase.
Previous studies have shown that R-M systems are associated with the outcome of H. pylori infection. Expression of the type II REase iceA1 was up-regulated upon contact with epithelial cells. Two iceA alleles have been found (iceA1 and iceA2); iceA1 strains are associated with peptic ulcer disease (30). The inactivation of the cognate MTase M.HpyI results in the alteration of dnaK operon transcript levels in stationary-phase cultures and following host cell contact in vitro (10). In another study, mutants with decreased adherence and elongated cell structure were identified. The change in phenotype was due to a knockout of a type II REase gene, the R.HpyC1I gene (20). The M.HpyAIV gene and other genes belonging to R-M systems (type I hsdS genes) were associated with a high host response in transgenic mouse models (7). These different studies indicate that some H. pylori R-M systems may be important when the gastric environment changes and in response to the host.
An MTase with the same GANTC specificity as M.HpyAIV, CcrM, has a regulatory role in C. crescentus, Brucella abortus, Rhizobium meliloti, and Agrobacterium tumefaciens. In these alphaproteobacteria, CcrM is essential for viability (17, 32, 37, 47). However, in the case of H. pylori, we showed that M.HpyAIV is not essential for viability and that the isogenic mutant does not have reduced growth compared to the wild-type strain in liquid culture; moreover, no difference in heat shock survival was observed.
The in silico analysis showed that there are fewer GANTC sites present in the sequenced genomes, especially in intergenic regions, than would be expected by chance. This might imply that the presence of GANTC sites is avoided and might affect gene expression if present at a higher frequency, possibly by the methylation of these sites. Sixty GANTC sites upstream of ORFs were conserved between strains 26695 and J99. These may represent candidates for genes that are regulated by GANTC methylation. Interestingly, the largest group of genes found consisted of genes that belonged to the cag pathogenicity island. However, in our qPCR assay, none of the cag genes tested were affected in the mutant strain. An explanation may be that the RNA in our assay was isolated in exponential phase, and other settings may have had different results. Nevertheless, katA transcription levels were found to be down-regulated in the mutant strain. Since M.HpyAIV is not present in all H. pylori strains, the effect of transcriptional regulation is not essential but may give rise to strain-specific alterations which could cause the strains to become more or less virulent.
Thanks to Lena Eriksson for help with RNA isolation and to Annelie Lundin for help with the heat shock assay.
Published ahead of print on 5 October 2007. ![]()
Present address: Genome Institute of Singapore, Singapore 138672, Republic of Singapore. ![]()
Present address: Department of Earth and Planetary Science, University of California, Berkeley, CA 94720-4767. ![]()
Present address: Department of Science, Örebro University, 701 82 Örebro, Sweden. ![]()
|
|
|---|
3-fucosyltransferase genes. Infect. Immun. 67:5361-5366.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»