
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Genome Sciences Centre and 2 Cancer Control Research, British Columbia Cancer Research Centre; Departments of 3 Medical Oncology and 4 Pathology and Laboratory Medicine, British Columbia Cancer Agency; Departments of 5 Healthcare and Epidemiology, 6 Oncology, 7 Laboratory Medicine, and 8 Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
Requests for reprints: Angela R. Brooks-Wilson, Genome Sciences Centre, British Columbia Cancer Research Centre, 675 West 10th Avenue, Room 7-111, Vancouver, British Columbia, Canada V5Z 1L3. Phone: 604-675-8156; Fax: 604-675-8178. E-mail: abrooks-wilson{at}bcgsc.ca
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
NHL is divided into subtypes that have histologically and clinically diverse presentation, disease progression, and outcome. Most cases are sporadic. Many types of NHL tumors are known to have characteristic translocations that generally juxtapose proliferation-associated or apoptosis-inhibiting genes with immunoglobulin-related loci that are active in B or T cells. The tendency for NHL tumors to have specific translocations may imply an underlying defect in the cellular systems that protect against the occurrence of translocations. It has been suggested that defects in immunoglobulin switching may be involved in susceptibility to lymphoid cancers (6). Given that not all NHL tumor translocations involve immunoglobulin genes, however, genes more generally involved in protection against DNA double-stranded breaks (DSB) may also play a role in NHL susceptibility. Consistent with this idea, loss-of-function mutations in the ATM gene, a central player in the detection and response to DNA DSBs, cause the autosomal recessive syndrome ataxia telangiectasia, a feature of which is susceptibility to cancer, particularly lymphoma. Genetic variants in genes involved in DNA DSB repair are therefore candidates for susceptibility factors for NHL.
H2AFX, or H2AX, is a member of the histone H2A gene family and is fundamental to the detection of and response to DNA DSBs (7). H2AFX has a serine residue near its COOH terminus that is rapidly phosphorylated when cells are exposed to DNA damage (8). ATM initiates H2AFX phosphorylation in response to ionizing radiation (9, 10). Many factors associated with the DNA damage response (e.g., ATM, BRCA1, RAD51, and the MRE11/RAD50/NBS1 complex) are found to colocalize with phosphorylated H2AFX (
-H2AFX) at the sites of DSBs. Although
-H2AFX is not essential for DSB repair, it may modulate the process by reorganizing chromatin and preventing the separation of broken DNA ends, thus facilitating the concentration of repair factors and complexes at the lesion (7).
H2afx knockout mice are radiation sensitive, growth retarded, and immunodeficient and display chromosomal instability and repair defects (11). These mice have a high incidence of B-cell lymphomas that develop at an accelerated rate on a p53-deficient background (12, 13). Tumors derived from such animals show translocations. H2AFX is a genomic caretaker that requires the function of both alleles for optimal protection against tumorigenesis (12, 13). A dosage-sensitive gene, such as H2AFX, for which activity could vary due to genetic variants in the gene or its regulatory sequences, is an ideal candidate for a lymphoma susceptibility gene.
NHL, like most cancers, is a complex genetic disease and, as such, is suited to the application of genetic association studies. We have used 487 NHL cases and 531 controls from a population-based case-control study to test for association between genetic variants in the H2AFX gene with NHL and with individual subtypes of NHL. Genetic association tests were preceded by single nucleotide polymorphism (SNP) discovery resequencing of the H2AFX gene in a subset of NHL cases to validate and estimate the frequency of genetic variants in the gene.
| Materials and Methods |
|---|
|
|
|---|
|
Resequencing of the H2AFX Gene
Eight overlapping PCR primer pairs were designed to span the promoter and single exon of the H2AFX gene, including 1,000 bp upstream and 50 bp downstream of the transcribed region. Primer design was done using the H2AFX genomic sequence (accession number BC013416) and Primer3 (primer3_www.cgi v 0.2).9 Forward and reverse primers incorporated M13 forward or M13 reverse extensions, respectively, at their 5' ends. The sequences and annealing temperatures of all primers used in bidirectional SNP discovery sequencing are available in Supplementary Table S1. PCR sequencing and sequence analysis procedures were carried out as described previously (14). Haploview (15) was used to calculate and display inter-SNP linkage disequilibrium information derived from sequence data.
SNP Genotyping
Taqman allelic discrimination assays were designed using the Assays-by-Design service (Applied Biosystems). The sequences of primers and probes are shown in Supplementary Table S2. Assays were done in 384-well format with up to 9.6 ng of genomic DNA, 2.5 µL of 2x Taqman Universal PCR Master Mix, 0.125 µL of 40x Taqman probes/primers mix, and 2.375 µL H2O in a total reaction volume of 5 µL. Cycling conditions in the ABI PRISM 7900HT instrument were 95°C for 10 min followed by 40 cycles of 92°C for 15 s and 60°C for 1 min. Fluorescence data were collected post-PCR and analyzed using the SDS2.2 program (Applied Biosystems).
Statistical Analysis of Genotype and Haplotype Data
Statistical analysis of genotype data was done using standard methods for case-control studies (16). Tests for trend were done when five or more rare homozygous alleles were present. The primary analyses used logistic regression models to estimate the odds ratio (OR) for the development of NHL with 95% confidence intervals (95% CI) for each of the identified SNPs. ORs were adjusted for age (20-49, 50-59, 60-69, and 70+ years), sex, place of residence, and ethnicity (Caucasian, Asian, South Asian, and mixed/other/unknown). Tests for Hardy-Weinberg equilibrium (HWE) were also carried out. Statistical analysis of the haplotypes was done using The Haplo.stats Package (17), which uses an expectation-maximizationbased maximum likelihood approach to simultaneously do haplotype reconstruction by estimation of haplotype weights and estimate ORs and to account for haplotype uncertainty in the risk estimates. Case/control status, non-SNP variables, and diallelic SNP data are used to best estimate the haplotype weights. To evaluate the robustness of risk estimates, we computed the false discovery rate (18). The false discovery rate reflects the expected proportion of false-positive findings to the total number of significant findings and was computed using the P trends for each genotype (n = 3 comparisons) for P < 0.05.
Methylation Analysis
We assessed the methylation state of one H2AFX SNP in blood genomic DNA using a methylation-sensitive HpaII restriction assay. SNP2 is within a recognition site (CCGG) for HpaII. If the C residue of the CpG dinucleotide within the HpaII site is methylated, digestion cannot occur; if it is unmethylated, digestion can take place. HpaII digestion of genomic DNA followed by PCR using primers flanking SNP2 was used to test the methylation status of this HpaII site, which was the only one within the PCR amplicon. Production of PCR product indicates that the CpG site of SNP2 was methylated.
| Results |
|---|
|
|
|---|
Figure 1 illustrates the structure of H2AFX, a single-exon gene at 11q23.3, and summarizes its genetic variants and their frequencies. Seven SNPs were detected: three 5' to the transcribed region, three in the 3'-untranslated region, and one in the 3' flanking region. No insertions, deletions, or microsatellite sequences were noted in the 2,700 bp region sequenced. All but two of the seven SNPs had a minor allele frequency (MAF) >30%. SNP7 had a MAF of 6% and SNP3 was rare, with MAF 0.5% in our sequence data. The six most frequent SNPs were represented in dbSNP.10 Corresponding dbSNP accession numbers are shown.
|
0.67) encompasses SNP1, SNP2, SNP4, SNP5, and SNP6 but excludes SNP7. Linkage disequilibrium is highest between SNP1, SNP4, SNP5, and SNP6; linkage disequilibrium between SNP2 and these other four SNPs is lower (r2 = 0.67-0.75), perhaps indicating that SNP2 is a different age (likely younger) than the other four SNPs in this region of linkage disequilibrium. Additional genetic variants in H2AFX not seen in our resequenced samples are reported in dbSNP.12 Many of these were observed in individuals of African ancestry, however, an ethnic group that is very uncommon in our study population.
|
HWE was tested in controls only, using exact methods (22) in each major ethnic group in the study (Caucasians, Asians, and South Asians) separately, for each of the three SNPs genotyped. The ethnic distribution of study participants corresponds to that of the geographic areas in which sample collection took place (Vancouver and Victoria, British Columbia). SNP2, for which we saw the major positive results of this study, was in HWE in each of the three ethnic groups. SNP6 showed a barely significant deviation from HWE in Caucasians (P = 0.044), and SNP7 deviated from HWE in Asians (P = 0.025) and South Asians (P = 0.025). These deviations do not remain significant, however, when correction is made for multiple comparisons. The deviations from HWE of SNP7, which is rare, in the two smallest groups may not be surprising. Many of the Asians and South Asians who come to British Columbia are from different countries and regions. This recently immigrated group may not yet be in population equilibrium ("random mating"), so HWE may not be a valid expectation in this case, particularly for low-frequency SNPs.
Association Tests of H2AFX SNPs
Table 2
summarizes the results of association tests of SNP2, SNP6, and SNP7 with NHL. These multivariate analyses were adjusted for age, sex, ethnicity, and region of residence. The AA genotype of SNP2 is associated with protection from NHL, with an OR of 0.54 (95% CI, 0.37-0.79; P = 0.001), indicating that this genotype is associated with a near halving of risk for NHL. Heterozygosity at SNP2 gave an OR of 0.92 that was not statistically significant, although the trend was significant (P = 0.003). SNP6 and SNP7 did not show association with NHL.
|
These results show that the association of H2AFX SNP2 genotypes is subtype specific and is evident for FL and possibly MCL but not DLBC. The P values for trend remained significant even after adjustment for the false discovery rate for SNP2 with NHL, all B-cell NHL, all other NHL without FL, and DLBC, FL, and MCL.
T-cell types of NHL were not associated with H2AFX SNPs, either as a group (all T-cell NHL together) or as separate subtypes. Other NHL subtypes (marginal zone lymphoma/mucosa-associated lymphoid tissue, small lymphocytic lymphoma, lymphoplasmacytic lymphoma, miscellaneous B-cell lymphomas, mycosis fungoides, peripheral T-cell lymphoma, and miscellaneous T-cell lymphomas) were also tested for association with H2AFX SNPs (see Supplementary Table S3), but the borderline P values noted in two comparisons did not remain significant after adjustment for the false discovery rate.
We also tested the association of SNP2 with NHL separately in Caucasians, Asians, and South Asians and found the protective effect of homozygosity for the A allele to be statistically significant only in Caucasians (OR, 0.56; 95% CI, 0.37-0.85; P = 0.007; data not shown). This is not surprising as Caucasians account for 81% of our study population, and the smaller number of samples in the other ethnic groups significantly reduces the statistical power to detect association. The association of SNP2 with NHL is unlikely to be due to population stratification in this study because detailed ethnicity data (of all four grandparents of each study subject) were used to divide groups for separate analysis or for adjustment in combined analyses.
Haplotype Analysis of H2AFX
Additional methodologic controls in the form of 72 normal reference DNA samples (derived from members of five three-generation Centre d'Etude du Polymorphisme Humain reference families) were also genotyped. Inclusion of these samples provides a consistency check for probabilistic haplotype analysis in the case/control samples. Genotypes from the families were used to deduce the coinheritance of SNP markers and haplotypes. Haplotypes determined in the Centre d'Etude du Polymorphisme Humain families corresponded in every case to common haplotypes deduced by probabilistic methods.
Table 3 summarizes the results of association tests of haplotypes predicted using the Haplo.stats Package (17) of H2AFX SNP2, SNP6, and SNP7 with NHL. Additional analyses were done using the hapassoc and PHASE 2.0 programs,13 with comparable results. Three common haplotypes (GCA, ACA, and ATA) and four rare haplotypes were inferred. Rare haplotypes (with individual frequencies of <5%) were combined in the association test. A significant protective effect was observed for haplotypes ACA (OR, 0.51; 95% CI, 0.35-0.75; P = 0.001) and ATA (OR, 0.80; 95% CI, 0.66-0.97; P = 0.03) with NHL. All three common haplotypes (including the referent haplotype) contained the A allele of SNP7, making SNP7 unlikely to contribute to the protective effects of these haplotypes. We therefore also tested the association of the haplotypes formed by SNP2 and SNP6 only with NHL (Table 3). The haplotypes AC and AT were both significantly protective against NHL, a result that may be driven by the presence of the SNP2 A allele in all haplotypes inversely associated with NHL. The haplotype-based association test results in different NHL subtypes and groups paralleled those seen with SNP2, with positive associations seen in NHL, all B-cell NHL, all other B-cell NHL without FL and DLBC, FL, and MCL, but not DLBC (Table 3).
|
Evolutionary Conservation and the Origin of H2AFX SNP2
The SNP2 G/A variant corresponds to the G residue of a CpG dinucleotide. The C residue of a CpG dinucleotide can be methylated through epigenetic regulatory system that affects gene expression in mammals. SNP2 is near the 5' end of an extensive CpG island detected using CpG Island Searcher14 (23) that spans the 5'-untranslated region, promoter, and entire transcribed region of H2AFX (data not shown). Methylation of cytosine creates 5-methylcytosine (24, 25), which can spontaneously deaminate to thymine. Consequently, C to T transitions accumulate over evolutionary time scales (26, 27). If a methylcytosine residue deaminates to thymine, it will be paired with adenine on the opposite strand and the original CpG dinucleotide will become CpA. This process can create a G/A SNP immediately following a C nucleotide.
To gain insight into the origin of SNP2, we did a multispecies alignment (Fig. 3 ) of the sequence surrounding this polymorphic site. Sequences from one gorilla, one orangutan, one baboon, one gibbon, and 20 chimpanzees were generated using primate DNA samples using the same primers and reaction conditions for human H2AFX amplicon 3. Although not transcribed, this region is highly conserved. All primate sequences have an A at the position of human SNP2; dog shows a C and mouse and rat both have a G nucleotide. All 20 chimp sequences were identical, so it is unlikely that this position corresponds to a common polymorphism in the chimpanzee. Given that only one DNA sample was compared for each of the other species, we cannot assume that this site is invariant in these species. In no genome, except human, however, was this position annotated as being polymorphic.
|
Methylation Analysis of SNP2 in Blood DNA of Cases and Controls
If the C residue adjacent to SNP2 were highly methylated, this could lead to relative transcriptional silencing of H2AFX and compromise of DNA DSB repair. We assessed the methylation state of this residue in blood genomic DNA using a methylation-sensitive HpaII restriction assay. AA homozygotes were excluded because the A allele is not a HpaII restriction site. AG heterozygotes were excluded for simplicity of interpretation of the assay. The methylation status of all GG homozygotes in the case/control group was assayed using blood genomic DNA. Only 5 (4 cases and 1 control) of 247 samples tested gave a PCR product after digestion (data not shown), indicating that this site seems unmethylated in blood genomic DNA in a large proportion of both case and control samples. Given that blood genomic DNA is derived from a mix of many different cell types, this experiment likely indicates that the H2AFX SNP2 CpG site is unmethylated in the most abundant DNA-containing cell types in blood. However, this result may not reflect the methylation status of this site in the cell types of origin of NHL, FL, or MCL.
| Discussion |
|---|
|
|
|---|
We have shown that a G/A SNP 417 bp upstream of the start codon of H2AFX is associated with NHL. The AA genotype is associated with protection from lymphoma; conversely, the GG genotype can be considered to increase risk. The association is statistically significant in the overall NHL case/control comparison as well as in Caucasian cases and controls. Additional studies in other populations will be required to determine if this variant site has an effect in other ethnicities. When the most common two types of NHL are assessed separately, the association is present in FL but absent in DLBC. An additional significant association with MCL is based on a small numbers of cases. The numbers of cases with other NHL types are too small for this study to provide a definitive assessment of them. Analysis of less common NHL types will likely require the pooling of data from multiple lymphoma population studies. Larger studies will also be required to determine if SNP2 heterozygotes have intermediate level of protection. The power of data pooling in association studies was exemplified recently for the TNF and IL-10 genes (29).
HapMap (30) data show that linkage disequilibrium is very high for at least 40 kb upstream of H2AFX, so, alternatively, the SNP2 association could be due to a gene in linkage disequilibrium with H2AFX. Genotyping of additional SNPs in this 40 kb region will help to clarify the SNP2 association.
H2AFX is critical for protection against DNA DSBs that can lead to lymphoma. Our observation that a DNA variant in the human H2AFX gene is associated with protection/risk of lymphoma makes sense in light of the observation that haploinsufficiency for this gene leads to B-cell lymphomas in mice (12, 13). Genes that show haploinsufficiency (dosage dependence) are particularly intriguing as candidates for human disease variants; genetic variants causing slight variation in the activity or expression level of such genes could contribute to disease susceptibility in human populations.
We speculate that the AA genotype of SNP2 is associated with greater transcription of this DNA repair gene in a cell type or developmental compartment relevant for lymphomagenesis and that the protective "A" allele is less easily silenced by normal or aberrant methylation. Determination of whether SNP2 genotype correlates with H2AFX expression level or function will be important for understanding the effect of this variant. Given that the SNP2 association is observed in FL but not in DLBC, the relevant cell type may be follicle center B cells that are undergoing immunoglobulin class switching or somatic Ig mutation. The observation that the B-cell lymphomas of H2afx-deficient mice have predominantly IgH/c-myc translocations is consistent with such a hypothesis (12, 13). If the A allele were advantageous for DNA repair, however, why would the G allele be more common? The G allele, which may allow methylation of the adjacent C residue, could offer a different advantage by allowing more finely tuned control of the times and precise compartment of expression of H2AFX. The immune system, which needs to be exquisitely responsive at certain times (to maximize the immunoglobulin diversity necessary to resist infection) but turned off under appropriate circumstances (to avoid autoimmunity), could benefit from such fine control. Better DSB repair, however, may not necessarily be optimal when applied to the immune system. It is possible that down-regulation of H2AFX (possibly by methylation) during immunoglobulin switching helps enhance the diversity of the antibody repertoire and that this balances the potential negative consequences of "accidentally" down-regulating this gene, by methylation, in other tissues. Assessment of the overall methylation status of H2AFX with different SNP2 genotypes in normal and abnormal germinal center B cells would help determine if this is the case.
Other studies have recently found association between nonhomologous end-joining genes, which are important in protection against DSBs, in both NHL (31) and multiple myeloma (32), supporting a role for DSB repair genes in NHL risk and protection.
The AA genotype of H2AFX SNP2, or G(417)A, is associated with decreased risk of NHL (OR, 0.54), FL (OR, 0.4), and possibly MCL relative to the GG genotype. The risk for the GG genotype relative to the AA genotype can be expressed by the reciprocal ORs for NHL (OR, 1.85) and FL (OR, 2.5). The attributable risk of NHL for individuals who have the GG genotype is
46%. The GG genotype of this SNP is very common and is seen in 33% of NHL cases and in 35% of FL cases in this study. Thus, the population attributable risk for this SNP is
15% for NHL and 21% for FL. Although this SNP has a modest effect in terms of approximately doubling risk, its high frequency means that it is potentially a very important susceptibility factor for NHL, FL, and potentially MCL. It will be important to test this association in other populations not only to try to replicate this finding but also to verify whether the GG genotype is common in other populations.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Supplementary data for this article are available at Cancer Epidemiology, Biomarkers & Prevention Online (http://cebp.aacrjournals.org/).
9 http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi ![]()
10 http://www.ncbi.nlm.nih.gov/SNP ![]()
11 http://pga.gs.washington.edu/VG2.html ![]()
12 http://www.ncbi.nlm.nih.gov/projects/SNP/ ![]()
13 Shumansky and Spinelli, in preparation. ![]()
Received 7/31/06; revised 12/14/06; accepted 3/29/07.
| References |
|---|
|
|
|---|
-globin locus: implications for evolution of the
-globin pseudogene. Embo J 1987;6:9991004.[Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |