
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Departments of 1 Epidemiology and 2 Urology, The University of Texas M. D. Anderson Cancer Center, Houston, Texas
Requests for reprints: Xifeng Wu, Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Unit 1340, 1155 Pressler Boulevard, Houston, TX 77030. Phone: 713-745-2485; Fax: 713-792-4657. E-mail: xwu{at}mdanderson.org
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
It was estimated that in 2006, bladder cancer would be the fourth most frequently diagnosed cancer in men and the ninth in women in the United States (8). Cigarette smoking is an established risk factor for bladder cancer (9). Occupational exposures to 4-aminobiphenyl, 2-naphthylamine, benzidine (10), and aromatic amines, such as o-toluidine (11), also play an important role in the initiation of bladder cancer. These exposures lead to DNA damage that, if remained damaged, may result in unregulated cell growth and even cancer. DNA damage repair and cell cycle checkpoints facilitate cellular responses to DNA damage from endogenous and exogenous mutagenic exposures to maintain genomic integrity. The base excision repair (BER) pathway is one of the four major DNA repair pathways in human cells. The proteins in the BER pathway mainly work on damaged DNA bases arising from endogenous oxidative and hydrolytic decay of DNA. Base damage and DNA single-strand breaks are mainly repaired through the BER pathway (12).
This pathway is a multistep process that requires the activity of several proteins (12, 13). Cigarette smoke is a rich source of reactive oxygen species that can induce a variety of DNA damages, some of which are repaired by the BER pathway.
In this study, we estimated the frequency of eight SNPs from seven BER pathway genes, including MBD4 Glu346Lys, MUTYH Gln335His, OGG1 Ser326Cys, APEX1 Glu148Asp, XRCC1 Arg194Trp, XRCC1 Arg399Gln, ADP-ribosyltransferase (ADPRT) Val762Ala, and POLD1 Arg119His in bladder cancer cases and controls. We applied several statistical approaches to evaluate BER pathway gene-gene and gene-environment interactions in bladder cancer susceptibility.
| Materials and Methods |
|---|
|
|
|---|
Epidemiologic Data
After informed consent was obtained, all study participants completed a 90-min in-person interview that was given by M. D. Anderson Cancer Center staff interviewers. The interview elicited information on demographics and smoking history. The questionnaire consisted of a fixed script and included introductory and transitional statements. All interviewers were trained for the use of probes. At the conclusion of the interview, a 40-mL blood sample was drawn into coded heparinized tubes. Human subject approval was obtained from the M. D. Anderson Cancer Center, Baylor College of Medicine, and the Kelsey-Seybold institutional review boards. An individual who had smoked at least 100 cigarettes in his or her lifetime was defined as an ever smoker. Ever smokers include former smokers, current smokers, and recent quitters (those who had quit within the previous year).
DNA Isolation
Genomic DNA was isolated from peripheral blood using QIAamp DNA blood maxi kit (Qiagen, Valencia, CA) according to the manufacturer's protocol. The working aliquots of the genomic DNA were stored at 20°C until use.
Genotype Assays
Each single nucleotide polymorphism (SNP) genotyping was done using the Taqman method with a 7900 HT sequence detector system (Applied Biosynthesis, Foster City, CA). The primer and probe sequences for each SNP are available on request. Typical amplification mixes (5 µL) contained sample DNA (5 ng), 1x Taqman buffer A, 200 µmol/L deoxynucleotide triphosphates, 5 mmol/L MgCl2, 0.65 units of AmpliTaq Gold, 900 nm of each primer, and 200 nmol/L of each probe. The reactions were carried out in the Dual 384-Well GeneAmp PCR System 9700. The thermal conditions were 95°C for 10 min followed by 50 cycles of 92°C for 30 sec and 60°C for 1 min. Following the amplification reaction, the reacted plates were read using the ABI Prism 7900HT Sequence Detection System. The analyzed fluorescence results were then automatically called into genotypes using the built-in software of the system. Water control, amplification internal controls, and previously genotyped samples were included in each plate to ensure accuracy of the genotyping, and 5% of the samples were randomly selected and run in duplicates with 100% concordance.
Statistical Analysis
Using the Intercooled Stata 8.0 statistical software package (Stata Co., College Station, TX), The Pearson
2 test was used to test for differences between the cases and the control subjects for the categorical variables of gender, smoking status, and each SNP genotype. The Student's t test was used to test for differences between the case and control subjects for the continuous variables of age and pack-year. Hardy-Weinberg equilibrium for the genotypes was tested by a goodness-of-fit
2 test. Odds ratios (OR) and 95% confidence intervals (95% CI) were calculated as an estimate of relative risk. Unconditional multivariate logistic regression was used to control for possible confounding by age, gender, and smoking status, when appropriate as well as when examining interactions between SNPs and smoking. Interaction was tested using a multiplicative interaction term included in the multivariate model. Joint effects were analyzed using never smokers with the wild-type (WT) genotype as the reference group. Statistical significances of the interactions were assessed using likelihood ratio tests comparing the models with and without interaction terms.
Classification and Regression Tree Approach
For higher-order gene-gene interactions, classification and regression tree (CART) analysis was done using the HelixTree Genetics Analysis software (version 4.1.0; Golden Helix, Bozeman, MT). CART is a binary recursive partitioning method that produces a decision tree to identify subgroups of subjects at higher risk (14). Specifically, the recursive partitioning algorithm in HelixTree starts at the first node (with the entire data set) and uses a statistical hypothesis testing method, formal inference-based recursive modeling, to determine the first locally optimal split and each subsequent split of the data set, with multiplicity-adjusted P values to control tree growth (P < 0.05). This process continues until the terminal nodes have no subsequent statistically significant splits or the terminal nodes reach a prespecified minimum size (at least 10 subjects for each terminal node in our analysis).
Multifactor Dimensionality Reduction Approach
The nonparametric multifactor dimensionality reduction (MDR) approach was selected to complement logistic regression for the analysis of gene-gene and gene-environment interactions. The MDR method was first described by Moore et al. (15-18). Here, we briefly describe MDR method. MDR is a nonparametric and genetic modelfree alternative to logistic regression for detecting and characterizing nonlinear interactions among discrete genetic and environmental attributes. The MDR method combines attribute selection, attribute construction, and classification with cross-validation and permutation testing to provide a comprehensive and powerful data mining approach to detecting nonlinear interactions. The method involved several steps. In step one, the data were divided into a training set (9 of 10 of the data) and an independent testing set (the remaining 1 of 10 of the data) as part of cross-validation. In step two, a set of n factors (in this case, factors) were selected, where n = 1 to 5. In steps 3 and 4, the n factors and their possible multifactor classes were represented in n dimensional space. The ratio for the number of cases to the number of controls was calculated within each multifactor class. Each multifactor class in n dimensional space was then labeled as "high risk" if the case to control ratio met or exceeded a threshold (for example, 1.1065) or as "low risk" if that threshold was not exceeded, thus reducing the n dimensional space to one dimension with two levels (low risk and high risk). In step five, the model that gave the lowest misclassification error was selected for each set of n factors. In step six, a prediction error was estimated for each model selected in step five, as a cross-validation procedure. Steps one to six were repeated 10 times using a random seed number. We did this entire 100-fold cross-validation procedure 10 times, using different random seed numbers, to reduce the chance of observing spurious results due to chance divisions of the data. In addition to prediction error, we also estimated a cross-validation consistency, defined as a percentage of the same combination of factors selected as the best model among different cross-validation data sets, for each set of n factors. A testing accuracy of 0.5 was expected under the null hypothesis. Statistical significance was determined using permutation testing. Here, the case-control labels were randomized n times, and the entire MDR model fitting procedure was repeated on each randomized data set to determine the expected distribution of testing accuracies under the null hypothesis. In this study, we used 100-fold cross-validation and 1,000-fold permutation testing. MDR results were considered statistically significant at the 0.05 levels. To better visualize interactions, we built an interaction dendrogram that places strongly interacting variables close together at the leaves of the tree. This method is included in the MDR software and was described by Moore et al. (19).
| Results |
|---|
|
|
|---|
Risk Associated with Individual SNPs Stratified by Smoking Status
The distributions of all selected SNP in the control subjects were in agreement with Hardy-Weinberg equilibrium (P > 0.05). By evaluating the independent effects of each SNP on bladder cancer susceptibility using unconditional multivariate logistic regression, we did not observe that the main effects of the BER polymorphisms at each SNP were related to bladder cancer risk. Among ever smokers, however, OGG1 S326C variant genotype was associated with a significantly reduced risk of bladder cancer (OR, 0.74; 95%CI, 0.56-0.99). In the never smoking group, ADPRT V762A variant genotypes conferred a significantly reduced risk (OR, 0.58; 95% CI, 0.37-0.91; Table 1
).
|
|
|
|
|
|
| Discussion |
|---|
|
|
|---|
Several studies have in fact found associations between single genetic polymorphisms in some BER genes, such as OGG1 S326C, APEX1 D148E, MUTYH H335Q, APEX1 D148E, ADPRT V762A, and XRCC1 R194W, and risk of certain cancers, including human breast (20), colorectal (21), gastric (22), and endometrial cancer (23). Association of the common S326C polymorphism of OGG1 with an increased risk for cancer was observed in several case-control studies (24). However, no previous studies have found associations between OGG1S326C polymorphisms and bladder cancer risk, and only one study showed the OGG1S326C variant genotypes with a significantly reduced risk for superficial bladder cancer recurrence (25). This result was consistent with our finding that OGG1 S326C variant genotypes was associated with a significantly reduced risk of bladder cancer in ever smokers.
Two epidemiologic studies examined the effect of APEX1 D148E polymorphisms on cancer. One study found a significant positive association between APEX1 Glu/Glu genotype and lung cancer (26) in a Japanese population, whereas another study is consistent with our finding, in which there was no association between APEX1 genotype and bladder cancer risk (27). Six epidemiologic studies examined the effect of the XRCC1 polymorphisms on bladder cancer. A reduced risk of XRCC1 R399Q homozygous variant Q genotypes compared with those with one or two WTs was observed by Kelsey et al. (28). This association was particularly apparent among heavy smokers in a study by Shen et al. (29). Two other studies by Sanyal et al. (30) and Matullo et al. (31) suggested that XRCC1 R399Q had no effect on the risk of bladder cancer. Stern et al. (32, 33) reported that the XRCC1 R194W homozygous variant W genotypes have a protective effect on bladder cancer. The ADPRT Val762Ala polymorphism plays an important role in the development of gastric cancer, and the XRCC1 Arg399Gln polymorphism may serve as a risk modifier (34). Differences in ethnicity and sample size of the study populations and differences in the etiology of different cancer sites might account for some of the discrepancies among previous studies and our data.
The BER pathway involves a serial of critical actions from the genes we investigated in this study. MBD4, MUTYH, and OGG1 are three base-specific glycosylases that have active roles in releasing the modified base and creating a basic site. The APEX endonuclease then incises the DNA strand at the abasic site. XRCC1 functions as scaffold protein in BER by bringing DNA polymerase and ligase together at the site of repair. ADPRT is another important enzymes that can temporarily bind to and protect DNA single-strand interruptions and recruit other repair proteins. The proofreading domain of DNA polymerase
(encoded by POLD1 gene) has a critical role in faithful DNA synthesis in this DNA repair process (35). Because the BER pathway is a group of proteins functioning cooperatively to repair base damages from environmental and exogenous insults, studies designed to analyze an individual gene has the obvious limitations to elucidate the effect of the entire BER pathway. Our results supported our hypothesis that multiple genes and smoking are involved in the predisposition to bladder cancer. The relationship between DNA BER polymorphisms, smoking, and cancer risk may be particularly complex because the effects of genetic variation in the repair process may depend on the presence of a DNA lesion (e.g., gene-environment interaction) or the presence or absence of polymorphisms in other genes in the same or a different pathway (Fig. 2). Thus, we suspect that some of the conflicts between the results of previous studies might also be due to uncharacterized gene-gene or gene-environment interactions.
For studies attempting to examine possible interactions among two or more genetic polymorphisms, traditional methods, such as unconditional logistic regression, may either prove infeasible due to combinations of factors with no observations or have limited power to detect clinically relevant interactions due to a low number of events per variable in the model. The CART and MDR method was proposed as a possible solution in such settings (14-19). Andrew et al. (36) has recently applied MDR to analyze the gene-gene and gene-environment interactions and identified some interesting interactions among DNA repair gene polymorphisms and smoking in a bladder cancer case-control study. These approaches improve statistical power to efficiently identify potential gene-gene and gene-environment interactions. The results of these novel algorithms were consistent with our logistic regression analysis for the two-way interaction models. Using the logistic regression approach, we identified a positive interaction between smoking and ADPRT V762A (P = 0.019; Table 2). This is consistent with what we found in Fig. 1 using CART, where we identified that in nonsmokers, ADPRT V762A was the most important factor that influences bladder cancer risk. These findings also agreed with the effect of this SNP on bladder cancer risk stratified by smoking status showed in Table 1, in which ADPRT V762A variant genotypes had a protective role only in never smokers. Similarly, using CART, we also identified that OGG1 S326C was the most important factor in smokers for bladder cancer risk (Fig. 1). Although the interaction between OGG1 S326C and smoking was not significant in logistic regression analysis in Table 2, we did find that OGG1 S326C variant genotypes had a significant association with decreased risk for bladder cancer in ever smokers when relative risk was calculated (Table 1). These different analysis approaches have validated each other and have emphasized the reproducibility of our findings. When never smokers with ADPRT V762A variant genotypes were set as the reference group, using CART, we found that the smokers carrying WT genotype of OGG1 S326C, variant genotypes of XRCC1 R194W, and variant genotypes of MUTYH H335Q had a 31.86-fold (95% CI, 4.01-253.1) increased risk for bladder cancer. These data indicated the significant joint effects between smoking and genetic polymorphisms in the BER pathway.
We attempted to test four-way interactions to replicate our findings from the MDR analysis in logistic regression; however, the model failed to converge due to the small number of individuals in some cells. Thus, our experience highlights the need for alternative, more powerful methods. Of the entire possible two-factor combinations tested, MDR analysis selected smoking and MUTYH-335 as the best two predictors of bladder cancer risk. However, comparing with the one-factor model with smoking status as the only risk factor, this model did not improve on the testing accuracy and had a decreased cross-validation consistency. These data suggested that the two-factor model was not a good choice for bladder cancer risk prediction. Similarly, the three-factor model was worse in both cross-validation consistency and average testing accuracy when compared with the one-factor model. The five-factor model had a similar 100% cross-validation consistency as the one-factor model but had a decreased average testing accuracy. Only the four-factor model, including smoking, APEX1 D148E, ADPRT V762A, and OGG1S326C, was the strongest model overall because it had the highest level of testing accuracy and showed good cross-validation consistency. The MDR four-factor model indicated that smoking, APEX1 D148E, ADPRT V762A, and OGG1 S326C were a high-risk combination of factors but did not specify whether there was a synergistic relationship. Figure 2 helped us interpret the nature of the interactions in these multifactor models. In Fig. 2, we observed that although smoking was an established risk factor for bladder cancer, in some cases, depends on the genotypes the studied individuals were carrying, ever smokers (harboring TT genotype of ADPRT V762A and CG or GG genotype of OGG1 S326C and also TC or CC genotype of APEX D148E) could have low bladder cancer risk and never smokers (harboring TT genotype of ADPRT V762A and CC genotype of OGG1 S326C and TT genotype of APEX D148E) could have higher bladder cancer risk.
In summary, we used the multifaceted analytic approach (CART and MDR) to explore the complex interaction effect between multiple genes and smoking on bladder cancer susceptibility in large case-control populations in Texas. In this study, we have revealed that the interaction relationship among the SNPs, smoking, and bladder cancer risk. These results support the hypothesis that common polymorphisms in DNA repair genes modify bladder cancer risk and emphasize DNA repair is a complex process involving the cooperation of multiple enzymes in DNA BER pathways.
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Received 8/22/06; revised 10/23/06; accepted 11/10/06.
| References |
|---|
|
|
|---|
, VEGF, hOGG1S326C, GSTM1, and GSTT1:useful determinants for clinical outcome of bladder cancer. Urology 2005;65:705.[CrossRef][Medline]This article has been cited by other articles:
![]() |
T. Lao, W. Gu, and Q. Huang A meta-analysis on XRCC1 R399Q and R194W polymorphisms, smoking and bladder cancer risk Mutagenesis, September 2, 2008; (2008) gen046v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Margulis, J. Lin, H. Yang, W. Wang, C. G. Wood, and X. Wu Genetic Susceptibility to Renal Cell Carcinoma: The Role of DNA Double-Strand Break Repair Pathway Cancer Epidemiol. Biomarkers Prev., September 1, 2008; 17(9): 2366 - 2373. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Chen, A. M. Kamat, M. Huang, H.B. Grossman, C. P. Dinney, S. P. Lerner, X. Wu, and J. Gu High-order interactions among genetic polymorphisms in nucleotide excision repair pathway genes and smoking in modulating bladder cancer risk Carcinogenesis, October 1, 2007; 28(10): 2160 - 2165. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |