| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Department of Biostatistics, Section on Statistical Genetics, University of Alabama at Birmingham, Birmingham, Alabama; 2 Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University, South Carolina, California; 3 Department of Pediatrics, State University of New York at Buffalo, Buffalo, New York; and 4 Department of Psychiatry, Transdisciplinary Tobacco Use Research Center, University of Pennsylvania, Philadelphia, Pennsylvania
Requests for reprints: David T. Redden, Department of Biostatistics, RPHB 309D, 1530 3rd Avenue South, Birmingham, AL 35294-0022. Phone: 205-975-9165; Fax: 205-975-2540. E-mail: dredden{at}ms.soph.uab.edu
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
Confounding due to population stratification is often emphasized in association studies. Simply stated, population stratification can create confounding leading to spurious results in genetic association studies. We define a spurious association as any association between allelic variation at a marker locus and phenotypic variation that is due neither to the marker locus causing variation in the phenotype nor to being linked to a marker locus that causes variation in the phenotype. Confounding by population stratification can occur when both of two conditions are met (8-10). First, allele frequencies under investigation must vary among subpopulations. Second, disease prevalence or mean phenotypic value must vary among subpopulations. If researchers are unaware of the subpopulations, subjects can be unknowingly selected from differing subpopulations and a spurious association between genotype and phenotype can be created. Under such circumstances, the relationship between allele frequency and disease is confounded by subpopulation. Fortunately, statistical methods have been developed to identify when association studies may be influenced by population substructure and to adjust for the influence of population stratification in association studies (11-14). Even when population stratification is absent or has been controlled statistically, other causes that could lead to a false significance (inflated type I error rates due to multiple testing and influential observations) must be addressed. Within studies attempting to replicate reported associations, the issues of the appropriate effect size to investigate, statistical power to detect the genetic effect, and comparability of study populations and measures must be addressed.
This article presents two studies that used independent samples of treatment-seeking smokers to test for an association between a functional single nucleotide polymorphism (SNP) in the catechol-O-methyl-transferase (COMT) gene and nicotine dependence. COMT is the primary enzyme involved in the degradation and inactivation of dopamine (15). The COMT gene has a SNP in exon 3, a G1947A transition, which results in the substitution of a methionine for valine at codon 108 (Val108Met). This amino acid change affects the function of the transcribed protein 3- to 4-fold; the valine high activity (H) allele produces higher levels of enzyme activity and therefore decreased brain dopamine levels (16, 17). Because nicotine stimulates the secretion of dopamine into the neuronal synapse (18), we hypothesized that smokers who carried the high activity (H) allele and who therefore may have lower levels of dopamine, would experience a different level of reward from nicotine and possibly be more or less nicotine dependent.
The initial study of 342 individuals provided evidence in support of this hypothesis. However, the second study of 443 individuals failed to replicate the association between COMT and nicotine dependence detected within the first study. The analyses of this article document and illustrate a process to identify and examine possible reasons for the observed findings regarding the relationship between the COMT locus and nicotine dependence. Using the replication sample, this article illustrates the challenges of using a readily available independent sample to attempt confirmation of a genetic association. Furthermore, this article examines factors that may have contributed to nonreplication, including population stratification, influential observations, lack of power, and a type I error in the first study. Finally, this article presents recommendations regarding research designs to improve the reliability of findings.
| Materials and Methods |
|---|
|
|
|---|
18 years and had smoked at least 10 cigarettes/d for the prior 12 months. Exclusion criteria included pregnancy or lactation, uncontrolled hypertension, unstable angina, heart attack or stroke within the past 6 months, current treatment or recent diagnosis of cancer, drug or alcohol dependence, current diagnosis or history of a Diagnostic and Statistical Manual Fourth Edition psychiatric disorder, seizure disorder, and current use of bupropion or nicotine-containing products other than cigarettes. All participants completed self-report questionnaires during an in-person visit to the clinic, before the initiation of treatment. To reduce potential bias due to racial admixture, analyses were limited to persons of European ancestry (n = 342 in study 1 and n = 443 in study 2). All participants provided a 40-mL blood sample for genotyping and completed a set of standardized self-report questionnaires.
Genotyping Methods
PCR for the COMT genotype was done as described (19). Briefly, genomic DNA (50 ng) was amplified using 20 pmol of each primer (5'-CTGTGGCTACTCAGCTGTG- 3' and 5'-CCTTTTTCCAGGTCTGACAA-3'), in Applied Biosystems buffer [10 mmol/L Tris-HCl (pH 8.3), 50 mmol/L KCl, 1.5 mmol/L MgCl2], and Amplitaq DNA polymerase (1.0 unit; Roche Molecular Systems, Alameda, CA) with 2'-deoxynucleoside-3'-triphosphates (1.87 mmol/L; Invitrogen, Carlsbad, CA) in a 50-µL reaction volume. The PCR reaction had an initial melting temperature of 95°C (5 minutes) followed by 35 cycles of melting (95°C, 1 minute), annealing (55°C, 1 minute), and extension (72°C, 1 minute). An extension period of 4 minutes at 72°C followed the final cycle. The PCR reactions were done using an Applied Biosystems GeneAmp PCR System 9700 (Foster City, CA). The 169-bp PCR product was digested overnight at 37°C with Hsp92II (Promega, Madison, WI) and gave rise to fragments of 114, 23, and 32 bp for the high activity allele and 96, 23, 32, and 18 bp for the low activity allele. The digested PCR fragments were resolved by agarose gel electrophoresis (4% w/v, MetaPhor agarose, Cambrex Bio Science, Rockland, ME) and observed by ethidium bromide staining. The assay was validated by confirming polymorphic Mendelian inheritance patterns in 12 human family cell lines (n = 133 family members), encompassing three generations (data not shown; samples were obtained from National Institute of General Medical Sciences Human Genetic Mutant Cell Repository, Coriell Institute, Camden, NJ). Two independent investigators read the results of the genotyping study and 20% of the samples were repeated for quality control and their genotypes were confirmed. There were no discordant samples.
Forty-three SNPs were genotyped to assess possible population substructure. These 43 markers were spaced such that there were two to three SNPs per chromosome (three SNPs in the larger chromosomes and two in the smaller ones). None of these 43 SNPs were close to or in genes under study. These SNPs were not in coding regions. All SNPs have a minor allele frequency in people of European ancestry and African Americans of >0.1. All SNPs were available as validated and inventoried assay kits from Applied Biosystems.
Measures
Outcome: Nicotine Dependence. Baseline nicotine dependence was assessed using the Fagerstrom Test for Nicotine Dependence. This scale is a six-item self-report measure of nicotine dependence derived from the Fagerstrom Tolerance Questionnaire (20). Scores can range from 1 to 10 with higher scores indicating greater nicotine dependence.
Covariates
Weight was measured at baseline using a balance beam scale that was calibrated daily. Self-reported weight and measured weight at baseline were highly correlated (r = 0.97, P < 0.00001). Gender, education, marital status, age, and ethnic ancestry of grandparents were assessed by self-report during the pretreatment assessment visit. The Center for Epidemiologic Studies Depression Scale was used to assess the severity of depression symptoms (21).
Explanatory Variable: COMT Genotype. Using methods described above, participants were classified as homozygous for the H (high activity) allele, heterozygous, or homozygous for the low activity (L) allele. Based on our hypotheses, we characterized participants according to the presence (H/H or H/L) or absence (L/L) of the high activity allele.
Statistical Methods
Statistical examination of the data from the two studies included the cross-tabulation of genotypes by study to ensure comparability of the prevalence of the COMT H allele in the two studies. Tests of Hardy-Weinberg equilibrium (HWE) were used to test whether the genotypic information conformed to the Hardy-Weinberg principles. Two-sample t tests assuming unequal variances were used to compare the two samples of smokers with regard to baseline nicotine dependence, age, depression symptoms, and self-reported body mass index (BMI). Analysis of covariance (ANCOVA) models were used to test for an association between the COMT locus and the baseline Fagerstrom Test for Nicotine Dependence. Specifically, a variable added-last test was conducted to determine if presence of the H allele was significantly associated with nicotine dependence in the presence of and controlling for baseline depression symptoms as measured by the Center for Epidemiologic Studies Depression Scale, baseline self-reported BMI (kg/m2), and age (22). The covariates were included in the models because previous research had indicated associations of these variables with nicotine dependence and smoking cessation (23, 24). All covariates were standardized to have a mean of zero and SD of one to improve interpretability of the estimated variables. In addition to the analysis of the COMT locus, we examined hypotheses for the DRD2 Taq1 locus and the DAT VNTR locus (results not included). For all models, there was only one phenotype under investigation, nicotine dependence. To account for the multiple models built to address different loci, Bonferroni corrections were used to maintain experiment wise error at or below 0.05. The effect size attributable to the presence of the H allele at the COMT locus was estimated by the difference in the coefficient of the determination (R2) for the regression model containing baseline BMI, depression symptoms, age, and an indicator variable for the presence of the H allele and the coefficient of the determination (R2) for the regression model containing only baseline BMI, depression symptoms, and age (i.e., semipartial R2). Influence diagnostics, specifically Cook's D, were calculated to examine whether any individual observation exhibited undue influence upon the fitting of the ANCOVA models.
To examine for confounding due to population substructure, a method recommended by Pritchard and Rosenberg (13) was used. This method uses the sum of K single degree of freedom (df) Pearson
2 test statistics calculated from the cross-tabulation of case-control status and the frequency of alleles at K independent random markers. Evidence of substructure is indicated if the observed sum is an unlikely result from a
2 distribution with K degrees of freedom. Cases were defined as study participants with baseline Fagerstrom scores total greater than or equal to 6 points, indicating high levels of nicotine dependence (25). To assess false-positive report probability, the method proposed by Wacholder et al. (26) was employed.
All analyses were conducted using SAS 9.0 (Cary, NC).
| Results |
|---|
|
|
|---|
2 = 1.42, df = 1, P = 0.23) of deviations from the principles of Hardy-Weinberg.
|
2 = 39.87, df = 43, P = 0.61). Based upon the observed P associated with the presence of the H allele and assuming a range of prior probability from 0.25 to 0.001 that the COMT locus affects nicotine dependence, false-positive report probability was calculated using the method of Wacholder et al. (26). If one assumes the prior probability of a true effect to be 0.25, 0.1, 0.01, or 0.001, the false-positive report probability is 0.05, 0.13, 0.63, or 0.95, respectively.
|
2 = 20.93, df = 1, P < 0.0001) of deviations from the principles of Hardy-Weinberg. Examinations of genotype frequencies revealed that the sample contained more HH and LL genotypes and fewer HL genotypes than expected under the principles of HWE.
Association of COMT Alleles with Nicotine Dependence within Study 2
Given the lack of influential observations and lack of evidence of population substructure within the initial study, the association between the presence of the H allele and nicotine dependence was examined within an independent set of 443 individuals. The necessary outcomes, covariates, and genotypes on these individuals were readily available due to their participation in a parallel nicotine replacement study. Analysis of the study 2 data found the presence of the H allele was not significantly associated with nicotine dependence (P = 0.6418) in the presence of the standardized baseline depression symptoms (P = 0.0132), self-reported BMI (P = 0.2956), and age (P = 0.0007). Details of the model are provided in Table 2.
Given the failure to replicate the association, influence diagnostics were examined to ensure that the lack of significance could not be attributed to masking due to the influence of outliers or leverage points. Two observations within the replication study exhibited moderate influence relative to other observations within the analysis, removal of these two points actually decreased the association between the presence of the H allele and nicotine dependence. Because confounding due to unrecognized population substructure can also mask associations within genetic association studies (27), the 43 random markers were again used to determine if substructure existed among the 443 subjects. Using the hypothesis test proposed by Pritchard and Rosenberg (13), no evidence of substructure was found (
2 = 40.51, df = 43, P = 0.58).
Comparison of the Two Study Samples
To investigate potential reasons for the discrepancy of the results of the two analyses, comparisons of the genotype frequencies, allele frequencies, and means of age, BMI, depression symptoms, and nicotine dependence were conducted. Genotype frequencies for COMT did not differ between the two studies (
2 = 5.31, df = 2, P = 0.07). The frequency of the H allele did not differ between the two studies (
2 = 0.25, df = 1, P = 0.62). To examine whether the study groups differed with regard to the 43 random markers, a modified version of Pritchard and Rosenberg's test was used. Instead of the cross-tabulating case-control status and the frequency of alleles at K independent random markers, we cross-tabulated a study indicator and the frequency of alleles at K independent random markers and summed the K independent
2 statistics. Evidence of differing population structure between the two studies is indicated if the observed sum is an unlikely result from a
2 distribution with K degrees of freedom. However, this approach revealed no evidence of differing population structure (
2 = 46.4, df = 43, P = 0.34).
Significant mean differences exist between the two samples with regard to nicotine dependence (P = 0.0011), age (P = 0.0054), and BMI (P < 0.0001). On average, the participants of the initial study were older, heavier, and more nicotine dependent than participants within the replication study. Results of other comparisons are contained in Table 1. To examine if these differences could account for the discrepancy in association results, an ANCOVA model on the pooled data was constructed.
Pooled Analysis
Given the different distribution of covariates between the two studies and the lack of population substructures within the data, a pooled ANCOVA analysis was constructed to adjust the analyses for differences in covariates. Regression analysis of the pooled samples indicated the presence of the H allele at the COMT locus was significantly associated with nicotine dependence (P = 0.0345) in the presence of the standardized baseline depression symptoms (P = 0.0002), self-reported BMI (P = 0.0096), age (P < 0.0001), and an indicator for initial study membership (P = 0.0089).
Pooling the data from the two studies allows for tests of effect heterogeneity (i.e., the effect of covariate adjustment between the two studies is different or the measured effect of the H allele at the COMT locus differed between studies). To further explore the pattern of results, ANCOVA models using the pooled data and including interactions between COMT and study indicator were developed. A significant association between COMT and nicotine dependence (P = 0.0068) and a nearly significant interaction between marker and study (P = 0.0795) were observed. There was no indication that the effects of age (P = 0.8891), BMI (P = 0.5812), and depression (P = 0.4366) upon baseline nicotine dependence differed between studies.
Statistical Power
After finding that population substructure and influential observations did not account for the results of the studies, we focused our attention upon statistical power. Specifically, we seek to estimate the power of the second study to replicate the statistically significant finding of the first study.
Arguably, there are two separate approaches that can be used to estimate statistical power within a replication study. The first approach, documented in Taylor and Muller (28), is to use the estimated residual variance and estimated effect size from the initial study in the calculation of the power. However, because both the estimated residual variance and estimated effect size are observed random variables, statistical power within the replication study becomes a random variable. Hence, the question becomes not what is the power of the replication study but what is the likely range of power for replication studies of fixed sample size based upon random estimates. Based upon the estimates of the initial study [residual variance (controlling for age, depression, and BMI) of 4.05 and effect size of 2%], power to replicate the initial findings with a fixed sample size of 443 ranges would likely range from as low as 20.9% to 100% (95% confidence interval). Ideally, this interval would have a much narrower range with the lower bound closer to 80%.
The second approach, documented in Taylor and Muller (29) is to a priori determine the minimum effect size of scientific relevance worth detecting and use only the estimated residual variance to estimate power in a replication study. Given nicotine dependence is a complex phenotype and variability in one locus is unlikely to explain a medium to large amount of the variability, an increase in R2 of 1% might be a reasonable minimum effect size. However, because the residual variance is still estimated, statistical power for the replication study would still be a random variable. Based upon the estimates of the initial study [residual variance (controlling for age, depression, and BMI) of 4.05] and assuming a priori scientifically relevant effect size of 1%, power to replicate the initial findings with a fixed sample size of 443 would likely range from as low as 53% to 65% (95% confidence interval). Table 3 presents estimated power for a range for other possible effect sizes.
|
| Discussion |
|---|
|
|
|---|
Both studies reported here were conducted by the same investigational team. Although designed to test different smoking interventions, both studies report the phenotypic outcome (baseline nicotine dependence) using the Fagerstrom Test for Nicotine Dependence before treatment. Genotyping for both studies was conducted by the same lab under the same quality control measures. Therefore, the phenotypic and genotypic measures between the two studies can be considered comparable. With regard to equivalence of samples, there were baseline differences between the samples. On average, the participants of the initial study were older, heavier, and more nicotine dependent. Because interactions between a study indicator and these covariates were not significant, it seems unlikely that these differences in covariate levels between studies explain the observed discrepancy in the observed tests of association between COMT and nicotine dependence. With regard to COMT genotype frequency and prevalence of the H allele, the two studies seem comparable. However, a test of HWE for the COMT locus was rejected for the replication study. Typically, rejection of HWE implies nonrandom mating or recent immigration. Given the reported results from the test of population substructure, these are unlikely interpretations. Given the quality control standards used and a review of genotyping results, we do not believe the rejection of HWE is due to genotyping errors. Another possible reason for the observed deviation from HWE is the violation of the assumption of random sampling from the population under investigation, which is required for the test of HWE to be valid. Both studies used nonrandom self-selecting samples defined by treatment-seeking smokers responding to local media advertisements. Other possible reasons for the observed rejection of HWE include genetic selection, assortative mating, or a type I error. However, we cannot definitively ascribe the apparent departure from HWE to any single cause.
With regard to the initial study, it is possible that the finding is a type I error. Within the study we did use type I error control methods (i.e., Bonferroni corrections). We examined specifically three hypothesis tests for three different loci (COMT, DRD2 Taq1, and DAT VNTR). Therefore, under the statistical assumptions of the hypothesis tests, there is an experiment-wise chance of no more than 5% that we have committed at least one type I error. As recommended by a reviewer, we estimated the false-positive report probability as detailed in Wacholder et al. (26). We would like to note that we agree with many of the criticisms presented by Thomas and Clayton (30) of the method. However, calculation of the false-positive report probability did not indicate that the results of the initial study were highly likely to be a false-positive result assuming a prior probability of 0.01.
We examined the power of our replication study to detect the association observed within the initial study. This calculation proved to be complex and must account for the facts that the effect size and variance are estimated from a previous study. As indicated in Table 3, if the true effect of the COMT locus was slightly less than the estimated effect in the initial study, then the method of Taylor and Muller (29) indicates that the replication study was likely underpowered. Statistical power within the replication study may have been adversely affected by both measurement error in the phenotype as well as measurement error in the depression covariate. Therefore, we cannot rule out the possibility of a type II error within the replication sample. A future study designed to detect an effect size of partial R2 equal to 1% based upon the information estimated from our initial study would require 850 individuals to be 95% confident that the power was at least 80%.
To address possible confounding due to population stratification, we measured 43 biallelic SNPs on the study participants in both studies and used the method of Pritchard and Rosenberg (13) to test for population substructure. Examination of evidence of substructures within the first study was warranted due to the possibility that the detected association between nicotine dependence and the COMT locus was spurious due to confounding by population substructure. Examination of evidence of substructures within the second study was warranted due to the possibility that the association between nicotine dependence and the COMT locus was masked due to confounding by population substructure. However, no evidence of population stratification was found. Although no evidence indicating population stratification was found, the possibility of confounding due to population substructure cannot be absolutely excluded. If more SNPs were available, our ability to detect possible population substructures would increase (13). However, given the lack of evidence of substructure provided by the analysis of the 43 available SNPs, we believe confounding due to population stratification studies is not a likely explanation for the observed discrepancy in the observed tests of association. Furthermore, examination of influential observations indicated that the different results from the studies were not attributable to outliers or leverage points.
Pooling the data from the two studies together into one ANCOVA model provided marginal evidence of effect heterogeneity between the two studies. Given that the participants of the two studies were recruited from different geographic locations, this marginal P may indicate that the observed associations between the high activity allele at the COMT locus and nicotine dependence is moderated by some unobserved environmental factor confounded between studies. Based upon the analyses done, we believe a likely explanation of our results is either a type II error in the replication study or possibly effect heterogeneity. Other possible explanations, which we believe to be less likely, include a type I error in the initial study or confounding due to undetected population stratification.
| Conclusions |
|---|
|
|
|---|
| Acknowledgments |
|---|
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Received 9/ 1/04; revised 1/10/05; accepted 2/10/05.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Quaak, C. P. van Schayck, A. M. Knaapen, and F. J. van Schooten Genetic variation as a predictor of smoking cessation success. A promising preventive and intervention tool for chronic respiratory diseases? Eur. Respir. J., March 1, 2009; 33(3): 468 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. C. Johnstone, K. M. Elliot, S. P. David, M. F.G. Murphy, R. T. Walton, and M. R. Munafo Association of COMT Val108/158Met Genotype with Smoking Cessation in a Nicotine Replacement Therapy Randomized Trial Cancer Epidemiol. Biomarkers Prev., June 1, 2007; 16(6): 1065 - 1069. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |