
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Short Communication |
1 Center for Clinical Epidemiology and Biostatistics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania; 2 Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland; 3 Health Services Research, Vanderbilt University, Nashville, Tennessee; and 4 Department of Pharmacology and Toxicology and James Graham Brown Cancer Center, University of Louisville School of Medicine, Louisville, Kentucky
Requests for reprints: Montserrat García-Closas, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892. Phone: 301-435-3981; Fax: 301-402-0916. E-mail: garciacm{at}exchange.nih.gov
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
At the time of article submission, there were 29 reported NAT2 alleles (http://www.louisville.edu/medschool/pharmacology/NAT.html) encoding proteins with varying degrees of acetylation capacity. Each of the 29 NAT2 alleles possesses a combination of one to four SNPs at 13 sites within the 870-bp coding region. The majority of studies investigating the relationship between NAT2 genotype and disease risk use PCR-based assays that detect only three SNPs (C481T, G590A, and G857A) to infer NAT2 acetylation status. When none of these SNPs are present, wild-type NAT2*4, a high-activity (rapid) allele, is designated (4). Although several NAT2 SNPs are in linkage disequilibrium, assessment of only these three SNPs results in the misclassification of the following NAT2 low-activity (slow) alleles (NAT2*5C, NAT2*5D, NAT2*14A, NAT2*14B, NAT2*14E, NAT2*14F, NAT2*14G, NAT2*17, and NAT2*19) as NAT2*4, a high-activity (rapid) allele. Additionally, NAT2*11 and NAT2*12C, high-activity alleles, would be misassigned as NAT2*5B, a low-activity allele formerly designated as M1 (see Table 1 for allele descriptions).
|
| Methods |
|---|
|
|
|---|
NAT2 phenotypes were also assigned by assuming that a 3-SNP (C481T, G590A, and G857A) rather than the 11-SNP assay had been used. In both instances, individuals were classified as "R" if they possessed two high-activity alleles (NAT2*4, NAT2*11, NAT2*12A, NAT2*12B, NAT2*12C, NAT2*13, and NAT2*18), "I" if they possessed one of these alleles, and "S" if they possessed none. All genotype assignments were blinded to case-control status.
To compare "R," "I," and "S" phenotype assignments made by the 3-SNP assay relative to the 11-SNP assay (gold standard), a 3 x 3 misclassification table was created for controls from the case-control study of stomach cancer. Although recent data suggest that "R" and "I" are likely separate phenotypes (8-10), for simplicity, NAT2 acetylation status was dichotomized into "S" and "I/R" groups, and NAT2 misclassification probabilities (e.g., sensitivity and specificity) were determined. Given this bimodal phenotype model, misclassification of "I" as "R" or "R" as "I" could not be evaluated. To confirm that sensitivity and specificity values were not unique to this population, sensitivity and specificity were determined as described above for controls from a case-control study of breast cancer comprised of Caucasian women from Iowa (7) and an unpublished case-control study of prostate cancer comprised of 45% African-Americans.
Estimates for prevalence of smoking, prevalence of NAT2 acetylation status, OR of smoking (ORE), OR of NAT2 acetylation status (ORG), and the multiplicative genotype-smoking interaction parameter (
) were based on data from previously published European studies of NAT2, smoking, and bladder cancer that used the 3-SNP assay (11). Sensitivity and specificity were used to calculate expected parameters in the absence of misclassification (12). The expected values for these five parameters using the 11-SNP assay (gold standard) were calculated using formulas described in Garcia-Closas et al. (5). Sample sizes for these genotype-exposure interaction studies were estimated using the POWER software available at http://dceg.cancer.gov/POWER/.
| Results |
|---|
|
|
|---|
As shown in Table 2, agreement between the two genotyping assays for assigning "R," "I," and "S" phenotypes was very high among controls in the case-control study of stomach cancer. Relative to the 11-SNP assay, the proportion of individuals correctly classified as a slow acetylator by the 3-SNP method (i.e., sensitivity) was 94% (95% CI, 8996%), whereas the proportion of individuals correctly classified as a rapid or intermediate acetylator by the 3-SNP method (i.e., specificity) was 100% (95% CI, 98100%). Sensitivity and specificity values were comparable among controls from the breast cancer study (96% and 100%, respectively). Sensitivity was much lower (83%) for the multiracial prostate cancer controls but increased to 93% when the G191A SNP was added to the assay (data not shown). This SNP is unique to the NAT2*14 cluster, common among African-American and Hispanic populations (4). Interestingly, of the 16 acetylator phenotypes that were misclassified in the stomach cancer controls, all were due to the NAT2*5C (T341C, A803G) allele, whereas 94% of the misclassification in the breast cancer controls was due to NAT2*5C (data not shown). In both of these Caucasian case-control studies, the NAT2*5C allele frequency was
2% among controls.
|
= 1.65), 1,444 cases and 1,444 controls would be required detect a genotype-smoking interaction OR of 1.65 at 80% power and
= 0.05. After adjusting these parameters for sensitivity and specificity, the joint effects OR remained practically unchanged (observed 3.57 versus expected in the abscence of misclassification 3.63), but
increased from 1.65 to 1.78. Thus, in the absence of genotype misclassification (i.e., using the 11-SNP assay rather than the 3-SNP assay), sample size to detect genotype-smoking interaction would have been reduced to 1,121 cases and 1,121 controls. This corresponds to a 22% decrease in sample size. | Discussion |
|---|
|
|
|---|
Another way that misclassification can be reduced is by determining all SNPs that are relevant to inferred phenotype, as we have shown in this example. Similarly, it is important to screen for all SNPs that are relevant to the race/ethnicity of the sample population. The 3-SNP NAT2 assay was designed to detect the most frequently occurring NAT2 alleles in Caucasian populations, so it was no surprise that its sensitivity was high among our Caucasian controls. The 3-SNP assay, however, performs more poorly in other racial/ethnic groups as shown in the control population that included a high percentage of African-Americans. Based on 11-SNP screening of 950 alleles, we found that seven SNPs (G191A, C282T, T341C, C481T, G590A, A803G, and G857A) explained 100% of the alleles that were detected. Therefore, we recommend that these seven SNPs be screened in Caucasian and African-American populations to accurately infer NAT2 acetylator phenotype. A TaqMan assay, which costs less than one dollar per SNP, has recently been developed for this purpose (13). It is important to note that the number of SNPs that need to be determined to attain high accuracy in phenotype assignments may vary depending on the ethnic background of the population under study because of SNP prevalence across ethnic groups. See http://snp500cancer.nci.nih.gov for useful information on NAT2 SNP frequencies in four subpopulations; unfortunately, comprehensive NAT2 SNP screening has not been done in many ethnic groups. Until then, we recommend that at least seven NAT2 SNPs be screened in most populations, especially given the relatively low cost of genotyping and the potential for population admixture.
Although our 11-SNP assay is comprehensive, allele (or haplotype) assignment can sometimes be ambiguous. For example, an individual who is typed as a heterozygote at nucleotides 341 and 803 may be a NAT2*5D/NAT2*12A if both SNPs reside on separate alleles or a NAT2*5C/NAT2*4 if both SNPs are located on the same allele. Because NAT2 polymorphisms are well characterized, it is possible to collapse resulting genotypes into inferred phenotype categories. In this case, both genotypes result in the assignment of "I" phenotype. When function is largely unknown, however, correct allele/haplotype assignment is critical. Recent advances in high-throughput genotyping should facilitate comprehensive SNP screening of other highly polymorphic loci, such as NAT1 and CYP2D6.
Our results indicate that, despite relatively small errors in NAT2 phenotype assignments and small biases in OR estimates, substantial decreases in sample size required to detect genotype-exposure interaction can be attained using the 11-SNP NAT2 genotyping assay rather than the 3-SNP assay. Given the expense associated with enrolling subjects in molecular epidemiologic studies, reducing genotype misclassification is likely to result in substantial reduction in study costs. In addition, reducing genotype misclassification will reduce the bias in the estimated parameters.
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: Three new SNPs have been identified in the human NAT2 coding-region, resulting in 7 additional NAT2 alleles. Of these 16 SNPs, we continue to recommend screening for the seven most common: G191A, C282T, T341C, C481T, G590A, A803G, and G857A.
Received 4/14/03; revised 4/11/04; accepted 4/20/04.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. Jiao, M. A. Doll, D. W. Hein, M. L. Bondy, M. M. Hassan, J. E. Hixson, J. L. Abbruzzese, and D. Li Haplotype of N-Acetyltransferase 1 and 2 and Risk of Pancreatic Cancer Cancer Epidemiol. Biomarkers Prev., November 1, 2007; 16(11): 2379 - 2386. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zang, M. A. Doll, S. Zhao, J. C. States, and D. W. Hein Functional characterization of single-nucleotide polymorphisms and haplotypes of human N-acetyltransferase 2 Carcinogenesis, August 1, 2007; 28(8): 1665 - 1671. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Zhu, D. W. Hein, M. A. Doll, K. K. Reynolds, N. Abudu, R. Valdes Jr, and M. W. Linder Simultaneous Determination of 7 N-Acetyltransferase-2 Single-Nucleotide Variations by Allele-Specific Primer Extension Assay Clin. Chem., June 1, 2006; 52(6): 1033 - 1039. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rabstein, K. Unfried, U. Ranft, T. Illig, M. Kolz, H.-P. Rihs, C. Mambetova, M. Vlad, T. Bruning, and B. Pesch Variation of the N-Acetyltransferase 2 Gene in a Romanian and a Kyrgyz Population Cancer Epidemiol. Biomarkers Prev., January 1, 2006; 15(1): 138 - 141. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Cell Growth & Differentiation |