
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Department of Public Health and Epidemiology, School of Medicine, University of Birmingham, Birmingham, United Kingdom; 2 Department of General Practice, Comprehensive Cancer Institute Limburg, Catholic University of Leuven, Leuven, Belgium; 3 Department of Epidemiology, Faculty of Health Sciences, Maastricht University, Maastricht, Netherlands; 4 Department of Urology and Epidemiology, University Medical Centre Nijmegen, Nijmegen, Netherlands; 5 Department of Urology, School of Medicine, University of Miami, Miami, Florida and 6 Human Genetics Program, School of Medicine, New York University, New York, New York
Requests for reprints: Maurice P. Zeegers, Department of Public Health and Epidemiology, School of Medicine, Public Health Building, University of Birmingham, Birmingham B15 2TT, United Kingdom. Phone: 44-121-414-6721; Fax: 44-121-414-7878. E-mail: M.P.Zeegers{at}bham.ac.uk
| Abstract |
|---|
|
|
|---|
Method: Publications were identified through database searches for epidemiologic studies published until February 2004. For each study, mean differences in repeat length between cases and controls were calculated as well as continuous odds ratios (OR) per one CAG or GGN repeat decrement and discrete ORs to compare prostate cancer risk in men with short CAG repeats (
21 repeats) versus long CAG repeats (>21 repeats) and short GGN repeats (
16 repeats) versus long GGN repeats (>16 repeats). The study-specific estimates were combined by random effects metaregression analyses.
Results: Nineteen case-control studies were included in this review comprising a total of 4,274 cases and 5,275 controls. Prostate cancer cases had on average 0.26 fewer CAG repeats and 0.09 fewer GGN repeats than controls. The continuous ORs of prostate cancer per one repeat decrement were 1.02 and 1.01 for CAG and GGN repeats, respectively. The summary discrete OR (95% confidence interval) were 1.19 (1.07-1.31) and 1.31 (1.06-1.61) for CAG and GGN repeat polymorphisms, respectively.
Conclusion: Although the presence of shorter repeats seemed to be modestly associated with prostate cancer risk, the absolute difference in number of repeats between cases and controls is <1 repeat. We question whether such a small difference is enough to yield measurable biological impact in prostate carcinogenesis.
| Introduction |
|---|
|
|
|---|
Global ethnic variation contributes to the explanation of the substantial differences in prostate cancer incidence existing throughout the world, being highest in Western countries, especially among African American men, and lowest in developing countries. However, little is known about why certain populations are more susceptible for developing prostate cancer. Some have suggested that varying levels of androgens across ethnic groups may be responsible for these differences (2).
In our recent meta-analysis of 32 population-based studies, we have shown that family members of a prostate cancer patient experience a 2.46-fold increased risk of prostate cancer (3). The nature of this familial clustering is such that the risk is higher for family members with a brother with prostate cancer compared with those with an affected father.
Both observations of ethnic and familial clustering provided the rationale to study whether polymorphisms in the human androgen receptor (AR) gene located at the X chromosome (Xq11.2-q12) may explain this variation in susceptibility to prostate cancer. The AR gene comprises eight exons that encode for four functional domains including the amino-terminal transcription activation (transactivation) domain, the DNA binding domain, a hinge region, and the carboxyl-terminal ligand binding domain (4). The DNA binding domain and ligand binding domain are highly conserved through evolution, whereas the transactivation domain, which regulates expression of target genes, is highly polymorphic. Within this latter domain, there are three microsatellite trinucleotide repeats, two of which are polymorphic in length. Variability in the reported molecular size of the AR is due in part to the upstream CAG and downstream GGN repeat polymorphisms, encoding polyglutamine and polyglycine regions, respectively (4-6).
Several studies have suggested that one or both of the trinucleotide repeat polymorphisms in AR may be associated with prostate cancer (7-41). However, to our knowledge, no meta-analysis of all previous studies on this relationship has been conducted thus far. Earlier narrative reviews (4, 42-47) have concluded that prostate cancer risk tend to be highest for men with shorter (CAG)n or (GGN)n alleles. Inconsistency between study results has been ascribed to small sample sizes (low power) or poor research strategies for those studies not observing an association. Although these reviews reported study-specific relative effects [i.e., relative risk or odds ratio (OR)], they did not attempt to quantify these in a summary measure. In addition, the reviews did not report on the absolute mean difference (MD) in number of CAG or GGN repeats between prostate cancer cases and controls.
The objective of this study is to increase power by meta-analyzing all epidemiologic studies up to February 2004 to provide both additive and relative quantitative summary estimates of the association between (CAG)n and (GGN)n repeat polymorphisms in the AR gene and prostate cancer risk and to evaluate changes in summary estimates by differences in study characteristics.
| Materials and Methods |
|---|
|
|
|---|
For inclusion in this analysis, the articles had to present data to estimate an OR or MD and the corresponding standard error (SE), which compares prostate cancer cases with unrelated controls on variation in CAG or GGN repeat length in the AR gene.
A distinction was made between an article and a study. In case the same study population was referred to in more than one article, these articles were considered as part of a single study. On the other hand, if an article presents data of different groups of research subjects, the results of these analyses were seen as separate studies.
Qualitative Data Extraction
Four reviewers (M.Z., L.K., A.N., and H.O.) extracted qualitative information from each article. They independently assessed the following qualitative items: design (retrospective case-control or nested case-control); population characteristicsethnic label (Hispanic, Caucasian, Asian, or African) and geographic area (country of study); patient characteristicsnumber, age (years), and histologic grade prostate cancer (percentage advanced prostate cancer, T3-T4, M0; T0-T4, M1); and control characteristicsnumber, age (years), and disease status [healthy or benign prostatic hyperplasia (BPH)]. When continuous data were presented across different subgroups, we calculated a weighed average using the median of each subgroup weighed by the frequency of observations in these strata. For example, if a population's age distribution is presented in strata of 10 younger (61-65 years) men and 15 older men (66-70 years), the average age of the population is estimated to be (10 x 63 + 15 x 68) / 25 = 66 years. When disagreement existed on an item, it was discussed until consensus was reached. These qualitative items are used to provide covariables for inclusion in metaregression models to explore reasons for observed heterogeneity in results between the studies.
Quantitative Data Extraction
Two reviewers (M.Z. and H.O.) independently extracted data, allowing us to calculate mean number of repeats in cases and controls, MDs in numbers of repeats between cases and controls, ORs of prostate cancer for short versus long repeats, and ORs of prostate cancer per decrement of one repeat.
When mean number of repeats and corresponding SD for cases and controls were not reported, these were calculated using published tables with frequency distributions of repeats for cases and controls. When these tables were not published, these were reconstructed by measuring published frequency graphs. When no information was available to calculate the SD, it was estimated as a quarter of the repeat range or calculated by using the P of the unpooled t test comparison of means between cases and controls:
, where Z is the corresponding Z score of the P of the unpooled t test, ncases is the number of cases, and ncontrols is the number of controls. The MD in both CAG and GGN repeats were calculated as MD = mcases mcontrols, where mcases is the mean repeat number among cases and mcontrols is the mean repeat number among controls. The SE of the MD was calculated by
, where VARcases is the squared SD for cases and VARcontrols is the squared SD for controls.
We extracted discrete ORs (ORd) to compare prostate cancer risk in men with short CAG repeats (
21 repeats) versus long CAG repeats (>21 repeats) and short GGN repeats (
16 repeats) versus long GGN repeats (>16 repeats). The cutoff values were chosen because these were the pooled median value of repeats in the control populations. If these could not be recapitulated, other nearby cutoff points were used. Preferably, ORd and corresponding SE of the ln(ORd) were extracted directly from the original reports. For studies that reported separate ORs for more than two strata of numbers of repeats, we combined the repeat-specific ORs by using the prevalence of the controls as weight to calculate ORd (48). If these were not available, two-way contingency tables were constructed for each study based on published or reconstructed exposure frequency distributions to calculate the ORd. The SE of latter ln(ORd) was calculated using the method of Woolf (49).
Continuous ORs (ORc) per one CAG or GGN repeat decrement (OR1) were preferably extracted directly from the published articles. ORs per one repeat increment were inverted. If ORc and corresponding SE were not presented, they were derived directly from the estimated mcontrols, mcases, and VARcontrols using
and
according to the method of Chene and Thompson (50).
Statistical Analysis
To detect publication or related biases, we explored heterogeneity in funnel plots [i.e., plots of MD against their estimated precision (reciprocal of the variance)]. We examined funnel plot asymmetry visually and measured the degree of asymmetry by using the Egger et al. (51) unweighed regression asymmetry test.
Because of potential heterogeneity between studies, we estimated the summary MD (SMD), summary ORd, and summary ORc together with their corresponding 95% confidence intervals (CI) with random effects meta-analysis by using the Stata statistical software (52). The between-study variance was estimated by a noniterative procedure using a method of moments estimator. To explore reasons for heterogeneity in MD, we used three methods: (a) we calculated the I2 statistics (i.e., the percentage of variability in point estimates that is due to heterogeneity; ref. 53) for each summary measure of CAG or GGN repeat polymorphisms and across specific strata based on different levels of the extracted study characteristics. Heterogeneity exists when I2 within unstratified analyses is higher than within stratified analyses; we did (b) metaregression analyses by including the study characteristics as covariates in the regression model (statistical significance indicates an effect modifier) and (c) stratification.
| Results |
|---|
|
|
|---|
|
7 across all studies was 32.6% (data not shown). The average mean (SD) age of controls was 63.7 (6.7) years ranging from 54.0 to 73.6 years. Most studies selected healthy controls, although three studies selected controls with BPH (12, 20; Table 1).
Publication Bias
We could not identify heterogeneity in funnel plots neither visually (Fig. 1) nor in terms of statistical significance (P = 0.39 and 0.96 for CAG and GGN repeats, respectively).
|
21 CAG repeats to men with >21 CAG repeats was 1.19 (1.07-1.31) based on the results of 22 studies (7-26, 40, 41). The ORc (95% CI) of prostate cancer per one CAG decrement was 1.02 (0.99-1.06) based on the results from the same studies that were used to calculate the SMD (7, 8, 12-17, 19-27, 40; Fig. 2).
|
16 GGN repeats seemed to have a 31% higher risk of prostate cancer than men with >16 GGN repeats (ORd, 1.31; 95% CI, 1.06-1.61). This estimate is based on the results of eight studies (8-11, 15, 16, 23, 25-27). The ORc (95% CI) of prostate cancer per one GGN decrement was 1.01 (0.98-1.04) based on the results from the same studies that were used to calculate the SMD (8, 14-16, 23, 25-27; Fig. 2).
Heterogeneity
We further examined heterogeneity in study results by categories of study characteristics including study design, ethnicity of study population, country of study, age of cases and controls, severity of disease, and type of disease status of controls. The definitions of the specific categories can be found in Table 2. The analyses were limited to studies reporting the association between number of CAG repeats and prostate cancer risk only, because the data on GGN repeats were too sparse. Except for the ethnicity of the study population (P = 0.02), none of the study characteristics seemed to be a statistically significant interaction term in metaregression analyses. However, quantitative differences in stratified MDs in CAG repeats were found. The MD in number of CAG repeats between cases and controls seemed to be larger in retrospective (0.31) versus prospective (0.08) case-control studies, non-Caucasian (0.67) versus Caucasian (0.10) study populations, the United States (0.046) versus other countries (0.16), younger (0.55) versus older (0.17) cases, more (0.29) versus less (0.18) cases with advanced prostate cancer, younger (0.48) versus older (0.18) controls, and healthy (0.35) versus BPH (0.01) controls. However, within all subgroups, still a considerable amount of the variation (17-93%) could be explained by random heterogeneity. Furthermore, none of the stratified results showed a meaningful difference in CAG repeat between cases and controls. All differences were <1 repeat (Table 2).
|
| Discussion |
|---|
|
|
|---|
In this meta-analysis, we conclude that the relative epidemiologic associations derived from previous literature confirm these biological expectations. We have shown that men with shorter CAG and GGN repeats experience a 1.19- and 1.31-fold increased risk of prostate cancer, respectively, although these increased risks should be interpreted as being very small. It should be noted that it is possible that CAG and GGC repeats are in linkage disequilibrium and it should therefore be considered that effects seen at one microsatellite might reflect effects at the other.
The results from the heterogeneity analyses were to some extent consistent with previous literature. For example, the association between CAG repeat polymorphism and prostate cancer seemed to be somewhat larger when controls were compared with cases with more advanced prostate cancer or with cancer that was diagnosed at an earlier age. According to Giovannucci (44), this could suggest that a heightened state of androgenicity directly influences a pool of relatively aggressive, early-onset androgen-driven prostate cancers. Possibly, prostate cancers that occur at older ages may be less driven by androgenicity and more related to pathologic processes such as oxidative insults (44). In addition, the MD in number of CAG repeats between cases and controls seemed to be larger for non-Caucasians and study populations from developing countries. Black males have a population average of 18 repeats, whereas Caucasian men have on average 20 repeats (47). Combining this with the fact that there is an inverse relationship between CAG length and AR transactivation function, such that the shorter the CAG, the better the AR is at activating gene transcription (68), Coetzee and Ross (57) hypothesized that the racial differences seen in prostate cancer susceptibility may be due in part to variability in the AR CAG repeat length (4). However, because the association between AR repeat length polymorphisms and prostate cancer is small, other factors must be involved as well.
The incidence of prostate cancer has increased in several countries because the availability of prostate-specific antigen testing in the late 1980s in the United States and in the early 1990s in other countries. As pointed out by Giovannucci (44), preprostate-specific antigen era patients might differ significantly from postprostate-specific antigen era patients. However, in this meta-analysis, all studies have been conducted after the introduction of prostate-specific antigen screening. Because prostate-specific antigen screening of the general population is of more common use in the United States and less often recommended and carried out by other countries, the larger SMD in CAG repeat polymorphism between prostate cancer cases in the United States compared with other countries may be attributed to differences in screening policy.
Although previous epidemiologic studies have consistently reported that the magnitude of relative association between AR repeat polymorphisms and prostate cancer is relatively limited, most narrative reviews to date have suggested a positive association between shorter CAG or GGN repeat lengths and prostate cancer. Some have argued that even a modest increased risk could still be of importance from a public health perspective because of the large population frequency of short repeat lengths (46). Based on this reasoning, Nelson and Witte (46) suggested that this potentially low-penetrance, high-frequency polymorphism could theoretically account for many more prostate cancer cases than a high-penetrance, low-frequency mutation for which the etiologic fraction is estimated to be 9%. This may be true on a population scale for the GGN polymorphism but not for the CAG polymorphism. We have estimated that the etiologic fraction is 13.4% and 8.7% for GGN and CAG repeat polymorphisms, respectively, assuming a short repeat frequency of 50% (because median cutoff values were used) and the reported summary ORd.
However, our analyses also show that although cases seemed to have shorter repeats and shorter repeats seemed to be modestly associated with prostate cancer risk on a relative scale, the absolute difference in number of repeats between cases and controls is <1 repeat. The SMDs in number of CAG and GGN repeats were 0.27 and 0.09, respectively. Although the direction of results in the heterogeneity analysis seemed to be consistent with previous literature, none of the stratified differences between cases and controls exceeded one repeat. Based on these results, we question whether an average MD of <1 repeat is enough to yield measurable biological impact in prostate carcinogenesis.
| Footnotes |
|---|
Received 4/20/04; revised 5/11/04; accepted 5/28/04.
| References |
|---|
|
|
|---|
variants in prostate cancer families. Int J Cancer 2002;98:93842.[CrossRef][Medline]
-reductase genes and prostate cancer prognosis. Prostate 2002;52:26978.[CrossRef][Medline]
-reductase genes in prostate enlargement. J Urol 2001;166:15604.[CrossRef][Medline]
This article has been cited by other articles:
![]() |
L. M. Dong, J. D. Potter, E. White, C. M. Ulrich, L. R. Cardon, and U. Peters Genetic Susceptibility to Cancer: The Role of Polymorphisms in Candidate Genes JAMA, May 28, 2008; 299(20): 2423 - 2436. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Shah, H. J. Antoine, M. Pall, K. D. Taylor, R. Azziz, and M. O. Goodarzi Association of Androgen Receptor CAG Repeat Polymorphism and Polycystic Ovary Syndrome J. Clin. Endocrinol. Metab., May 1, 2008; 93(5): 1939 - 1945. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Davis-Dao, E. D. Tuazon, R. Z. Sokol, and V. K. Cortessis Male Infertility and Variation in CAG Repeat Length in the Androgen Receptor Gene: A Meta-analysis J. Clin. Endocrinol. Metab., November 1, 2007; 92(11): 4319 - 4326. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Zitzmann and E. Nieschlag Androgen Receptor Gene CAG Repeat Length and Body Mass Index Modulate the Safety of Long-Term Intramuscular Testosterone Undecanoate Therapy in Hypogonadal Men J. Clin. Endocrinol. Metab., October 1, 2007; 92(10): 3844 - 3853. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. F Brockschmidt, M. M Nothen, and A. M Hillmer The two most common alleles of the coding GGN repeat in the androgen receptor gene cause differences in protein function J. Mol. Endocrinol., July 1, 2007; 39(1): 1 - 8. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Alvarado, L. K. Beitel, K. Sircar, A. Aprikian, M. Trifiro, and B. Gottlieb Somatic Mosaicism and Cancer: A Micro-Genetic Examination into the Role of the Androgen Receptor Gene in Prostate Cancer Cancer Res., September 15, 2005; 65(18): 8514 - 8518. [Abstract] [Full Text] [PDF] |
||||
![]() |
C Hughes, A Murphy, C Martin, O Sheils, and J O'Leary Molecular pathology of prostate cancer J. Clin. Pathol., July 1, 2005; 58(7): 673 - 684. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Cell Growth & Differentiation |