| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Julius Center for General Practice and Patient Oriented Research [F. J. B. v. D., O. L. v. d. H., P. A. H. v. N., P. H. M. P.] and Department of Medical Genetics [F. J. B. v. D., R. B. v. d. L.], University Medical Center, 3508 GA Utrecht, the Netherlands, and National Institute of Public Health and Environmental Protection, 3720 BA Bilthoven, the Netherlands [H. B. B. d. M.]
| Abstract |
|---|
|
|
|---|
20 years. This study was performed to determine the reproducibility and accuracy of N-acetyltransferase 2 (NAT2) genotyping by RFLP analysis using DNA from stored urine. To obtain long-term frozen urine and blood samples from the same person, the databases of two large prospective studies were linked by name and date of birth. Six polymorphisms within the coding region of NAT2 were determined in 65 urine and blood samples after which, genotypes and imputed phenotypes (rapid, slow) were derived. To test reproducibility, all of the six polymorphisms were determined twice in 47 urine-blood pairs. Reproducibility of imputed phenotypes was 91.5% in urine samples and 97.9% in blood samples. To test accuracy, results for all six polymorphisms were compared between urine and blood DNA. All of the
s were at least 0.85 except one. Identical results for all six polymorphisms were seen in 78.5% of urine-blood pairs. Taking blood samples as a reference standard, rapid acetylators were classified as rapid in 97% of subjects (95% confidence interval, 90100%), and slow acetylators were classified as slow also in 97% of subjects (95% confidence interval, 91100%), when using urine. This study shows that stored urine samples can be used for DNA genotyping in large cohort studies, when blood samples are not available. | Introduction |
|---|
|
|
|---|
In field conditions common to epidemiological research, several factors can induce misclassification of genetic markers, such as a mix-up in sample collection, errors in processing, and laboratory factors, e.g., the failure to recognize a variant that contributes to the genotype of interest, false priming in PCR-based arrays, and the erroneous recognition of a pseudogene. Finally, coding errors at any point during sample collection, laboratory analysis, or data processing may occur (5) .
Urine samples have been used earlier as a source for DNA (4
, 6) . In a large prospective study (DOM3
; Ref. 7
), we collected urine samples (19741986) from thousands of women and stored them at -20°C for future use. During follow-up, some of these women were diagnosed with cancer. In a nested case-control study, we plan to isolate DNA from relevant study subjects. One of the interests is to study polymorphic low-risk genes in interaction with environmental factors and the occurrence of cancer. For this purpose, genes coding for enzymes involved in the metabolism of carcinogens, such as N-acetyltransferases, are of interest. These enzymes are genetically polymorphic. In the present study, we assessed the reproducibility and accuracy of NAT2 genotyping, with the RFLP method in DNA obtained from frozen urine samples (stored for
20 years), in 65 subjects from whom blood samples had also been collected.
| Materials and Methods |
|---|
|
|
|---|
30,000) who were born between 1911 and 1945 and were living in Utrecht and surroundings (the Netherlands), were enrolled between 1974 and 1986 for the DOM study. Apart from a screening mammography, lifestyle questionnaires were filled out and overnight urine samples were collected. Approximately 200 ml of urine was stored for each participant at 4°C for 8 h and was then transported to a -20°C freezer. The samples were kept at -20°C until DNA isolation was performed, which was
20 years later (range, 1123 years).
Some of the DOM participants were also included in the Prospect-EPIC study, which is a Dutch cohort participating in the EPIC study (8)
. This study included
17,000 women who were born between 1923 and 1947 and were living in Utrecht and surroundings (the Netherlands). Between 1993 and 1997, questionnaires were filled out and blood samples were collected, which were stored at -196°C. After fractionating, some (whole) blood remained and was stored at -20°C for every participant after 1995.
To identify persons with urine and blood samples, data of the DOM cohort and the Prospect-EPIC cohort were linked. Linkage was done on surname with initials and date of birth. This resulted in the identification of 67 persons who had donated both a urine and a blood sample.
DNA Isolation from Frozen Urine.
The urine samples were thawed overnight at room temperature and mixed vigorously, before 50 ml were removed. Cells and nuclei were centrifuged at 2,000 x g for 15 min and then washed twice in PBS at 2,000 x g for 15 min. After resuspension in 500 µl of PBS, cells and nuclei were spun down for 5 min. at 12,000 x g. The pellet was resuspended in 300 µl of cell lysis buffer and was incubated overnight at 55°C. Proteins were precipitated by the addition of 100 µl of protein precipitation solution (6 M NaCl) and the precipitate was pelleted by a 5-min centrifugation at 12,000 x g. The supernatant was treated with one volume absolute ethanol, and DNA was centrifuged at 12,000 x g for 5 min. The pellet was washed with 70% ethanol and centrifuged for 5 min at 12,000 x g. After drying, the DNA pellet was resuspended in 40 µl of TE [10 mM Tris, 1 mM EDTA (pH 7.6)]. Previously, we studied quality and quantity of DNA isolated from comparable urine samples (9)
. Because the range of DNA concentration was not normally distributed (3 of 48 samples were outliers), the 25th percentile was 4 ng/µl, the 50th percentile (median) was 14 ng/µl, and the 75th percentile was 28 ng/µl. This previous study showed that although DNA amount was variable, 2689% of the samples, depending both on the length of the PCR amplicon and on PCR conditions, yielded a visible PCR product. Furthermore, the percentage of visible PCR product on agarose gel was lower for low amounts of DNA (00.1 µl) than for higher amounts of DNA (0.120 µg; Ref. 9
).
DNA Isolation from Frozen Blood.
The blood samples were thawed overnight at 4°C and mixed vigorously. The DNA isolation was performed with a QIAamp DNA Mini kit (Qiagen, Leusden, the Netherlands) according to the manufacturers Blood and Body Fluid Spin protocol. The quantity of DNA extracted from blood was measured using a UV spectrophotometer. The concentration ranged from 12 ng/µl to 55 ng/µl. The mean was 27 ng/µl.
Fragment Analysis.
Because administrative twins exist, two polymorphic markers, D3S1263 and D20S119, were used for fragment analysis to determine whether the urine and blood samples were indeed from the same person. A 10-µl multiplex PCR was performed, which contained Perkin-Elmer Gold buffer [50 mM KCl, 15 mM Tris-HCl (pH 8.0)], 2.5 mM MgCl2, 0.2 mM each nucleotide (Pharmacia, Biotech, Uppsala, Sweden), 25 ng of each primer (Isogen Bioscience BV, Maarssen, the Netherlands), 0.4 units of Amplitaq DNA polymerase (Perkin-Elmer), and 2 µl of DNA from urine or blood. PCR conditions are given in Table 1
.
|
|
First, a standard PCR was performed. To yield enough PCR product for RFLP, the product of the first PCR was used as a template for a nested PCR. All of the PCR products of the standard PCRs were stored at 4°C and used later in another experiment to test the reproducibility. Two different sets of PCR primers were used to obtain NAT2 fragments of 230 and 524 bp for further analysis. All of the PCRs were performed in 25-µl volumes, which contained Perkin-Elmer buffer II [50 mM KCl, 10 mM Tris-HCl (pH 8.3)], 2.5 mM MgCl2, 0.2 mM each nucleotide (Pharmacia, Biotech), 0.2 mg/ml BSA, 10 pmol of each primer (Isogen Bioscience BV, Maarssen, the Netherlands), and 0.05 units of Amplitaq Gold polymerase (Perkin-Elmer). For PCRs, 2 µl of DNA from urine, 1 µl of DNA from blood, or 1 µl of PCR product was used. To generate a DdeI restriction site for the polymorphism at nucleotide position 341, a mismatch primer was used. Primer sequences and PCR conditions are shown in Table 1
.
The 230-bp fragment contains two NAT2 polymorphisms (C282T and T341C) and is digested with restriction enzymes FokI (0.92 units; New England Biolabs, Leusden, the Netherlands) and DdeI (2.3 units; Boehringer Mannheim, Mannheim, Germany). The four other NAT2 polymorphisms within the 524-bp fragment (C481T, G590A, A803G, and G857A) are determined with the use of restriction enzymes TaqI (4.6 U; New England Biolabs, Leusden, the Netherlands), KpnI, DdeI, and BamHI (2.3 units; Boehringer Mannheim). All of the digestion reactions were performed in 10 µl with the most optimal buffer for every restriction enzyme according to the manufacturer. Restriction products were electrophoresed in a 3% agarose gel and restriction fragment patterns were visualized by ethidium bromide staining. Expected basepair sizes are shown in Table 2
. The samples were classified for each polymorphism as homozygous wild type, heterozygous, or homozygous mutant by two observers independently (F.J.B.v.D. and O.L.v.d.H.). The concordance rate was 97%. The few differences in interpretation that occurred could always be resolved by discussion. To check this assay, five urine and blood pairs were sequenced. The sequence data of these samples were completely concordant with the RFLP data.
|
Statistical Analyses.
To assess reproducibility, PCRs for all six of the polymorphisms were done twice in 47 urine and blood samples. Observers were blinded for the initial allele determination. Unfortunately, 18 of 65 urine-blood pairs were excluded from the reproducibility tests because of an error that occurred during storage of the samples after the accuracy tests. Percentages of agreement between first and second RFLP results were computed.
To test accuracy, all of the six polymorphisms were determined in 65 samples of urine and blood. When determining the polymorphisms in blood samples, observers were blinded for initial polymorphism determination in urine samples. For each polymorphism, a weighted
was computed (10)
. Next, genotype and imputed phenotype were determined for each person based on DNA obtained from urine and subsequently based on DNA obtained from blood. Taking blood as the reference standard, the sensitivity rate (those correctly classified as a rapid acetylator) and specificity rate (those correctly classified as a slow acetylator) were computed for imputed phenotypes based on DNA extracted from urine. Next, we calculated the bias in the estimation of the RR caused by a misclassification of the imputed phenotype. For this, we assumed a (nested) case-control study, in which the classification of rapid or slow acetylator type was based on DNA obtained from urine samples. We may then assume that the misclassification is nondifferential, if the observers were blinded as to case or control status. The following formula (11)
was used to calculate the ORR as a function of the TRR, the true prevalence of the risk factor (Prev), and the sensitivity (Se) and specificity (Sp) of the assay.
![]() |
| Results |
|---|
|
|
|---|
Differences between the first and second time of NAT2 genotyping were seen in 8 (2.8%) of 282 polymorphism determinations (47 samples x 6 polymorphisms per sample). This resulted in a different genotype in 7 (14.9%) of 47 urine samples (6 samples differed in 1 polymorphism and 1 sample differed in 2 polymorphisms). Although different genotypes were seen in seven urine samples, three of these seven samples still showed the same imputed phenotype. Eventually, 4 (8.5%) of 47 urine samples showed a difference in imputed phenotype.
One (0.4%) of 282 polymorphisms in blood samples showed a difference, resulting in a different genotype and an imputed phenotype in 1 (2.1%) of 47 blood samples.
Accuracy.
Six NAT2 polymorphisms were determined for all 65 urine and blood samples. The percentages of agreement per polymorphism between urine and blood samples showed an average of 93%.
for each polymorphism was >85% for all but one (i.e., 66%). Allele frequencies per polymorphism for DNA obtained from urine and blood samples were comparable with those observed for a Caucasian population (Ref. 12
; Table 3
).
|
After determining the NAT2 polymorphisms, NAT2 genotypes and imputed phenotypes were calculated for most of the samples. It was not possible to assign genotypes and, thus, imputed phenotypes in three urine samples and one blood sample, because the combination of the six polymorphisms for these samples was not described in literature.4
NAT2 phenotypes for urine samples and blood samples were compared with results obtained from blood samples and are shown in Table 4
. A difference in imputed phenotype occurs in 6 (9.2%) of 65 samples. When unknown samples are excluded, differences occur in 2 (3.3%) of 61 samples.
|
|
| Discussion |
|---|
|
|
|---|
for each of the six polymorphisms was >0.85 with the exception of one, which was attributable to a low frequency of the mutant allele. Taking blood samples as the reference standard, rapid acetylators were classified as rapid in 97% of subjects (sensitivity) and slow acetylators were classified as slow also in 97% of subjects (specificity), when urine samples were used. We had the unique possibility to test accuracy of NAT2 genotyping using DNA that was isolated from urine samples, which were frozen and stored for many years. In this way our study takes into account all of the conditions that are relevant to field work in epidemiological biomarker studies and that may introduce misclassification of the biomarker. These are: (a) possible errors in the identification code and storage, when large numbers are involved; (b) a loss of quality and quantity of DNA because of storage; (c) errors in the linkage of data sets; and (d) errors in identifying and retrieving both the urine and blood samples from the -20°C storage facility and errors during isolation of DNA and during the laboratory tests.
Although epidemiological studies make a great effort to minimize biomarker misclassification, the above mistakes will most likely occur in population-based studies. In this study, we have come across some errors, which may result in misclassification, despite our utmost carefulness. Although all of the urine-blood pairs were from the same person according to identifying data, polymorphic markers showed that 2 of 67 matched samples were not from the same individual. This can be caused by a linkage error or a mix-up in the storage of either urine or blood samples that were collected years before. Another explanation is that an error occurred in the process of extracting DNA from urine or blood or in the additional handling of this DNA in fragment analysis.
The reproducibility of imputed phenotype was not 100% (91.5% in urine samples and 97.9% in blood samples), which could have several reasons. It is not possible to know whether it is a problem in the assay (faint bands, incomplete digestion, poor gel resolution) or a technician error (contamination or sample mix-up).
Another 14 urine-blood pairs showed differences in NAT2 genotyping results, although they seemed to be from the same person according to administrative data and the identity of two polymorphic markers. However, 2 of
1000 persons will show the same identity when using these two markers, although they are different persons. It is unlikely that this occurred in 14 of 67 samples. However, if they were classified incorrectly as the same individual and we had excluded these individuals, results for reproducibility and accuracy would have been better. We think that these 14 urine-blood samples were, in fact, from the same person and that the differences were caused by incorrect classifying polymorphisms attributable to low amounts of DNA extracted from urine.
No genotype and, therefore, no imputed phenotype could be established for 3 of 65 urine samples and 1 of 65 blood samples. It is possible that an error occurred in the determination of one or more polymorphisms in these samples, which resulted in an unknown genotype and phenotype. Another explanation is that new genotypes have been found with these samples. However, this is unlikely because these samples showed differences in genotype between urine and blood DNA.
To compute the bias in the estimation of the RR attributable to misclassification of the rapid acetylator type, results of blood samples were taken as the reference standard. This is not a perfect reference standard, because reproducibility of genotyping of blood DNA was not 100 but 97.9% (1 of 47 samples showed a different acetylator type). For the classification of imputed phenotypes for rapid acetylators, six polymorphisms are involved and each one will be at risk for misclassification. When lesser loci are involved, the risk of misclassification of imputed phenotype, which is based on the results of combined loci, will be lower. If another gene with fewer polymorphisms is investigated, it might be expected that the occurrence of misclassification of imputed phenotypes is less.
Results were based on DNA obtained from female urine samples. It was reported before that DNA extraction from male urine samples will yield less DNA (6) . Subsequently, for males, results might be different.
Smoking (yes/no) and older women (46 years of age and older) were slightly overrepresented in inaccurate samples (n = 14) when compared with accurate samples (n = 51). However, no firm conclusions can be drawn, because the numbers are very small.
Although the reproducibility of NAT2 genotyping results with DNA from urine samples was not 100% and differences were observed when using DNA from blood samples, the sensitivity and specificity rates were high: 97% of subjects were classified correctly for acetylator type when urine samples were used instead of blood samples (95% confidence interval, 90100% for sensitivity and 91100% for specificity). Therefore, at least in women, stored urine samples can be used for DNA genotyping in large cohort studies, when blood samples are not available.
| Acknowledgments |
|---|
| Footnotes |
|---|
1 Supported by the Dutch Cancer Society (UU 98-1707). ![]()
2 To whom requests for reprints should be addressed, at Julius Center for General Practice and Patient Oriented Research, University Medical Center, Utrecht, D01.335, P. O. Box 85500, 3508 GA Utrecht, the Netherlands. Phone: 31-30-2509363; Fax: 31-30-2505485; E-mail: P.H.M.Peeters{at}jc.azu.nl ![]()
3 The abbreviations used are: NAT2, N-acetyltransferase 2; DOM, Diagnostisch Onderzoek Mammacarcinoom/Diagnostic Investigation Breast Cancer; EPIC, European Prospective Investigation into Cancer and Nutrition; RR, relative risk; ORR, observed RR; TRR, true RR. ![]()
4 Internet address: www.louisville.edu/medschool/pharmacology/NAT.html. ![]()
Received 3/23/01; revised 4/22/02; accepted 4/25/02.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
R. A. Paynter, D. R. Skibola, C. F. Skibola, P. A. Buffler, J. L. Wiemels, and M. T. Smith Accuracy of Multiplexed Illumina Platform-Based Single-Nucleotide Polymorphism Genotyping Compared between Genomic and Whole Genome Amplified DNA Collected from Multiple Sources Cancer Epidemiol. Biomarkers Prev., December 1, 2006; 15(12): 2533 - 2536. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Cell Growth & Differentiation |