
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Centers for Disease Control and Prevention and 2 Emory University, Atlanta, Georgia; 3 University of Michigan, Ann Arbor, Michigan; 4 Wayne State University, Detroit, Michigan; and 5 Fred Hutchinson Cancer Research Center, Seattle, Washington
Requests for reprints: Martin Steinau, Coordinating Center for Infectious Diseases, Division of Viral and Rickettsial Disease, Viral Exanthems and Herpesvirus Branch, Centers for Disease Control and Prevention, 1600 Clifton Road, MS G-41, Atlanta, GA 30333. Phone: 404-639-0561; Fax: 404-630-3540. E-mail: MSteinau{at}cdc.gov
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The etiologic role of high-risk human papillomavirus (HR-HPV) in cervical neoplasia is well established. HPV testing has therefore been approved as an adjunct to cytology to improve cervical cancer screening in women over 30 and to triage women with equivocal cytology results (atypical cells of uncertain significance; refs. 3, 4). However, the greatest strength of HPV testing is its negative predictive value. Specificity remains relatively low because of high prevalence of transient asymptomatic HPV infections in the absence of disease (5).
Molecular biomarkers hold great promise as objective and precise indicators of pathogenetic processes. Discovery efforts have intensified in recent years, but studies to evaluate their usefulness in clinical applications are rather rare and typically lack sufficient sample size (6-8). To date, the most promising candidates, such as 3q amplification, p16, and replication license proteins, require in situ hybridization or immunohistochemistry assays and interpretation within a morphologic context (9-11). Ideally, screening biomarkers should be detected in noninvasively collected samples, such as exfoliated cervical cells, using an assay format with the potential for high throughput and quantitation. We have recently shown that exfoliated cells have a gene transcription profile representative of tissue and might be suitable for biomarker screening (12). Using a quantitative reverse transcription-PCR (RT-PCR) assay, we evaluated 40 candidate RNA markers in exfoliated cervical cells for their potential to detect underlying CIN3 lesions.
| Materials and Methods |
|---|
|
|
|---|
All samples from women with biopsy-confirmed high-grade CIN (CIN3; n = 88) and invasive cancer (n = 5) not previously analyzed for gene expression with microarrays were included as cases (CIN3+). Two controls with no cervical disease identified at colposcopy with biopsy and/or cytology (CIN0) matched for age (20-50 years) and race/ethnicity (Black, White, and Hispanic) were selected for each case. To evaluate markers that could add to HR-HPV testing, controls were matched for HPV status as much as possible. In the study set, 91 of 93 cases and 95 of 186 controls were HR-HPV positive. The matched samples were split by a pseudo-random number generator into two sample sets: set 1 (exploratory), 47 CIN3+ and 94 CIN0; set 2 (test), 46 CIN3+ and 92 CIN0.
Candidate Genes
Using microarrays for whole genome expression analysis, we had previously identified six genes potentially up-regulated in exfoliated cervical cells from subjects with CIN3+ (12). These, along with an additional seven genes from the same study indicated by other statistical measures, were chosen for further evaluation (genes 1-13 in Table 1
). Based on review of recent literature, we selected an additional 27 gene transcripts reported to have potential to identify cervical neoplasia. These genes were either well-known markers of cell transformation (i.e., proliferation and cell cycle progression, immortality, and altered cell adhesion) or were published with compelling experimental evidence for strong overexpression in cervical lesions. Table 1 lists all candidate genes selected for evaluation. With the exception of DAPK1 and IGSF4, all genes were expected to be overexpressed in CIN3+ samples.
|
A 7.5-µL total nucleic acid aliquot was treated with 13.5 units DNase I (GenHunter, Nashville, TN) in a 27.5-µL reaction with 1x RT buffer (Invitrogen Corp., Carlsbad, CA) for 30 min at 37°C and 2 min at 70°C. The reactions were carried out in 0.6 mL Hi-Yield Nucleic Acid Recovery tubes (Robbins Scientific Corp., Sunnyvale, CA) and incubated in a digital heatblock (VWR Scientific Products, West Chester, PA). One microliter was removed to be tested for residual DNA (no-RT control). To the remainder we added 13.5 µL master mix consisting of 1x RT buffer, 60 ng random primer pd(N)6, 1.5 ng oligo-T primer pd(T)1218, 4 µL deoxyribonucleotide triphosphates, 10 mmol/L each (all from Invitrogen), and 0.1 pg exogenous plant RNA of the Arabidopsis thaliana Chlorophyll A-B binding protein CAB (Stratagene, La Jolla, CA). This spike was included to serve as an external control to monitor reproducibility of the RT efficiency. The tubes were heated at 70°C for 5 min and placed on ice for 2 min. Finally, 300 units of SuperScript III (Invitrogen) and 3 µL of 20 mmol/L dichlorodiphenyltrichloroethane in 1x RT buffer were added and brought with H2O to a final volume of 60 µL. After incubation for 5 min at 25°C, 50 min at 50°C, and 15 min at 70°C, samples were diluted with 180 µL H2O and stored in 80 µL aliquots at 20°C until PCR amplification. Two microliters of this cDNA was used in subsequent PCR reactions, and the remainder of the aliquot was kept at 4°C.
Efficient DNase treatment was verified in all samples by running 2 µL of the 1:10 diluted no-RT control (see above) in a PCR reaction with primers for the abundant PGK1 gene. No signals from no-RT controls reached the threshold within 35 PCR cycles unlike the positive control of cDNA from pooled cervical extracts.
Primers for candidate genes were designed with PerlPrimer v1.1.9 (15), strictly avoiding 3' extendable primer dimers. Wherever possible, primers were selected to span an intron, and at least one primer was placed over an exon/exon boundary producing amplicons between 80 and 300 bp (median, 136 bp; Table 1). We verified target specificity of each primer pair by gel electrophoresis of amplicons generated from cDNA from pooled cervical exfoliated cells, human genomic DNA (Roche Diagnostics, Indianapolis, IN), and a-no template control. Primers and conditions for the reference gene transcripts PGK1 and RPL4 as well as the external control CAB were established previously (16). After amplification, all PCR reactions were subjected to dissociation curve analysis to verify product specificity by a single peak of the second derivative of the melting curve. PCR amplification efficiency for all primers was determined by the slope (101/slope) of a standard curve over a five-step 10 fold dilution series (100, 10, and 1 ng and 100 and 10 pg). The cDNA template for this standard curve was made from universal human reference RNA (Stratagene). Amplification efficiency had to be above 90% for primers to be used for real-time RT-PCR (Table 1).
Real-time PCR amplification of all samples was done in duplicate on an ABI 7900 HT sequence detection system (Applied Biosystems, Foster City, CA) using 96-well plates. The 25-µL reaction volume contained 1x SYBR Green PCR Master Mix (Applied Biosystems), 0.8 µmol/L of each primer, and 2 µL template. The thermal profile consisted of one cycle at 95°C for 10 min and 40 cycles of 95°C for 15 s, 60°C for 15 s, and 72°C for 45 s. A no-template control was included on each plate.
Analysis
Raw Ct values were retrieved at a threshold of
Rn = 0.1 and imported as .txt files into MS Excel 2002. Reactions that were not reproducible as shown by a coefficient variation of Ct values >5% or an irregular melting peak profile (in either of the two reactions) were excluded from the analysis. For samples that passed these quality standards, the replicate Ct values were averaged. To normalize for sample differences, we subtracted the geometric means of PGK1 and RPL4 from averaged Ct values of each gene. This normalization factor was previously determined to be the most stable combination of transcripts in exfoliated cervical cells (16).
We applied receiver operating characteristics (ROC) analysis using SPSS for Windows 11.5.0 (SPSS, Inc., Chicago, IL) to normalized Ct values of CIN0 and CIN3+ groups and calculated the area under the curve (AUC) for each gene in the first sample set. We set a low-stringency cutoff (AUC >0.6) to identify genes with diagnostic potential to be evaluated in the second sample set (test set). For markers confirmed by an AUC >0.6 in the second sample set, an overall AUC was calculated from combined Ct values of all samples in both sets (nCIN3+ = 93; nCIN0 = 186).
We evaluated whether markers could be combined to improve detection of CIN3+. We set the threshold for detection of each gene at the normalized Ct value that gave 90% specificity for CIN3+. Samples in which one or more genes were detected were scored as positive for the marker panel. We also used likelihood maximization to select a linear combination of markers yielding a combination score in the context of a logistic regression model of disease status (17). To account for overfitting, the model was derived from a randomly selected 50% of the sample data and tested on the remaining 50%.
Efficiency (E) corrected average expression fold differences between groups were calculated from the means of normalized Ct values as E
CIN0-
CIN3. The Mann-Whitney U test was carried out to assess the statistical significance of overall differences between the CIN3+ and the control sample groups.
| Results |
|---|
|
|
|---|
The sample normalization factor (i.e., geometric means of Ct values for reference genes PGK1 and RPL4) indicates relative cDNA concentration and hence total RNA in each sample. The median difference in this normalization was 9.9-fold, indicating a 1,065-fold difference between the lowest and the highest cDNA concentrations. This reflects substantial sample-to-sample variation in total RNA concentration. Even so, target RNAs could generally be quantified from all samples. Ct values for the majority of transcripts (35 of 40) met quality control standards for quantitative evaluation in at least 75% of the samples in both the CIN3+ and CIN0 groups. Exceptions were the transcripts of AURKA, ADORA2B, SHCBP1, and NUDT6 that could not be determined in over 25% of the samples in one group and TERT that was frequently undetectable in both groups.
ROC Analysis
Table 2
shows the fold difference in expression between the means of the CIN3+ and CIN0 samples and the performance of the markers as evaluated with ROC curve analysis. The exploratory analysis (first sample set) identified 12 genes with an AUC >0.6. These were further analyzed in the test (second) set (46 CIN3+, 92 CIN0). Six markers [claudin 1 (CLDN1), minichromosome maintenance deficient 5 and 7 (MCM5 and MCM7), cell division cycle 6 homologue (CDC6), antigen identified by monoclonal antibody Ki-67 (MKI67), and SHC SH2-domain binding protein 1 (SHCBP1)] passed the second screen, with AUC >0.6 (Table 2).
|
|
|
|
Interestingly, at 60% sensitivity, CLDN1, the best single marker, also achieved about the same specificity, indicating its performance was similar to the five-marker panel (Fig. 1). However, the two approaches to detection showed little concordance in the sample classification. Of 68 samples classified as positive in either method, 46 were true positives classified as such by both methods, and 67 were false positives with only 23 classified as such by both methods. Discordant samples (either true positive or false positive) did not differ with respect to age, race, and HPV status (data not shown). Results for the samples classified as negative followed the same pattern. Diagnostic accuracy of the quantitative RT-PCR tests was not affected by the amount of RNA in the samples as the mean normalization factor for samples classified as true positive, false positive, true negative, and false negative did not differ (25.34, 25.61, 24.88, and 24.88, respectively, for CLDN1 and 25.10, 25.08, 25.11, and 25.19, respectively, for the five-marker panel).
The alternative combination approach (modeling a score by likelihood maximization from any combination of the six candidate markers) did not yield a higher AUC than single markers when split sample validation was applied (data not shown).
| Discussion |
|---|
|
|
|---|
Recent immunohistochemistry studies of CLDN1, an integral membrane protein for tight junction formation, showed strong expression in CIN lesions and cervical cancer (18, 19). Perturbations in CLDN1 have been associated with several cancers and may be an indicator of altered polarity, abnormal cellular organization, and loss of differentiation (20). The minichromosome maintenance proteins MCM5 and MCM7 are normally up-regulated in the G0 to G1-S phase transition and initiate and promote DNA replication. Increased expression at both the protein and RNA level has been shown in cervical tissue (21-23). MCM7 might also be a direct target for HPV (24, 25). Several immunohistochemistry studies have implicated CDC6 and MKI67 as proliferation markers often correlated with the grade of neoplasia (22, 26, 27). SHCBP1 was originally implicated by our own microarray study and had been described as a growth factor stimulated link of signaling pathways governing cell cycle progression (28). Notably, five of the six genes are involved in cellular proliferation.
CLDN1 is located at 3q28-q29, the region most frequently amplified during cervical carcinogenesis as determined by both genomic (29) and gene expression analysis (30). The other genes we identified as potential markers are located on chromosomal regions that have not been reported to be amplified (MCM5, 22q13.1; MCM7, 7q21.3-q22.1; CDC6, 17q21.3; MKI67, 10q21.3; and SHCBP1, 16q11.2).
As correlation between markers was limited, we evaluated whether they could be combined into a panel. Biomarker panels presume that cases have phenotypic diversity requiring more than one marker to achieve sensitive detection. Results from the two approaches that we took to combine markers did not exceed the diagnostic performance of CLDN1 alone (60% sensitivity and 76% specificity or an AUC of 0.72). Other methods for combining markers might be suggested, but it seems unlikely that the sensitivity would be increased significantly without loss of specificity.
As the study is enriched in disease and recruited from a referral population, the observed sensitivity and specificity does not reflect how these markers would perform in routine screening conditions. Based on ROC curve analysis, we feel that these markers may play a role in disease detection but are unlikely to have sufficient sensitivity and specificity to significantly improve screening practices. In guiding further work in biomarker discovery and validation, it is worth exploring reasons why screening sensitivity and specificity are not higher.
Limitations inherent in the sample quality may contribute to low sensitivity. The molecular preservation of exfoliated cervical cells is potentially compromised as a result of terminal differentiation and exfoliation. However, based on our successful demonstration that expression profiles from normal cervical tissue and exfoliated cells are comparable (12), we believe that molecular preservation is adequate. Furthermore, the amount of RNA, reflected in normalization factor, was the same in those samples correctly or incorrectly classified.
Dilution of signal may contribute to low sensitivity. The exfoliated cervical sample includes a mixture of cell types from normal and lesional tissue. As only a minority (1-10%) of cells are morphologically abnormal, their signaling intensity may be diluted in extracts of unselected cells. Detecting molecular changes indicative of disease in these extracts requires a large fraction of abnormal cells from the lesion, a robust change in expression that can withstand dilution, or a change in expression that extends beyond the morphologically detected lesion (i.e., the existence of a field effect). Although dilution cannot be discounted, it should be noted that the level of discrimination in this study using extracts of exfoliated cervical cells was similar to the immunohistochemistry measures of CDC6 and MCM in tissue samples of lesions shown by Murphy et al. (22).
Lower observed performance of a marker's specificity could also result from misclassification of controls. We used a combination of cytology, colposcopy, and biopsy with centralized pathology review to determine disease. However, whereas this is the current "gold standard" for follow-up of an abnormal Papanicolaou test, this approach is nonetheless imperfect. It is increasingly recognized that negative colposcopy and biopsy of women with HR-HPV does not entirely exclude the presence of disease, particularly in the endocervix, that may become detectable later (4, 31). Follow-up will be required to determine the true disease status in this population. The CIN3+ samples are unlikely to be misclassified as the pathology was reviewed by a panel of experts. As expected, all but two of these samples were HR-HPV positive.
Poor discrimination of markers may be a reflection of the biology of preinvasive cervical disease. These lesions present a spectrum of morphologic changes that variably reflect the multiple molecular changes that occur during neoplastic progression. The true biological potential of histologically similar CIN3+ lesions is not clear. It is estimated that up to 33% of these lesions may regress (32). Of those that persist, many years and the acquisition of additional genomic changes are generally required before invasion. As noted above, immunohistochemistry markers in tissue samples have imperfect sensitivity and specificity as well, presumably for these reasons.
Finally, imperfect discrimination could be a result of the cellular functions represented by this set of biomarkers. As these markers mainly target cell cycle proliferation, nonspecific increases in proliferative activity associated with inflammation, squamous metaplasia, or tissue repair unrelated to neoplasia could lead to detection in the cervical epithelium of healthy controls. Conversely, regressing CIN3 lesions may show reduced proliferation. Immunohistochemistry experiments have shown moderate to strong expression of several of the implicated genes in cells of the epithelial basal and parabasal layer (19, 21, 23). Additional markers not related to proliferation might increase the accuracy, but it is striking that none of the genes with other functions were validated among the 40 candidates we tested.
This study supports the concept that exfoliated cervical cells reflect changes in gene transcription and can be used to detect CIN with sensitivity and specificity similar to markers in biopsy tissue. However, the performance of these six markers is not sufficient to significantly improve screening. Problems in sampling cannot be excluded, and it will be important to determine the tissue profile of these markers to evaluate this limitation. Further biomarker discovery, targeting DNA and protein markers in combination with these and additional RNA markers, will be necessary. We recommend that further studies should use screening samples collected from women with a minimum of 2 years of follow-up data to reduce misclassification.
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Note: The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the funding agency.
Received 7/ 7/06; revised 9/29/06; accepted 11/21/06.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |