| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Department of Radiology, University of Cambridge, Addenbrooke's Hospital; 2 Strangeways Research Laboratory, Cambridge, United Kingdom; 3 Department of Engineering Science, University of Oxford, Oxford, United Kingdom; and 4 Highnam Associates Limited, Wellington, New Zealand
Requests for reprints: Jane Ding, Department of Radiology, University of Cambridge, Addenbrooke's Hospital, Box 218, Hills Road, Cambridge CB2 0QQ, United Kingdom. Phone: 44-1223-256119; Fax: 44-1223-330915. E-mail: jjd32{at}cam.ac.uk
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The first method of measuring breast density was proposed by Wolfe in 1976 and involves categorically classifying density (9). Later methods in the same vein include Tabar's five-point grading system, Breast Imaging Reporting and Data System, and the six-category classification method, which involves visually quantifying the extent of density covering the breast. Several teams also visually estimated the percentage of dense breast tissue in 5% density increments, in other words, on a 20- or 21-category scale (3, 10). As precision became important in the quantification of breast density, Byng and colleagues (11) in Canada developed a computer-assisted thresholding technique for measuring density. Today, this and similar techniques are the most widely used method for calculating density.
The concerns with all of the current density measurement methods include reader subjectivity, labor intensiveness, differences in mammographic techniques, and lack of explicit consideration for the third dimension of the breast. The standard mammogram form (SMF) is a recently developed software that theoretically overcomes the previous addressed problems by estimating the amount of breast density volumetrically through a fully automated process.
SMF density measures have shown reasonable correlation with the visual six-category density classification method (12). However, it does have lower left-right reliability compared with the thresholding technique (13). Validation of a breast density measurement method should involve evaluating its ability to predict breast cancer risk in a case-control setting. In this study, we assess the association between breast density with cancer risk by comparing SMF with the conventional visual classification and the semiautomated density method.
| Materials and Methods |
|---|
|
|
|---|
In the United Kingdom, women ages 50 to 70 y receive invitations to attend routine breast screening every 3 y. Women ages >70 y are entitled to request screening if they so choose. Subjects in this study were ages between 50 and 75 y.
Controls were selected by searching the screening database from the National Health Service Regional Breast Screening Program for Cambridge and Norfolk and Norwich separately. For the Cambridge study, one control was selected for each cancer case. Approximately three controls were chosen for each cancer case in the Norwich project. The methods for control selection were exactly the same at both screening centers. For each screen-detected case, we selected controls who had also attended screening and been diagnosed as normal at that screen. For each interval case, we used the mammograms taken at the screening round before the date of the diagnostic mammogram as the basis for matching of controls. Controls were matched to the patients by screening center, date of birth (within 6 mo), and date of screening (within 3 mo). Patient characteristics at the Cambridge and Norwich regions are outlined in Table 1 .
|
Density Measurements
For cancer patients, we used mammograms from the contralateral or unaffected breast, and for the controls, we chose the matching side of the breast. We used only the mediolateral oblique view in the analyses as this single view was available consistently on all subjects (14). A total of 1,297 subjects received imaging for the craniocaudal view, and the density results from these images were compared using the Spearman's rank correlation to determine the consistency between the two views. Density assessments were done by an expert radiologist (R.W.), who is experienced in evaluating parenchymal patterns and visually estimating percent density, and received training on use of the interactive threshold program from its originators. The reader did not have any knowledge of whether the image belonged to a cancer case or a control. All repeated readings were randomly selected and done by R. Warren within 3 wk of the first readings. R. Warren was blinded to previous assignments during these repeated readings.
Wolfe's Parenchymal Patterns
Mammographic density was visually assessed based on Wolfe's parenchymal patterns. An N1 pattern involved parenchymal that is composed primarily of fat with no visible ducts. Parenchyma in P1 pattern contained prominent ducts extending to up to 25% of the breast, and in P2, pattern involved ducts in >25% of breast area. DY pattern is related to severe involvement of the parenchyma with extensive amount of density. Intrareader agreement was analyzed using the weighted
test based on repeated reading of 227 films.
Visual Percent Density
The percentage of mammographic density covering the breast was estimated visually for every 5% density increment. The visual percent density scores were there for 0%, 5%, 10%, 15%, and so forth up to 95% and 100%. This 21-point categorical scoring system resembles a continuous outcome. A total of 223 images were used to determine the intraobserver agreement, which was analyzed by Spearman's rank correlation.
Interactive Threshold
Breast density was measured using the interactive thresholding software, which calculates the percent area of breast covered in dense regions. Original mammograms were scanned using the Array 2905 DICOM ScanPro Plus Laser Film Digitizer version 1.3E (Array Corp. USA) at absorbance of 4.7. Interactive thresholding measurements were done using the Cumulus 3.0 program designed by Yaffe and Boyd, the originators of the technique (15). This computer-assisted method uses the gray scale in the digitized images to determine breast density. The assessor first adjusts the threshold to mark the edge of the breast and to mask the pectoral muscle. A second threshold is set to outline the regions of density within the breast. The program computes the number of pixels in dense area and total breast area, from which percent of density is calculated. A selection of 150 images was reanalyzed and used to determine the intraobserver agreement, as calculated by Spearman's rank correlation.
Standard Mammogram Form
Volumetric density measurements were obtained using the software GenerateSMF version 2.2β from Siemens Molecular Imaging. SMF provides a representation of the amount of nonfat tissue at each location in a mammogram, a representation that has been estimated by an evolving series of computer programs. If the separation between the mammography compression plates is known, then the SMF representation potentially provides a volume-based estimate of the amount of dense tissue in a breast. The SMF software available in this project requires knowledge of several film imaging variables, including tube voltage and film exposure time, to estimate the thickness of the compressed breast; this is a crucial step toward building a 2.5 dimensional model of the breast.5 Fortunately, it has recently been shown that SMF can equally be estimated without compromising much accuracy (16) for the majority of films that lack such imaging variable information. As the name SMF suggests, the SMF program compensates for variations in X-ray imaging techniques and standardizes image scanned on different film digitizers. The technical details of the SMF software have been described elsewhere (16, 17).
The Array-scanned images were converted into DICOM files compatible for use with SMF. X-ray calibration data, including tube voltage, film exposure time, anode target material, and filter material, were extracted from individual film labels and entered into the data entry section of the scanning software (software created by Christopher Tromans). All of the Norwich mammograms contained full set of calibration data as we excluded earlier films that did not have this information. The Cambridge mammograms were taken on two different types of mammography machines, one of which did not provide any calibration information on the film labels. Consequently, a total of 124 of 2,514 films (4.9%) did not have calibration data but were still included in the final analyses. The SMF software provides an estimate for breast thickness for each film and in doing so compensates for erroneous or lack of calibration data (16). The default calibration settings for films with unknown variables include tube voltage at 28 kVp, molybdenum/molybdenum anode target and filter combination, and estimation of exposure time from the scanned image.
The SMF program outputs the percent volume of density including and excluding the breast edge. Because the edge of the breast has inconsistent thickness compared with the rest of the breast and contains predominantly fatty tissue, we examined in this study only the values excluding the breast edge. Inevitably, as a result of this exclusion, the final percent volume of the breast as estimated by GenerateSMF is expected to be slightly understated.
Repeatability of SMF density measures was carried out on the full set of data (2,514 films) and a 100% correlation was achieved. This was expected as the software is fully automated.
The SMF software standardizes for different digitizers, given that the absorbance of scanned images is known and is either 4.0 or 4.7 (18). To test the digitizer effect, we compared SMF results on images scanned on an old hand-feed Lumysis digitizer (images scanned at absorbance 4.0) and on a modern Array machine (images scanned at absorbance 4.7; ref. 18).
Statistical Analyses
The association between mammographic density and breast cancer risk was evaluated using logistic regression to compute odds ratios (OR) with 95% confidence intervals (95% CI). Age (three categories: 50-54, 55-59, and 60+), screening center (Cambridge versus Norwich), and mode of detection (screen-detected cancers versus interval cancers) were included into the multivariate analyses as confounders. We were unable to obtain information on body mass index and reproductive history as they were not recorded in the National Health Service Breast Screening Program and, therefore, could not include these as confounders into the multivariate analyses. Density values, as measured using the visual method, thresholding technique, and SMF, were divided into quartiles of combined cancers and controls to be compared with Wolfe's four-point grading system. For all methods, the three higher density groups were compared individually to the lowest baseline group. The fit of the models was compared using the likelihood ratio test statistics. The analyses were repeated assuming a linear trend in the relationship between density and risk (the trend test). The same analyses were carried out in separate screen-detected and interval cancer populations. Finally, we considered the model including both the SMF results and the threshold readings to determine whether each was an independent determinant of risk, considering breast density both as a continuous and a category variable. We conducted all statistical analyses using Stata 8.0 (Stata Corp.).
Quality Control of Density Readings
A total of 1,297 subjects received imaging for the craniocaudal view, which was additionally assessed by the reader. The SMF mammographic mediolateral oblique and craniocaudal view Spearman's rank correlation was 0.746 for percent volume of density. View correlations for the thresholding method were 0.962 for total breast area, 0.897 for total dense area, and 0.907 for percent area of density. A random selection of 150 images was used to assess reader repeatability for the thresholding method: the Spearman's rank correlations were 0.995 for total breast area, 0.930 for total dense area, and 0.933 for percent area of density. The Spearman correlation for visual estimates of percent density on 223 images was 0.957. The intraobserver agreement of the reader for parenchymal pattern assessment on 227 images was excellent (weighted
= 0.820).
| Results |
|---|
|
|
|---|
Table 2 summarizes the ORs associated with increasing categories of density in the overall population. In general, mean percent densities as measured by interactive threshold and SMF were higher in the cancer population than the control population, but the difference was more pronounced in the threshold method (Table 2). Strong density-risk trends existed in all four methods, as was shown by the P values for trend being <0.001. However, the stepwise increase in risk associated with higher density categories was greater in the threshold method compared with Wolfe's classification, visual estimate of percent density, and the SMF method (Table 2). The ORs determined by density measures using the threshold were 2.45 (95% CI, 1.86-3.23), for the SMF method was 1.92 (95% CI, 1.47-2.51), for Wolfe's grades was 1.79 (95% CI, 1.27-2.52), and for visual percent density scoring was 1.81 (95% CI, 1.41-2.32), comparing the highest categories of density with the baseline category.
|
|
40% area of density or more in her breast would be at 2.45-fold risk according to the threshold method but only 1.24- to 1.81-fold risk according to the visual grading (Table 2). The distribution of SMF results is also different from threshold readings. For example, a woman whose breast is predominantly fatty and is classified as having virtually no density measured using the threshold method will have a minimum of 11.5% density based on the calculations of SMF. This is because in the computation of SMF, even the smallest amount of density must be stretched out across the thickness of the breast, so there are rarely very low volumetric percent density values. To determine if lack of calibration data affected the observed risk, we eliminated the set of 124 films with their matched cases/controls and carried out the same multivariate analyses. A total of 558 cases and 1,804 controls were used in this subanalysis. The ORs obtained were consistently lower for all methods, and the absolute decreases in OR were no more than 12% (data not shown; available on request). The results that were significant in the total data set were also significant in this subset of data.
We tested the effect of different digitizers on SMF results by directly comparing the SMF values for Lumisys-scanned images and those for Array-scanned images. We showed a 100% correlation in the SMF results (18).
Using the likelihood ratio, the model fit for the four methods was compared. The
2 statistic for SMF measures, threshold readings, Wolfe's grades, and visual readings was 5.93, 25.42, 2.86, and 3.47, respectively, suggesting that the threshold readings provided the best predictors of breast cancer risk.
After adjusting for interactive threshold readings, SMF density results were no longer associated with breast cancer risk (Table 3). However, adjustment for SMF results did not significantly modify the relationship between threshold density readings and cancer risk (Table 3) compared with before the adjustment (Table 2).
| Discussion |
|---|
|
|
|---|
Mammographic features captured by the threshold method have the strongest association with cancer risk compared with Wolfe's patterns, visual estimation of percent density, and SMF. The comparison among the three remaining methods was more ambiguous. Although the stepwise increase for density and risk association was overall greater for the SMF method, the differences were too small for us to form definitive conclusions. The likelihood ratio test provided further evidence that SMF was the best predictor of risk of the three methods. However, more work is needed to explore this conjecture.
It is generally accepted that the association between density and risk is stronger for percent density measures compared with Wolfe's patterns or other qualitative classifications (1-5, 19, 20). Most studies arrived at these conclusions based on the relative magnitudes of risk or ORs. There are several limitations in such studies, including inconsistencies in the methods used to categorize percent density, differences in the experience of the assessors, and variations in study populations. To overcome these limitations, some articles carried out direct comparisons between qualitative and quantitative methods and their associations with cancer risk (3, 21-23). Our findings are partly in agreement with the work by Brisson et al. (3), showing percent density as a stronger risk predictor compared with Wolfe's grading. However, their percent density readings were obtained visually using a method similar to the 21-scale system in our study.
The magnitudes of breast cancer risk relative to each categories of density were lower than in most other studies. This may be due to differences in the range of readings between studies (e.g., a wider range of readings results in the upper density category being farther from the baseline group, leading to a larger observed risk). Furthermore, the lower than average ORs can be explained by a lack of confounder data, such as body mass index, family history of breast cancer, and reproductive variables, all of which are strongly related to breast cancer risk. Stratifying the data by mode of detection revealed that Wolfe's patterns, visual percent density, and SMF percent density were not associated at all with cancer risk in the screen-detected population. This finding is not in accordance with results from several other studies (2, 3, 5, 7, 21). Relative risks with respect to density were severalfold higher in the interval cancer population than the screen-detected group. This could be explained by the masking hypothesis that states that tumors can be masked in the midst of the extensive breast density and are thus easily missed during routine screening but are presented symptomatically after a negative screen (24). Consequently, women with dense breasts are at higher risk of developing interval cancers compared with screen-detected cancers.
Three of the four methods in the study have a subjective component, which is the main limitation of their use. Although SMF has perfect repeatability, the accuracy of its density measures may rely on the amount of X-ray variable information available. However, the calibration variable compensation mechanism of SMF underwent evaluation and showed robustness in estimating breast thickness even in the absence of any calibration data (16). Finally, we must emphasize that there is a need to be able to apply SMF to digital mammograms, for which the present software is not adapted.
Several groups around the world have been working on developing volume-based density measurement methods. Most methods use a step-wedge approach to calibrate each image, which must be carried out at the time of mammography and therefore cannot be applied retrospectively to films (25). Some teams are also working on a new version of the threshold method applied to digital mammography. Advances in this subject will be of great value as the area of mammography is quickly moving toward digital forms. There are advantages to using digital images over film mammography, including lack of labor for film digitizing, lack of differences in the quality of digitized images, and availability of calibration data for digital images.
Strengths and Limitations
Despite subjectivity in the three methods evaluated in this study, the films were all assessed by the same reader who has shown good repeatability. Furthermore, this reader was trained by the originator of the threshold technique (Professor Norman Boyd, Toronto, Ontario, Canada) to ensure good interclass agreement on using this method. To determine the ability of SMF to predict breast cancer risk, its measures of density were compared with two of the most widely used density assessment methods, Wolfe's grading and interactive threshold. The large size of the study population and the high ratio of controls to cancers provided further statistical robustness to this case-control study. Finally,
95% of the population data sets have full film acquisition data, which could be important for the accuracy of SMF.
The main limitation in this study is the lack of confounding data, as this was a retrospective study involving medical records only. However, this lack of adjustment for confounders should theoretically affect all three manual reading methods equally. For example, we have evidence (7, 21, 26, 27) that body mass index is a confounder for the Wolfe's grading and the threshold method, but we do not know whether this applies also to SMF. Furthermore, the quartile method of categorizing the continuous measures of density was not consistently used in the previous studies, so the magnitudes of breast cancer risk could not be fairly compared with those in other studies.
Future Directions
Breast density is currently undergoing investigation as a biomarker for breast cancer risk. Although the current implementation of GenerateSMF is not the best predictor of breast cancer risk, several teams are working on improving the SMF software and possibly applying the new algorithm on digital mammography. Future work on producing an objective volumetric approach of measuring breast density will be a step closer to the use of density in clinical practice and its involvements in studies on breast cancer etiology.
| Disclosure of Potential Conflicts of Interest |
|---|
|
|
|---|
| Acknowledgments |
|---|
We thank the staff at the Cambridge Breast Screening Unit, particularly Dr. Peter Britton, Dr. Lynda Bobrow, Judith Fatibene, and Barbara Knighton; the staff at the Norwich Breast Screening Unit, with special mentions to Dr. Erika Denton, Dr. Anne Girling, Glynis Wivell, and Judy Priddle; the technical team at Siemens Molecular Imaging, including Jerome Declerck and Xiao Pan Bo; Professor Norman Boyd for his training on using the Cumulus program; and Professor Martin Yaffe and his team for technical support.
| Footnotes |
|---|
5 It is called 2.5D because, at a particular image location x, the SMF value may be estimated to be 3.7 cm of nonfat tissue, but if the compressed breast thickness is 6 cm, the program cannot say which part of the 6-cm column of tissue corresponding to x is fat (2.3 cm) and which is nonfat. ![]()
Received 9/28/07; revised 2/12/08; accepted 2/25/08.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
N. Boyd, L. Martin, A. Gunasekara, O. Melnichouk, G. Maudsley, C. Peressotti, M. Yaffe, and S. Minkin Mammographic Density and Breast Cancer Risk: Evaluation of a Novel Method of Measuring Breast Tissue Volumes Cancer Epidemiol. Biomarkers Prev., June 1, 2009; 18(6): 1754 - 1762. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Heine, M. J. Carston, C. G. Scott, K. R. Brandt, F.-F. Wu, V. S. Pankratz, T. A. Sellers, and C. M. Vachon An Automated Approach for Estimation of Breast Density Cancer Epidemiol. Biomarkers Prev., November 1, 2008; 17(11): 3090 - 3097. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |