| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
1 Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Heidelberg, Germany; and 2 Finnish Cancer Registry, Institute for Statistical and Epidemiological Cancer Research, Helsinki, Finland
Requests for reprints: Hermann Brenner, Division of Clinical Epidemiology and Aging Research, German Cancer Research Center, Bergheimer Strasse 20, D-69115 Heidelberg, Germany. Phone: 49-6221-548140; Fax: 49-6221-548142. E-mail: h.brenner{at}dkfz-heidelberg.de
| Abstract |
|---|
|
|
|---|
15 years and diagnosed with one of 20 common forms of cancer between 1953 and 1997. We show that, in most cases, the model-based approach predicts 5- and 10-year relative survival expectations of newly diagnosed patients quite closely and much better than any of the previously available methods, including standard period analysis. We conclude that the model-based approach may enable deriving up-to-date cancer survival rates even with the common latency in availability of cancer registry data. (Cancer Epidemiol Biomarkers Prev 2006;15(9):172732) | Introduction |
|---|
|
|
|---|
In this article, we develop and empirically evaluate a model-based period analysis approach aimed to provide up-to-date estimates of long-term survival even with the common latency in availability of cancer registry data.
| Materials and Methods |
|---|
|
|
|---|
15 years with a first diagnosis with 1 of 20 common forms of cancer between 1953 and 1997.
Statistical Analysis
Throughout this article, we present relative rather than absolute survival rates, as the former are most commonly reported by cancer registries. Relative survival rates reflect the probability of surviving the cancer of interest rather than the total survival probability (9, 10), taking expected deaths in the absence of cancer into account. For this analysis, the expected numbers of deaths were derived from age-, gender-, and calendar periodspecific mortality figures of the general population of Finland according to the so-called Ederer II method (11).
We first assessed overall trends in survival during the past decades by looking at the development of 5-year relative survival for successive cohorts of patients diagnosed between 1953 to 1957 and 1993 to 1997.
Next, 5-year relative survival actually observed for patients diagnosed in 1993 to 1997 and followed through 2002 (Fig. 1, bottom solid rectangular frame ) was compared with the most up-to-date estimates of 5-year survival that would potentially have been available somewhere in the middle of 1993 to 1997, the years of diagnosis of this cohort. Assuming that, with the common latency of cancer registration of about several years, available data might have included patients diagnosed up to the year 1992, the following survival estimates were derived: First, a cohort-based estimate for the cohort of patients diagnosed in 1983 to 1987 and followed over 5 years since then (Fig. 1, top solid rectangular frame). Second, a so-called complete estimate additionally included patients diagnosed in 1988 to 1992, although the latter could not have been under observation for 5 years by the end of 1992 and might be censored at that date unless they died or were lost to follow-up earlier (Fig. 1, triangular frame). Third, a standard period estimate for the 1988 to 1992 period, derived by left truncation of observations at the beginning of 1988 in addition to censoring of observations at the end of 1992 as previously described (Fig. 1, dashed frame; ref. 1). Fourth, to overcome the latency in cancer registration, a model-based period estimate for the 1993 to 1997 period was derived.
|
To address the performance of the various methods in a much broader range of settings, we repeated the analyses of 5-year relative survival outlined for the cohort of patients diagnosed in 1993 to 1997 in the preceding paragraphs for cohorts of patients diagnosed in 1988 to 1992, 1983 to 1987, 1978 to 1982, and 1973 to 1977 as well (which is the widest possible range of cohorts for which analogous calculations could be carried out with the available database of the Finnish Cancer Registry). We calculated the following summary indicators of the performance of the various methods: The mean difference and the mean squared difference between 5-year relative survival later observed for patients diagnosed in the respective calendar periods and the various estimates potentially available during these periods. The mean differences reflect the average underestimation or overestimation of the 5-year relative survival rates. The mean squared differences, in addition, reflect average absolute levels of deviations of single estimates (with particular "punishment" of strong deviations).
To address the performance of the modeling approach for longer-term survival, we carried out analogous analyses comparing 10-year relative survival actually observed for patients diagnosed in 1988 to 1992, 1983 to 1987, and 1978 to 1982 with the most up-to-date estimates of 10-year relative survival that might have been available in those periods.
The analyses were carried out with the SAS statistical software package (Cary, NC), using the macro period, to derive the effective numbers of patients at risk and of deaths (13), and the procedure GENMOD to carry out Poisson regression.
| Results |
|---|
|
|
|---|
15 years were reported to the Finnish Cancer Registry with a first diagnosis of cancer between 1953 and 1997. Of these, we excluded 2.6% notified by death certificate only, another 2.6% notified by autopsy only, and 0.1% due to missing information on month of diagnosis. The 20 forms of cancer specifically addressed in this article include
89.6% of the remaining cancer cases (n = 490,279). The numbers of notifications, as well as 5-year relative survival rates of patients with these 20 forms of cancer, are shown for calendar years 1953 to 1957 and 1993 to 1997 (Table 1 ). Five-year relative survival strongly varied by cancer site. Among patients diagnosed in 1953 to 1957, it ranged from 66.3% for patients with cancer in the oral cavity to 3.2% for patients with esophagus cancer. Five-year relative survival rates increased between 1953 to 1957 and 1993 to 1997, albeit to a strongly varying degree, for all but one (pancreas cancer) of the assessed forms of cancer. A most pronounced increase by 49.4%, 45.4%, and 40.3% units (i.e., an average annual increase by >1% unit) was seen for patients with prostate, bladder, and thyroid cancer, respectively. With 89.4%, the latter had the highest 5-year relative survival rates in 1993 to 1997 among the cancer patients included in this analysis. On the other hand, 5-year relative survival still remained just above 10% for patients with cancers of the esophagus, liver, and lung, and <4% for patients with cancer of the pancreas.
|
|
|
|
|
|
| Discussion |
|---|
|
|
|---|
Our findings regarding the advantage of "standard" period analysis over cohort and complete analysis in terms of up-to-dateness of survival estimates are in agreement with previous evaluations that had not taken the usual delay in availability of cancer registry data into account (3-5). Previous analyses had also shown that period analysis may advance detection of trends in 5-, 10-, 15-, and 20-year survival by almost 5, 10, 15, and 20 years compared with cohort analysis (2). Model-based period analysis as applied in this study would be expected to result in further reduction of delay in disclosure of progress in survival by another 5 years. In fact, the mean difference in model-based period estimates for the current period from 5-year and 10-year survival later observed for patients diagnosed in that period was close to zero for most forms of cancer in our extensive empirical evaluation, which indicates that the delay in disclosure of progress in prognosis can be overcome almost entirely.
In our modeling approach, we assumed persistent increasing or decreasing trends in relative survival rates over time. Obviously, the model-based period approach performs best when this assumption holds entirely. In our time trend analyses of cancer survival over half a century, we found entirely or close to monotonic upward trends in survival for most forms of cancer. However, for cervical cancer, divergent trends in various time intervals were found. The transient drop in survival for this form of cancer probably reflects selection processes resulting form the very successful screening program for this form of cancer (14). As a result of this pattern, the performance of model-based period estimates was worse for this form of cancer than the performance of the other methods of survival analysis. For cancers with little change in prognosis over time, neither standard period analysis nor model-based period analysis provides advantages over conventional cohort or complete analysis, and all methods essentially perform equally well. Such a pattern was seen, for example, for pancreas cancer in our analysis.
In the application of the modeling approach, there are a number of design options that may be considered. These include, for example, the number and width of periods included in the modeling. In our analysis, we chose to use three 5-year periods, covering a 15-year time fame altogether. Although a longer time frame (e.g., use of four or five 5-year periods) would provide a broader basis for estimation, the assumption of a persistent trend within such a long time frame may often be more problematic. Also, such an approach would only be feasible in cancer registries with a long history of cancer registration at high levels of quality and completeness. Although this prerequisite would be given in the Finnish Cancer Registry, it would hinder application of modeled period analysis in a large and growing number of younger high-quality cancer registries. On the other hand, relying on just two 5-year periods might provide a too weak basis for reliable estimation of trends and may give too much weight to possible random deviations from long-term trends. The width of time periods used for modeling and prediction is likewise somewhat arbitrary. We feel, however, that the 5-year periods used in our analysis may be a reasonable choice, as it limits the influence of short-term fluctuations in prognosis (whether they are due to chance or due to other reasons), and avoids the limited flexibility that would result from too broad intervals.
A potential tradeoff of the modeling approach, apart from its reliance on the existence of a long-term history of reliable cancer registration, is the increased complexity in calculation and data interpretation. However, the increased complexity of calculations also comes along with increased flexibility, e.g., to take account of possible deviations from linearity in trends, or of other variables that might affect prognosis that might be included in the models. In fact, the modeling approach may be considered as a broader general methodologic framework, which includes standard period analysis as one special case (i.e., the case of a saturated model in which follow-up yearspecific survival rates are estimated for one single calendar period).
In summary, our empirical evaluation supports expectations from theory that the modeling approach further enhances possibilities of deriving up-to-date cancer survival rates. This approach may be particularly helpful to overcome outdatedness of survival data resulting from the common latency in availability of cancer registry data.
| Appendix 1 |
|---|
|
|
|---|
The calendar periods are coded in such a way that j = 0 for the first calendar period, and j = 1 and 2 for the subsequent calendar periods included in the modeling.
Then, a generalized linear model dij = f (i,j) is fitted with outcome dij, Poisson error structure, predictor variables i (categorical) and j (numerical), link ln(µij dij*), and offset ln(lij dij / 2), where µij = the model-based numbers of deaths and d* = (lij dij / 2) x ln((lij eij) / lij).
Let
i and ß be the estimated regression coefficients for follow-up years i and calendar periods j. Then, estimates of conditional relative survival for each combination of follow-up year i and calendar period j are given as
![]() |
![]() |
The modeled k-year cumulative relative survival for the current calendar period is obtained by setting j = 3.
| Footnotes |
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Received 3/20/06; revised 6/14/06; accepted 7/ 5/06.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. Gondos, F. Bray, T. Hakulinen, H. Brenner, and the EUNICE Survival Working Group Trends in cancer survival in 11 European populations from 1990 to 2009: a model-based analysis Ann. Onc., March 1, 2009; 20(3): 564 - 573. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Brenner, A. Gondos, and D. Pulte Expected long-term survival of patients diagnosed with multiple myeloma in 2006-2010 Haematologica, February 1, 2009; 94(2): 270 - 275. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Pulte, A. Gondos, and H. Brenner Trends in 5- and 10-year Survival After Diagnosis with Childhood Hematologic Malignancies in the United States, 1990-2004 J Natl Cancer Inst, September 17, 2008; 100(18): 1301 - 1309. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Brenner and T. Hakulinen Maximizing the Benefits of Model-Based Period Analysis of Cancer Patient Survival Cancer Epidemiol. Biomarkers Prev., August 1, 2007; 16(8): 1675 - 1681. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |