Some small violations may have little practical effect on the analysis, while other violations may render the Kaplan-Meier results uselessly incorrect or uninterpretable. In particular, small sample sizes may increase the effect of assumption violations. Heavy censoring may also affect the reliability of the Kaplan-Meier estimates.
Implicit factors can also affect censoring times, by affecting the probability that a subject will be withdrawn from the study or lost to follow-up. For example, younger subjects may tend to move away (and be lost to follow-up) more frequently than older subjects, so that age (an implicit factor) is correlated with censoring. If the sample under study contains many younger people, the results of the study may be substantially biased because of the different patterns of censoring. This violates the assumption that the censored values and the noncensored values all come from the same survival distribution.
Stratification can be used to control for an implicit factor. For example, age groups (such as under 50, 51-60, 61-70 and 71 or older) can be used as strata to control for age. This is similar to using blocking in analysis of variance. The goal is to have each group/stratum combination's subjects have the same survival distribution.
If a loss or withdrawal of one subject could tend to increase the probability of loss or withdrawal of other subjects, this would also lead to lack of independence between censoring and the subjects.
The estimates for the survival functions and their variances rely on independence between censoring times and survival times. If independence does not hold, the estimates may be biased, and the variance estimates may be inaccurate.
An implicit factor not accounted for by stratification may lead to a lack of independence between censoring times and observed survival times.
A high censoring rate may also indicate problems with the study: ending too soon (many subjects still alive at the end of the study), or a pattern in the censoring (many subjects withdrawn at the same time, younger patients being lost to follow-up sooner than older ones, etc.)
If the last observation is censored, the Kaplan-Meier estimate of survival can not reach 0.
If the number of subjects exposed (at risk) in an interval or the number of subjects that survived to the beginning of that interval is small, the variance estimates for the survival functions will tend to underestimate the actual variance. This situation is most likely to occur for later intervals, when most subjects have either died or been censored, so that the variance estimates for later intervals are generally less reliable than those for earlier intervals.
Examine the glossary.
Do a keyword search of PROPHET
StatGuide.
Back to StatGuide Kaplan-Meier page.
Back to StatGuide home page.
©1996 BBN Corporation All rights reserved.