Добавил:
kiopkiopkiop18@yandex.ru t.me/Prokururor I Вовсе не секретарь, но почту проверяю Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Ординатура / Офтальмология / Английские материалы / Using and Understanding Medical Statistics_Matthews, Farewell_2007

.pdf
Скачиваний:
0
Добавлен:
28.03.2026
Размер:
3.03 Mб
Скачать

Table 6.1. A typical tabulation for a Kaplan-Meier survival curve; the data represent 31 lymphoma patients, of whom 20 are known to have died

Time, t,

Number at

Number of

Estimated probability

Standard error

in months

risk at

deaths at t

of surviving more

of the estimate

 

t months

months

than t months

 

 

 

 

 

 

0

31

0

1.000

2.5

31

1

0.968

0.032

4.1

30

1

0.935

0.044

4.6

29

1

0.903

0.053

6.4

28

1

0.871

0.060

6.7

27

1

0.839

0.066

7.4

26

1

0.806

0.071

7.6

25

1

0.774

0.075

7.7

24

1

0.742

0.079

7.8

23

1

0.710

0.082

8.8

22

1

0.677

0.084

13.3

21

1

0.645

0.086

13.4

20

1

0.613

0.087

18.3

19

1

0.581

0.089

19.7

18

1

0.548

0.089

21.9

17

1

0.516

0.090

24.7

16

1

0.484

0.090

27.5

15

1

0.452

0.089

29.7

14

1

0.419

0.089

32.9

12

1

0.384

0.088

33.5

11

1

0.349

0.087

 

 

 

 

 

those used to produce K-M estimated survivor curves, interpretation of the resulting estimated curves is not straightforward, and statistical help is advised.

(e) Corresponding to the K-M estimated survival curve presented in figure 6.1, table 6.1 shows typical information provided by computer programs which calculate Kaplan-Meier estimated survival curves. Each line of the table records a survival time, the number of deaths which occurred at that time, the number of individuals under observation at that time, the K-M estimate of the probability of surviving beyond that time, and a standard error of the estimate. Typically, the standard error will not be the simple estimate of comment (c), but generally a better one involving more detailed calculations.

General Features of the Kaplan-Meier Estimate

61

6.3. Computing the Kaplan-Meier Estimate

In this penultimate section of chapter 6 we intend to discuss, in some detail, a simple method for calculating the Kaplan-Meier estimate by hand. The technique is not complicated; nevertheless, some readers may prefer to omit this section.

To calculate a Kaplan-Meier estimated survival curve, it is first necessary to list all observations, censored and uncensored, in increasing order. The convention is adopted that if both observed and censored survival times of the same duration have been recorded, then the uncensored observations precede the corresponding censored survival times in the ordered list. For the simple example presented in figure 6.2a, the ordered list of observations, in days, is

1*, 3, 4*, 5, 5, 6*, 7, 7, 7*, 8*

where * indicates a censored survival time.

The second step in the calculation is to draw up a table with six columns labelled as follows:

tobserved distinct survival time, in days

nthe number of individuals still under observation at t days

rthe number of deaths recorded at t days

pc

the proportion of individuals under observation at t days who do not

 

die at t days, i.e., (n – r)/n

Pr(T 1 t) the estimated probability of survival beyond t days

s.e.

the approximate standard error for the estimated probability of survival

 

beyond t days

For convenience, the initial row of the table may be completed by entering t = 0, n = number of individuals in the study, r = 0, pc = 1, Pr(T 1 t) = 1 and s.e. = blank. This simply indicates that the estimated survival curve begins with Pr(T 1 0) = 1, i.e., the estimated probability of survival beyond Day 0 is one. The first observed survival time in the ordered list of observations is then identified, and the number of observations which exceed this observed survival time is recorded, along with the number of deaths occurring at this specific time. Notice that the number of individuals still under observation at t days is equal to the number of individuals in the study minus the total number of observation times, censored or uncensored, which were less than t days. Thus, both deaths and censored observations reduce the number of individuals still under observation, and therefore at risk of failing, between observed distinct survival times.

In our example, the first observed survival time is t = 3; therefore, the initial two rows of the table would now be:

6

Kaplan-Meier or ‘Actuarial’ Survival Curves

62

t

n

r

pc

Pr(T > t)

s.e.

0

10

0

1.0

1.0

3

9

1

 

 

 

 

 

 

 

 

 

The column labelled pc gives the estimated probability of surviving Day t for individuals alive at t days. This is (n – r)/n, and Pr(T 1 t), the estimated probability of survival beyond t days, is the product of the value of pc from the current line and the value of Pr(T 1 t) from the preceding line of the table. In the second row of our example table then, pc = 8/9 = 0.89 and Pr(T 1 t) = 0.89 ! 1.0 = 0.89. According to the formula given in comment (c) of §6.2 for an approximate standard error for Pr(T 1 t), the entry in the s.e. column will be:

Pr(T 1 t) {1 – Pr(T 1 t)}/n = 0.89 (1 – 0.89)/9 = 0.10.

This process is then repeated for the next observed survival time. According to the ordered list for our example, r = 2 deaths are recorded at t = 5, at which time n = 7 individuals are still under observation. Therefore, pc = (7 – 2)/7 = 0.71, Pr(T 1 t) = 0.71 ! 0.89 = 0.64, and s.e. = 0.64 0.36/7 = 0.15. The table now looks like the following:

t

n

r

pc

Pr(T > t)

s.e.

0

10

0

1.0

1.0

-

3

9

1

0.89

0.89

0.10

5

7

2

0.71

0.64

0.15

 

 

 

 

 

 

The final row in the table will correspond to the r = 2 deaths which are recorded at t = 7, when n = 4 individuals are still under observation. In this row, therefore, pc = 2/4 = 0.50, Pr(T 1 t) = 0.64 ! 0.50 = 0.32 and s.e. = 0.32 0.68/4 = 0.13. Thus, the completed table for our simple example is:

t

n

r

pc

Pr(T > t)

s.e.

0

10

0

1.0

1.0

3

9

1

0.89

0.89

0.10

5

7

2

0.71

0.64

0.15

7

4

2

0.50

0.32

0.13

 

 

 

 

 

 

To plot the K-M estimated survival curve from the completed table, use the values in columns one and five (labelled t and Pr(T 1 t)) to draw the graph with its characteristic staircase appearance. Steps will occur at the values of the distinct observed survival times, i.e., the values of t. To the right of each value

Computing the Kaplan-Meier Estimate

63

of t, draw a horizontal section at a height equal to the corresponding value of Pr(T 1 t). Each horizontal section will extend from the current value of t to the next value of t, where the next decrease in the graph occurs.

From the columns labelled t and Pr(T 1 t) in the table for our example, we see that the graph of the K-M estimated survival curve has a horizontal section from t = 0 to t = 3 at probability 1.0, a horizontal section from t = 3 to t = 5 at probability 0.89, a horizontal section from t = 5 to t = 7 at probability 0.64, and a horizontal section from t = 7 at probability 0.32. Since the final entry in the table for Pr(T 1 t) is not zero, the last horizontal section of the graph is terminated at the largest censored survival time in the ordered list of observations, namely 8. Therefore, the final horizontal section of the graph at probability 0.32 extends from t = 7 to t = 8; the estimated probability of survival is not defined for t 1 8 in this particular case.

If the final censored observation, 8, had been an observed survival time instead, then the completed table would have been:

t

n

r

pc

Pr(T > t)

s.e.

0

10

0

1.0

1.0

3

9

1

0.89

0.89

0.10

5

7

2

0.71

0.64

0.15

7

4

2

0.50

0.32

0.13

8

1

1

0.00

0.00

 

 

 

 

 

 

In this case, the graph of the K-M estimated survival curve would drop to zero at t = 8 and be equal to zero thereafter.

6.4. A Novel Use of the Kaplan-Meier Estimator

Despite the fact that estimation of the statistical distribution of the time to an endpoint like death is routinely referred to as the survival curve or survivor function, not all studies in which K-M estimators are used require subjects to die or survive. James and Matthews [11] describe a novel use of the K-M estimator in which the endpoint of interest is an event called a donation attempt. Their paradigm, which they call the donation cycle, represents the use of familiar tools like K-M estimator in an entirely new setting, enabling transfusion researchers to study the return behaviour of blood donors quantitatively – something that no one had previously recognized was possible.

According to the paradigm that they introduce, each donation cycle is defined by a sequence of four consecutive events: an initiating donation attempt,

6

Kaplan-Meier or ‘Actuarial’ Survival Curves

64

Proportion of donors who have not yet returned

1.0

Cycle 1

Cycle 2

Cycle 3 0.8 Cycle 4 Cycle 5

0.6

0.4

0.2

0

0

50

100

150

200

250

 

Time since a previous donation attempt (weeks)

Fig. 6.3. Kaplan-Meier estimated survival curves for Type O, whole blood donations in donation cycles one to five.

a mandatory deferral period, which is typically eight weeks for whole blood donors, an elective interval, and a subsequent donation attempt. The elective interval represents the time between the end of the mandatory deferral period and the occurrence of the next donation attempt. Donors may, and hopefully do, complete multiple donation cycles. Each cycle, while conceptually identical to all other donation cycles, can be distinguished by its unique sequence number, e.g., first, second, etc., and has an exact length which can be measured in suitable units of time, such as days or weeks. Furthermore, if a donor who has made at least one donation attempt fails to make another, the observed interval is clearly a censored observation on the event of interest, i.e., a subsequent donation attempt.

To illustrate the merits of the donation cycle paradigm, James and Matthews [12] obtained records concerning all Type O whole blood donors from the Gulf Coast Regional Blood Center (GCRBC) in Houston, Texas, who had not been permanently deferred before April 15, 1987. From 164,987 usable donor records and a corresponding set of 608,456 transactions that were almost exclusively donation attempts, these researchers identified the lengths of all first, second, third, etc. donation cycles. They then randomly sampled approximately 5,100 intervals from each cycle-specific dataset.

A Novel Use of the Kaplan-Meier Estimator

65

Despite the size of each sample, this cycle-specific data is exactly what modern statistical software requires to generate K-M estimates of the so-called survival curve for each sample. The resulting graphs are displayed in figure 6.3, and illustrate how, for the type O, whole blood donors that contributed through the GCRBC, the intervals between donation attempts depend on a history of previous donations and the elapsed time since the initial or index donation attempt in a cycle. In each case, the estimated probability of making a subsequent donation attempt remains at or near 1.0 for the first eight weeks of the observation period – precisely the length of the mandatory deferral interval – and only thereafter begins to decrease. The graphs also show that the estimated survival function decreases with each additional donation attempt. Thus, for any fixed time exceeding eight weeks, the proportion of donors who had already attempted a subsequent donation is an increasing function of the number of previously completed donation cycles. In contrast to the usual staircase look of most K-M estimated survival curves, the smooth appearance of these estimated functions reflects the effect of using very large sample sizes to estimate, at any fixed value for t, the proportion of donors who had not yet attempted a subsequent donation. One year after the index donation, some 68% of first-time donors had yet to return for a subsequent attempt; at five years, approximately one-third of first-time donors had not attempted a second whole blood donation. The corresponding estimated percentages for donors in their fifth cycle were 36 and 10%, respectively.

The marked localized changes in the graph at 52 and 104 weeks are a particularly distinctive feature of these estimated curves; a third drop appears to occur at 156 weeks, although this decrease is less evident visually. James and Matthews speculate that these parallel changes ‘were the result of two GCRBC activities: blood clinic scheduling and a donor incentive programme which encouraged such clinic scheduling.’ The GCRBC offered an insurance-like scheme that provided benefits for donors and their next-of-kin, provided a donation had been made during the preceding 12 months. Similar localized changes have been seen in studies of other forms of insurance. For example, return-to- work rates usually increase markedly at or near the end of a strike or the expiration of unemployment insurance coverage (see Follmann et al. [13]).

In general, the K-M estimated survival curve is used mainly to provide visual insight into the observed experience concerning the time until an event of interest such as death, or perhaps a subsequent attempt to donate a unit of blood, occurs. It is common, however, to want to compare this observed experience for two or more groups of patients. Although standard errors can be used for this purpose if only a single fixed point in time is of scientific interest, better techniques are available. These include the log-rank test, which we describe in chapter 7.

6

Kaplan-Meier or ‘Actuarial’ Survival Curves

66

7

U U U U U U U U U U U U U U U U U U U U U U U U U U U

The Log-Rank or Mantel-Haenszel Test for the Comparison of Survival Curves

7.1. Introduction

In medical research, we frequently wish to compare the survival (or relapse, etc.) experience of two groups of individuals. The groups will differ with respect to a certain factor (treatment, age, sex, stage of disease, etc.), and it is the effect of this factor on survival which is of interest. For example, figure 7.1 presents the Kaplan-Meier estimated survival curves for the 31 lymphoma patients mentioned in chapter 6, and a second group of 33 lymphoma patients who were diagnosed without clinical symptoms. According to standard practice, the 31 patients with clinical symptoms are said to have ‘B’ symptoms, while the other 33 have ‘A’ symptoms. Figure 7.1 shows an apparent survival advantage for patients with A symptoms.

In the discussion that follows, we present a test of the null hypothesis that the survival functions for patients with A and B symptoms are the same, even though their respective Kaplan-Meier estimates, which are based on random samples from the two populations, will inevitably differ. This test, which is called the log-rank or Mantel-Haenszel test, is only one of many tests which could be devised; nevertheless, it is frequently used to compare the survival functions of two or more populations.

The log-rank test is designed particularly to detect a difference between survival curves which results when the mortality rate in one group is consistently higher than the corresponding rate in a second group and the ratio of these two rates is constant over time. This is equivalent to saying that, provided an individual has survived for t units, the chance of dying in a brief interval following t is k times greater in one group than in the other, and the same statement is true for all values of t. The null hypothesis that there is no differ-

1.0

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0.8

 

 

 

 

 

 

 

 

 

 

 

 

A symptoms

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0.6

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Probability

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0.4

 

 

 

 

 

 

 

 

 

 

 

 

B symptoms

 

 

0.2

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

0

 

2

4

 

6

 

8

 

 

 

 

 

 

 

 

 

Time (years)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Fig. 7.1. The Kaplan-Meier estimated survival curves for 33 lymphoma patients presenting with A symptoms and 31 with B symptoms.

ence in survival experience between the two groups is represented by the value k = 1, i.e., a ratio of one. As we indicated in chapter 6, time is usually measured from some well-defined event such as diagnosis.

7.2. Details of the Test

The basic idea underlying the log-rank test involves examining each occasion when one or more deaths (or events) occurs. Based on the number of individuals in each group who are alive just before the observed death time and the total number of deaths observed at that time, we can calculate how many deaths would be expected in each group if the null hypothesis is true, i.e., if the mortality rates are identical. For example, if Group 1 has six individuals alive at t units and Group 2 has three, then the observed deaths at t should be distributed in the ratio 2:1 between the two groups, if the null hypothesis is true.

7

The Log-Rank or Mantel-Haenszel Test for the Comparison of Survival Curves

68

If three deaths actually occurred at t units, then we would expect two in the first group and one in the second group. If only one death had actually occurred at t, then we would say that the expected number of deaths in Group 1 is 2/3 and in Group 2 is 1/3. Notice that the expected number of deaths need not correspond to a positive integer.

To complete the log-rank test we add up, for the two groups separately, the observed and expected numbers of deaths at all observed death times. These numbers are then compared. If O1 and O2 are the observed numbers of deaths in the two groups and E1 and E2 are the expected numbers of deaths calculated by summing the expected numbers at each event time, then the statistic used for comparison purposes is

 

(O

E )2

(O

E

)2

 

T =

1

1

+

2

2

 

.

 

E1

 

E2

 

 

 

 

 

 

 

If the null hypothesis is true, T should be distributed approximately as a21 random variable (chi-squared with one degree of freedom). Let to represent the observed value of T for a particular set of data; then the significance level of the log-rank test is given by Pr(T 6 to), which is approximately equal to Pr( 21 6 to). Therefore, we can use table 4.10 to evaluate the significance level of the log-rank test (see §4.4).

Comments:

(a) If we wish to compare the survival experience in two groups specified, say, by treatment, but it is important to adjust the comparison for another prognostic factor, say stage of the disease, then a stratified log-rank test may be performed. In this case, study subjects must be classified into strata according to stage, and within each stratum the calculations of observed and expected numbers of deaths for a log-rank test are performed. The test statistic, T, is computed from values of O1, O2, E1 and E2 which are obtained by summing the corresponding observed and expected values from all the strata.

The effectiveness of stratification as a means of adjusting for other prognostic factors is limited because it is necessary, simultaneously, to retain a moderate number of subjects in each stratum. If we wish to adjust for a number of prognostic factors then, in principle, we can define prognostic strata within which prognosis would be similar, except for the effect of treatment. However, as the number of prognostic factors increases, the strata soon involve too few subjects to be meaningful. Unless the study is very large, the use of more than six or eight strata is generally unwise. Stratification is probably most effective with two to four strata, especially since there are procedures which are more useful when the number of prognostic factors is large. These procedures will be discussed in chapter 13.

Details of the Test

69

Соседние файлы в папке Английские материалы