Добавил:
Upload Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Словари и журналы / Психологические журналы / p63British Journal of Mathematical and Statistical Psycholog

.pdf
Скачиваний:
58
Добавлен:
29.05.2015
Размер:
346.02 Кб
Скачать

Outcomes and sample selection

73

self-reported scores from the experiment’s second (last) follow-up interviews: the days clients report using alcohol in the 30-day period before the second follow-up interview; the days they report using drugs in the same period; and the days they report being homeless in 60 days. The interviews were conducted a year after programme entry. The outcome measures and the independent variables (which were also constructed from a baseline interview) generally have acceptable test–retest correlation coefŽcients.8

Independent variables

All outcome equations include as independent variables the two dummyor binary(zeroone) terms representing assignment into the intervention (treatment) conditions. This means that the coefŽcients that are attached to the two variables estimate the impact of what will be called the ‘interventions’. The coefŽcients implicitlycompare outcomes of individuals in each intervention condition to the outcomes of individuals in the control condition, who are provided a score of zero on both variables.

Control variables

All equations hold constant background variables, including the baseline level of the outcome variable, the interactions between the baseline score and each intervention, and various theoretically derived predictors (which differ by the dependent variable; Sosin et al., 1994). They also hold constant background variables that represent extraneous sources of change: the days between the baseline and follow-up interview; and the days of recent residence in situations ( jails, prisons and the like) in which substance abuse or homelessness is not possible. The baseline scores are centred to reduce the multicollinearity that occurs when using interaction terms ( Jaccard, Turrisi, & Wan, 1990).

Strategy and models

All outcome models use tobit regressions, which are appropriate because the outcome variables have large proportions of zero scores. In these tobit analyses, the impacts of the experiments are estimated under the assumption that clients who report outcome scores of zero may still differ in the propensity for substance abuse or homelessness.

The estimation strategies otherwise vary. The conventional basic strategy relies on the data on the acceptors. It predicts the outcomes by the experimental variables and the background variables. The intent-to-treat strategy adds data on the measured outcomes of refusers. It uses the same variables as does the basic strategy. The sample selection strategy relies on the same sample and variables as the basic strategy but adds the sample selection terms. Later sections of the paper discuss other correction strategies.

8Most of the items analysed here stem from the two instruments—the Addiction Severity Index (ASI; McLellan, Luborsky, Woody, & O’Brien, 1980) and the personal history form (PHF; Barrow, Hellman, Lovell, Plapinger, Robinson, & Streuning, 1985). The former is a 45-minute interview designed to determine substance impairment and related social and personal problems; the latter covers housing and demographic information. The reliability of the ASI and PHF has been established by test–retest analyses that utilized data from 20 clients enrolled in the current project, and about 20 from each of nine other homelessness and substance abuse projects funded by the National Institute for Alcohol Abuse and Alcoholism (n = 188 ; Drake, McHugo, & Biesanz, 1995). Intraclass correlation coefŽ cients, which compared baseline and retest responses (provided by the same clients about a week later), generally were above 0.70 on the ASI, and above 0.60 on the PHF. Chicago respondents tended to evince somewhat higher correlations.

74 Michael R. Sosin

4. Results of the application

4.1. Sample traits

Sample traits are reported in Table 1. Results near the top represent baseline traits; those at the bottom represent traits at the follow-up interview, which are the outcome scores.9 The baseline traits for the entire sample are reported at the top of the Žrst column. These roughly mirror the traits of a national sample of homeless adults (Burt & Cohen, 1990). The few differences include the greater proportion of current respondents who are black, and the slightly higher proportion who are women. Large city samples generally tend to have higher proportions of minority members and women, at least to some degree (Burt & Cohen, 1990; these results restrict responses to clients who are represented at both the baseline and second follow-up interview).10 Results reported at the bottom of the Žrst column further show that the outcomes generally improve over

time. This leads to the previously mentioned bunching of outcome scores at zero.

4.2. Group comparisons of traits

The data in the rest of Table 1 compare traits of subgroups of the sample, primarily to illustrate the possible existence of bias. Columns 2–4 thus report the selected traits of the acceptors of each condition; asterisks in the fourth column identify statistically signiŽcant differences between the traits of the three groups. Columns 5 and 6 report selected traits of the refusers of the two intervention conditions (no clients refused the control condition); asterisks in each of the two columns identity statistically signiŽcant differences between the traits of the acceptors and the refusers of a given intervention condition.

Results from the top part of columns 2–4, which mirror the searches for bias that are attempted by many researchers, do not indicate any obvious problem; there are no statistically signiŽcant differences in the three-way comparisons of the acceptors of the experimental conditions. However, the information on the refusers is of more concern. The asterisked quantities in column 5 suggest that housing refusers and acceptors differ on the reported days of homelessness at the time of the baseline interview, while those in column 6 suggest that case-management refusers and acceptors differ on the reported days of drug consumption at the time of the baseline interview. Despite the small sample of refusers, data in column 5 also suggests that housing refusers and acceptors differ to a nearly statistically signiŽcant degree on homeless history. As has been argued, differences between acceptors and refusers may cause bias in conventional calculations—even if the groups of acceptors demonstrate similar traits.

While not subjected to signiŽcance tests, results in the bottom portion of columns 5 and 6 suggest that there are differences in the outcomes between the control group and the refusers. This implies that the refusers are not a random sample of the original clients, in that their outcomes differ from those of untreated individuals who clearly comprise a random sample. Again, the implication is that biases may occur in analyses that exclude information on the refusers.

The results in Table 1 also verymodestly point to some of the difŽculties that emerge

9The statistical package used was LIMDEP (Econometric Software, Inc.).

10See Sosin, Colson, & Grossman (1988), Stahler et al. (1993) and Sosin et al. (1994). While this is not presented in the table, sample members also have the somewhat higher incomes that apparently typically characterize homeless clients found in programmes.

Outcomes and sample selection

75

in the intent-to-treat analysis strategy—in particular, that this strategy is often based on the questionable assumption that little is changed by including the refusers as if they participated, since most have few symptoms and thus would not beneŽt from participation. Even if the results indicate the refuser’s baseline symptoms are often less severe than those of the acceptors, they also suggest that this is not always the case.

As might be expected, the outcome scores of the refusers are poorer than those of the acceptors, at least in Žve of the six comparisons. But while proponents of the intent-to-treat strategy normally argue that such outcomes are further evidence that refusers would not improve much if they participated (Simpson, 1981; Pekarik, 1983; Powell et al., 1987), a case can also be made that the failure to participate suppresses the experimental effects; refusers generally have less severe symptoms at the baseline interview but fare worse at the follow-up interview. In general, the data are open to debate concerning how the refusers would fare had theyparticipated. Further, while the data suggest that experimental clients fare better than control clients, it is not clear how refusal might affect these differences.

4.3. Outcome results: Three strategies

Basic strategy

Tables 2–4 compare results from the three strategies. In all of the represented equations, the coefŽcients that are attached to the experimental conditions show the calculated effects of the ‘interventions’ on the outcome variables. Technically speaking, the effect sizes are put in terms of ‘units’ because of the transformation of outcome variables in tobit regressions.

The Žrst column of each table reports results for the basic strategy. The results in Table 2 reveal that both the housing and case-management interventions reduce alcohol use. Those in Table 3 suggest that neither intervention has a statistically signiŽcant impact on drug use. Those in Table 4 show that the effect of the housing intervention on homelessness is large though not quite statistically signiŽcant, but that the effect of the case-management intervention on homelessness is statistically signiŽcant. These models suggest reasonable substantive results, in that there is other evidence that a case-management programme can outperform an inpatient programme (Moos, King, & Patterson, 1996).

In general, effect sizes are moderate. Depending on the outcome, they range between about 2 and 7 units for the housing intervention. CoefŽcients are between approximately 3 and 13 units for the case-management intervention.

Intent-to-treat strategy

The second columns of the tables reports analyses from the intent-to-treat strategy. In brief, the results continue to suggest the same statistically signiŽcant treatment effects—at least when using one-tailed tests of signiŽcance for these hypothesized relations. But the coefŽcients are attenuated. For example, Table 2 suggests that the estimate from the intent-to-treat analysis of the impact of the case management intervention on alcohol use is almost four units less than the estimate from the basic analysis.

As might be expected, estimates are suppressed because refusers generally do not fare as well as acceptors. This is illustrated in most of the Žndings from equations that predict the outcomes of refusers, alone, against those of the clients in the control

Table 1. Sample traits by condition and acceptance status

 

 

 

 

Comparisons among acceptors

 

 

 

 

Compare to acceptors

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(2)

 

(3)

 

(4)

 

(5)

 

(6)

 

 

 

(1)

 

Housing

 

Case mgmt.

 

Control

 

 

Housing

 

Case mgmt.

 

 

 

Total

Standard

acceptors

Standard

acceptors

Standard

condition

Standard

 

refusers

Standard

refusers

Standard

 

 

N = 483

deviation

N = 121

deviation

N = 76

deviation

N = 140

deviation

 

N = 11

deviation

N = 137

deviation

 

 

 

 

 

 

 

 

 

 

 

 

 

Average age

34.92

7.70

35.47

8.22

35.25

6.44

34.08

7.24

36.16

9.83

35.03

8.18

Proportion female

0.25

0.44

0.29

0.45

0.25

0.41

0.23

0.43

0.22

0.44

0.25

0.45

Proportion black

0.91

0.28

0.91

0.29

0.93

0.24

0.92

0.28

0.81

0.42

0.91

0.29

Years of education

11.89

1.78

11.76

1.91

11.86

1.48

11.86

1.73

12.49

1.74

12.01

1.87

Proportion ever married

0.47

0.50

0.52

0.50

0.41

0.46

0.39

0.49

0.74

0.47

0.50

0.52

Months homeless

 

 

 

 

 

 

 

 

 

 

 

 

 

experience

25.26

47.03

27.19

32.14

28.22

35.38

25.32

70.85

14.36

14.76

23.11

33.20

Baseline scores

 

 

 

 

 

 

 

 

 

 

 

 

 

Days drinking in 30,

17.82

11.74

19.61

11.36

16.42

11.31

17.04

11.82

16.78

13.85

17.79

12.04

Days drug use in 30,

15.35

12.48

15.37

12.15

17.87

11.67

16.34

12.52

8.74

10.82

13.75*

13.04

Days homeless in 60,

26.31

22.58

30.38

23.18

24.75

22.36

28.09

22.68

10.66**

15.82

23.09

21.66

Follow-up scores

 

 

 

 

 

 

 

 

 

 

 

 

 

Days drinking in 30,

6.42

8.51

5.01

8.47

3.31

6.59

8.02

9.99

10.12

12.83

7.46**

10.73

Days drug use in 30,

5.43

9.49

4.79

9.31

5.19

9.21

6.04

9.46

2.49

3.91

5.77

10.23

Days homeless in 60,

8.02

15.11

7.12

13.37

4.60

11.07

9.45

16.95

13.26

22.21

8.71

16.01

Drinking difference

± 11.55

13.37

± 14.57

12.78

± 12.85

12.59

± 9.66**

12.83

± 6.54

20.17

± 10.35

13.98

Drug use difference

± 9.97

13.84

± 10.60

13.34

± 12.22

13.10

± 10.69

14.75

± 7.98

8.99

± 7.58*

13.93

Homelessness difference

± 18.92

25.68

± 23.50

25.03

± 19.50

26.26

± 18.72

27.22

0.94**

22.21

± 16.01

23.53

Note. For differences among acceptors or acceptor/rejector comparison: *p < 0.05; **p < 0.01.

Sosin .R Michael 76

 

 

 

Outcomes and sample selection

77

Table 2. Impacts of interventions on alcohol use: Tobit regressions

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(2)

 

(3)

 

 

 

(1)

 

Intent to

 

Sample

 

 

 

Basic

Standard

treat

Standard

Selection

Standard

 

(n = 236)

error

(n = 330)

error

(n = 236)

error

 

 

 

 

 

 

 

 

Housing condition

± 5.39*

2.18

± 3.67²

2.25

± 6.54*

 

2.92

Case mgmt. only condition (CMO)

± 9.85**

2.76

± 6.06**

2.13

± 16.30**

 

5.50

Background factors

 

 

 

 

 

 

 

Housing* alcohol use

0.02

0.19

± 0.05

0.20

0.02

 

0.19

CMO* alcohol use

± 0.23

0.23

0.03

0.18

± 0.18

 

0.30

Alcohol use

0.30*

0.13

0.30*

0.14

0.30*

 

0.13

Days from baseline/100

1.07

0.85

1.18

0.71

0.80

 

0.90

Female

1.62

2.21

0.98

2.19

1.91

 

2.34

Esteem level

± 0.77**

0.20

± 0.59**

0.19

± 0.75**

 

0.22

Depression level

± 1.22

0.66

± 0.30

0.59

± 1.16

 

0.74

Anxiety level

0.22

0.20

0.10

0.19

0.20

 

0.21

Legal problems

13.68*

5.33

9.71

4.82

15.00*

 

5.91

Days in controlled environment

± 0.29

0.18

± 0.53**

0.16

± 0.33

 

0.20

Other parameters

 

 

 

 

 

 

 

Constant

19.72*

9.06

11.75

7.96

19.99

 

10.36

Sigma (tobit equation)

12.96**

0.90

13.70**

0.82

 

Sigma (correction)

13.23**

 

1.28

Rho

0.50

 

0.38

Note: *p < 0.05, **p < 0.01. For one-tailed test (intervention only), ² p < 0.05.

condition (when including the same control variables). These results suggest that, with respect to alcohol-use outcomes, housing refusers fare 2.7 units worse than control clients, and case-management clients fare only 4.3 units better; with respect to druguse outcomes, housing refusers fare 7.8 units better than control clients, but casemanagement refusers fare only 2.3 units better; with respect to homelessness outcomes, housing refusers fare 14.1 units worse, and case-management refusers fare only 6.6 units better. None of these Žndings is statistically signiŽcant at the 0.05 level.

Sample selection results: Outcome equations

The estimates that are calculated under the (maximum likelihood) sample selection analysis strategy are found in the last column of the three tables. For the sake of presentation, the sigmas (standard errors) for the sample selection models are reported separately from the sigmas for the other tobit analyses. We Žrst describe the results for the outcome equations and then discuss the selection equations.

The results for the outcome equations in part conŽrm the importance of the key statistically signiŽcant relations that were located by the basic strategy; the coefŽcients suggest that the housing and case-management interventions reduce alcohol use, and that the case-management intervention reduces homelessness. However, results reported in the last column of Table 3 also suggest that case-management intervention reduces drug use, while those in Table 4 show that housing intervention tends to reduce

78

Michael R. Sosin

 

 

 

 

 

 

Table 3. Impacts of interventions on drug use: Tobit regressions

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(2)

 

(3)

 

 

 

(1)

 

Intent to

 

Sample

 

 

 

Basic

Standard

treat

Standard

Selection

Standard

 

 

(n = 234)

error

(n = 318)

error

(n = 234)

error

 

 

 

 

 

 

 

Housing condition

± 2.39

2.71

± 2.36

2.86

± 4.16

4.32

Case mgmt. only condition (CMO)

± 2.88

3.34

± 2.68

2.68

± 15.29*

6.03

Background factors

 

 

 

 

 

 

Housing* drug use

0.02

0.22

± 0.06

0.23

0.06

0.27

CMO* drug use

± 0.04

0.25

0.12

0.21

0.06

0.30

Drug use

0.16

0.17

0.18

0.17

0.13

0.23

Days from baseline/100

1.49

1.03

1.59

0.87

0.78

1.04

Female

 

2.60

2.74

2.46

2.61

2.06

2.85

Esteem level

± 0.90**

0.27

± 0.66**

0.24

± 0.96**

0.31

Depression level

± 1.50

0.81

± 0.88

0.74

± 1.52

0.89

Anxiety level

0.42

0.24

0.28

0.23

0.36

0.26

Legal problems

22.20**

6.71

13.31*

5.99

23.15**

6.97

Recruitment site 1

± 1.88

3.47

± 3.16

3.06

± 4.58

4.76

Days in controlled environment

± 0.18

0.21

± 0.29

0.19

± 0.16

1.85

Other parameters

 

 

 

 

 

 

Constant

10.99

11.53

4.97

9.93

17.75

14.06

Sigma (tobit equation)

15.11**

1.16

16.27**

1.06

Sigma (correction)

15.89**

1.82

Rho

 

0.75**

0.24

Note: *p < 0.05, **p < 0.01.

homelessness. Further, effect sizes are larger in the sample selection models—for example, on alcohol use, the impact of case-management services is estimated at 16 units in the sample selection model, compared to under 10 according to the basic strategy, while the effect of housing intervention is 7 units in the sample selection model, compared to 5 under either conventional strategy. The impact of case-manage- ment intervention on drug abuse is estimated at 15 units, compared to under 4 according to each conventional strategy. The impact of housing condition on homelessness is 10 units, compared to a maximum of 7. Of course, sample selection models alter the effect sizes and levels of statistical signiŽcance because they contain a correction term, which in these equations is correlated with the variables representing the intervention conditions and (as rho indicates) with the outcome variables.11

4.4. Details of sample selection models

Selection model

The selection equations reported in the Žrst column of Table 5 includes what might be considered to be six groups of variables. The Žrst two groups are measures of the two

11For the equation predicting alcohol use, the term explains 68% of the variance in the variable representing the case management only condition. Sample selection models typically have relatively high correlations between the correction term and other terms (Stolzenberg & Relles, 1997).

 

 

 

Outcomes and sample selection

79

Table 4. Impacts of interventions on homelessness: Tobit regressions

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(2)

 

(3)

 

 

 

(1)

 

Intent to

 

Sample

 

 

 

Basic

Standard

treat

Standard

Selection

Standard

 

(n = 226)

error

(n = 315)

error

(n = 226)

error

 

 

 

 

 

 

 

 

Housing condition

± 6.84

4.85

± 5.28

4.74

± 9.88²

 

6.03

Case mgmt. only condition (CMO)

± 12.78*

6.05

± 7.29²

4.25

± 33.51**

 

10.34

Background factors

 

 

 

 

 

 

 

Housing* homelessness

0.24

0.21

0.17

0.20

0.24

 

0.25

CMO* homelessness

± 0.37

0.25

0.16

0.18

± 0.34

 

0.26

Days homeless at baseline

0.06

0.13

0.06

0.13

0.08

 

0.15

Days from baseline/100

1.10

1.96

2.81

1.56

0.11

 

2.11

Recruitment site 1

± 0.87

6.01

3.05

4.59

5.78

 

7.12

Female

± 3.82

5.19

± 8.93

4.56

± 3.74

 

5.63

Black

± 15.36*

7.41

± 4.27

6.55

± 16.00

 

9.66

Age

± 0.38

0.32

± 0.54

0.28

± 0.30

 

0.38

Time homeless in past

± 1.49

1.23

± 0.20

0.80

1.64

 

1.73

Days from project start/100

± 0.09

1.77

2.14

1.50

0.00

 

1.83

Government beneŽ ts (dollars)

± 0.00

0.01

0.00

0.01

± 0.00

 

0.11

Days in controlled environment

± 0.00

0.13

± 0.23*

0.11

0.05

 

0.19

Other parameters

 

 

 

 

 

 

 

Constant

26.64

18.71

5.44

15.44

26.57

 

22.16

Sigma (tobit equation)

26.51**

2.16

27.24**

1.83

 

Sigma (correction)

27.43**

 

2.69

Rho

0.74*

 

0.32

Note: *p < 0.05, **p < 0.01. For one-tailed test (intervention only), ² p < 0.05.

intervention conditions and of two background traits (education and race). The other groups of variables stem from the previously mentioned theory of non-participation. Thus, the third group represents military experience and con•ict with others, measuring motivation in terms of traditional indicators of ‘disafŽliation’.

The fourth group of variables measures whether an individual was recruited after the recruitment worker began to fully detail the treatment component of the programme, after this worker began to stress the availabilityof recreational services, and during the high-energy Žrst 90 days of the programme. These variables represent the recruitment procedures that might affect the perceived costs and beneŽts of entering care. The Žfth group measures whether clients were recruited from a site that offered its own extended services, and whether clients previously received help in obtaining welfare beneŽts. These two variables represent access to alternative sources of help which, as mentioned earlier, might reduce the perceived beneŽts of the intervention. The Žnal group of variables measures professional referrals to programmes, and previous experience in drug and alcohol programmes. The two variables represent cues that may increase the perceived beneŽts of treatment. The selection equation excludes measures of baseline homelessness, substance abuse problems and mental health problems. These proved to be unpredictive once the other variables were entered.

As noted earlier, the included variables interact with the experimental conditions.

80

Michael R. Sosin

 

 

 

 

 

 

Table 5. Determinants of acceptance: probit equation (n = 544)

 

 

 

 

 

 

 

 

 

 

 

 

 

Alcohol

Standard

Drug

Standard

Homelessness

Standard

 

 

equation

error

equation

error

equation

error

 

 

 

 

 

 

 

Case management only

 

 

 

 

 

 

 

(CMO)

± 5.27

61 700

± 6.451

18 060

± 4.847

71 950

Housing

± 3.95

61 700

± 3.369

18 060

± 4.160

71 950

CMO* Age

0.054**

.027

0.017

0.021

0.041

0.021

CMO* Con•ict w/others

± 0.030

.044

0.006

.038

± .018

.035

CMO* Past help obtaining

 

 

 

 

 

 

 

welfare

± 0.190

.416

± 0.281

.349

± 0.686*

.337

CMO* No. past alcohol

 

 

 

 

 

 

 

treatments

± 0.011

.106

0.063

.094

± 0.069

0.103

CMO* No. past drug

 

 

 

 

 

 

 

treatments

0.039

.113

± 0.049

.102

0.052

.094

CMO* Years of education

± 0.014

1.00

0.053

0.090

± 0.062

.114

CMO* Professional referral

0.749*

.322

0.545

0.279

0.362

.300

CMO* Black

0.320

.715

.269

.595

0.112

.538

CMO* Recruitment site 1

± 1.743

.908

± 1.927*

.774

± 0.686

.639

CMO* First 90 days of

 

 

 

 

 

 

 

program

± 0.546

.874

0.740

.710

± 0.241

.628

CMO* Days from program

 

 

 

 

 

 

 

start/100

.255

.196

0.250

.198

0.024

.200

CMO* Military service

.288

.683

.850

.659

± .131

.446

CMO* Recruitment emphasis

 

 

 

 

 

 

 

on treatment

0.248

.715

0.538

.610

1.557*

.760

CMO* Recruitment emphasis

 

 

 

 

 

 

 

on treatment

± 1.313

.980

± .451

.721

± 2.115**

.815

Housing* Recruitment site 1

± .957*

.449

± 1.141*

.459

± 1.239**

.457

Housing * First 90 days from

 

 

 

 

 

 

 

program

± 0.361

.457

± .341

.452

± .566

.434

Constant

5.230

61 700

4.680

18 060

5.476

71 950

Note: *p < 0.05, **p < 0.01. For one-tailed test (intervention only), ² p < 0.05.

Few of these interactions involve the housing condition, which has very little refusal to explain.

Selection model: Results

The remaining columns of Table 5 report the results of the selection models. There are three such models since each of the three outcome equations must be estimated simultaneouslyand separatelywith the selection equation (Breen, 1996). From the table it can be seen that each of the three models locates at least one signiŽcant predictor of acceptance. Each also includes other large coefŽcients. All this conŽrms that acceptance is not fully random.

As might be expected, the coefŽcients are similar in size across the three selection models. Nevertheless, there are some differences in statistical signiŽcance, particularly in the interaction terms involving the case-management intervention. For example, in the selection model that is estimated with the outcome equation for alcohol use, there

Outcomes and sample selection

81

is a statistically signiŽcant relationship between acceptance into the case-management condition and having a professional referral. In the selection model that is estimated with outcome equation for homelessness, acceptance into the case management condition is related to help in obtaining welfare beneŽts and the emphasis on treatment in recruitment. In light of such inconsistencies, it seems that the results from selection equations should be treated with a moderate degree of caution. No known studies investigate the accuracy of the selection (not outcome) models.

Table 5 also suggests that all three selection equations estimate high (and equal) standard errors for the intercept and the coefŽcients representing the intervention conditions. This is a function of the complication mentioned earlier. Nevertheless, further calculations corroborate the analyses in the Žrst of the two suggested ways. They indicate that the predicted probability of acceptance among control member clients, which is represented by a transformation of the intercept term, is quite close to the actual 100%in all three equations.12

Note that a single equation is used to estimate determinants of the acceptance of both intervention conditions, which means that an outcome equation uses one Mills ratio as it controls for sample selection. The implicit assumption is that selection affects outcomes in similar ways for housing and case-management clients. This is a reasonable assumption for the example, because the estimates of the effects of the intervention are similar when separate selection models are run for the housing and the case-manage- ment clients. The use of combined models merelyserves to simplifythe discussion, even if it might not always be appropriate.

Predicted outcomes

The second corroborative test can only be calculated after considering predicted, or expected, values of the outcome scores. These help in assessing the potential effects of the experiment by indicating how members of the sample are likely to fare. The predicted values must be calculated for the entire sample—both acceptors and refusers. This re•ects the fact that the sample selection models estimate outcomes for all individuals (Greene, 1992; Breen, 1996). At present, the predicted values are compared to those that are calculated under the intent-to-treat strategy (which also includes both acceptors and refusers). This further explores the consequences of correcting for sample selection.

In sample selection models, the predicted values for the clients who are in an experimental condition, when subtracted from the predicted values for the clients who are in the control condition, will not sum to a score that mirrors the coefŽcient that is attached to the intervention condition. This re•ects the fact that the predicted values also take into account the outcomes that are associated with the propensity to refuse; they depend on the score on the selection terms as well as the coefŽcients that are attached to the intervention condition (and also on the background variables). In the current models, rho is positive (if not always statistically signiŽcant): clients who have a greater propensity to refuse have higher scores on the outcome variables, and thus have worse estimated outcomes. This suggests that the differences between the predicted values for clients in the experimental and the control conditions will be smaller than the coefŽcient-based estimates of treatment effects. Nevertheless, the predicted scores

12Similarly, expected values for members of the control condition, which are reported in Table 6, seem accurate because they are comparable for the conservative and sample selection models.

82 Michael R. Sosin

reported in Table 6 suggest that the sample selection modelling strategy estimates larger effects than the intent-to-treat modelling strategy. In general, estimates from the former are between 4 and 6 units better for clients in the case-management intervention. They are between about 0.5 and 2 units better for clients in the housing intervention. As expected, results for control clients are almost identical under the two strategies.

Other sample selection estimators

The second corroborative analyses consider whether the outcome results in the sample selection models are affected by the previously mentioned statistical problems. They determine whether the predicted values for the case-management clients are replicated in sample selection models that exclude clients who are in other intervention conditions. As previously explained, these corroborative analyses will not have the problematic standard errors. They are undertaken for the case-management condition, which alone has sufŽcient selection to help discern the accuracyof the predicted values. These new selection equations include the same independent variables as the previously reported sample selection modelling equations, although they exclude the interaction terms with the experimental condition and the dummy variables representing the experimental conditions (given that results are only estimated for one condition).

The results indeed corroborate the previously reported sample selection models. The new predicted values are 1 unit of alcohol use, and zero units of drug use and homelessness. These closely match the previously reported values for the case-management clients (1,1 and 1). Indeed, the new models only converge to a maximum likelihood solution when some minor adjustments are made. These primarily add background variables (not measures of costs and beneŽts) to the outcome equations. The need for adjustment probably re•ects the fact that the models exclude the variables that indicate the experimental intervention conditions. They thus have little statistical power and are difŽcult to estimate with a maximum likelihood technique.13

4.5. Discussion

There are many implications if one agrees that the sample selection models are appropriately used here. First, the results suggest that selectivity might not always be ‘fatal’ for conventional estimations; the main statistically signiŽcant results that are found in the conventional analysis strategies are replicated in the sample selection modelling strategy. Second, the results suggest that conventional analyses can underestimate the beneŽts of an experiment when there is post-assignment selectivity; in the example, one or two statistical signiŽcant effects in the sample selection models are not found in the conventional models, and most estimates of the effects of treatment in the sample selection models are larger than in the conventional models. Third, the results warn that inaccuracies may occur in conventional analyses even when experiments only have modest selectivity; use of the selection strategy affects the estimates for the minimally selective housing condition. Of course, all such results might differ for other experiments.

The results from the sample selection models question an assumption behind some intent-to-treat analyses by indicating that selection does not necessarilyeliminate clients

13The outcome equation predicting drug use adds the variables representing military service and education; the equations predicting homelessness and alcohol use add the variable representing education, and exclude this variable from the probit selection equation.