Добавил:
Upload Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:

Словари и журналы / Психологические журналы / p41British Journal of Mathematical and Statistical Psycholog

.pdf
Скачиваний:
58
Добавлен:
29.05.2015
Размер:
319.18 Кб
Скачать

Fitting structural equation models

51

covariance matrix by a robust covariance matrix in the normal theory based likelihood function, Yuan and Bentler (1998a) investigated the rescaled likelihood ratio statistic for SEM.

Now let t = tr(QG)/( p ± q) be the average of the non-zero eigenvalues of QG; then a rescaled statistic can also be constructed as

 

 

ˆ

 

T

=

nF[S, S(b)]

,

(22)

 

R

 

 

 

 

 

where tˆ is a consistent estimate of t with the unknown parameters in Q being replaced by their consistent estimates and G by Sy. Empirical results in Curran, West, and Finch (1996), Hu, Bentler, and Kano (1992), and Yuan and Bentler (1998b) indicate that the test statistic TR performs quite well under a variety of distribution conditions when F = FML and the model Žtting is through a simultaneous estimation process. More empirical studies are needed to verify the performance of TR in (22).

Suppose we use the same method for estimating g as for u, that is, a simultaneous estimation process; then U = (sÇ¢WsÇ)± 1¢W. If W = G± 1, then QG is a projection matrix of rank p ± q, t = 1 and the correction factor is unnecessary. This is just the known result about ADF. If W = W1 in the simultaneous estimation process, then TR becomes the statistic studied by Satorra and Bentler (1988, 1994).

5. Applications

The methods developed are applied to two data sets in this section. The Žrst data set is from Holzinger and Swineford (1939) and consists of 24 cognitive test scores of 145 seventhand eighth-grade students. Jo¨ reskog (1969) studied a correlation structure of nine variables from this data set. We will also use these nine variables in our study. Our second data set is from the Health Outcomes of Women survey, an ongoing longitudinal study conducted by Marshall (1999). The survey consists of three groups, and our analysis is based on a data set of nine variables with 183 cases from the white group.

Example 1

The nine variables of Holzinger and Swineford’s (1939) are: (1) visual perception, (2) cubes, (3) lozenges, (4) paragraph comprehension, (5) sentence completion, (6) word meaning, (7) addition, (8) counting dots, (9) straight-curved capitals. In the original report of Holzinger and Swineford (1939), variables 1, 2 and 3 were designed to measure spatial ability, variables 4, 5 and 6 were designed to measure verbal ability, and variables 7, 8 and 9 were speed tests. Let x be the vector of the nine observed variables; then the conŽrmatory factor model

 

 

 

 

 

 

x = Lf + e,

 

S =

LFL¢ + W,

 

 

 

(23a)

with

0 0

0

0

 

 

 

0

0

0

1 ,

 

0 f21

1.0 f23 1,

L =

l42

l52

l62

F =

 

B

l11

l21

l31

0

0

0

0

0

0

¢

 

1.0

f12

f13

 

0

0

0

0

0

0

l73

l83

l93

C

 

B f31

f32

1.0 C

 

@

 

 

 

 

 

 

 

 

 

A

 

@

 

A

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(23b)

represents the original design of Holzinger and Swineford. We assume that the measurement errors are uncorrelated, with W being a diagonal matrix.

52 Ke-Hai Yuan and Wai Chan

Previous analyses generally assumed that Holzinger and Swineford’s (1939) data set followed a multivariate normal distribution. However, Mardia’s multivariate skewness and kurtosis (Mardia, 1970, 1974) for these nine variables are M1 = 0.77 and M2 = 3.04, indicating that the data set may come from a distribution with heavier tails than those of a normal distribution when referring these statistics to x2165 and N(0, 1), respectively. We Žt model (23) in the following ways: (a) a simultaneous estimation of all the unknown parameters in the model bythe MLand ADFmethods; (b) Žtting the three submodels each with three indicators separately for factor loading and error variance estimates, and then estimating the factor correlations in the combined model byŽxing the factor loadings and error variances at these estimates. Thus in (b) we have four models to Žt. Our interest is in contrasting the parameter estimates in the different estimation approaches. The multivariate kurtosis for variables 1, 2 and 3 is M21 = 1.38, for variables 4, 5 and 6 is M22 = 1.46, and for variables 7, 8 and 9 is M23 = 2.99; and none of the multivariate skewness values is signiŽcant. It is obvious that the kurtosis M23 is statisticallysigniŽcant. So we mayestimate the Žrst two submodels by normal theory based MLand the third by the ADF method in order to obtain more efŽcientparameter estimates. For the purpose ofcontrasting different estimation processes we Žt the three-indicator submodels and the Žnal combined model in

(b) bythe following approaches: (I) allthe four models are Žtted byML; (II) the Žrst 2 threeindicator models are estimated by ML, the third three-indicator model by ADF, and the combined model by ML; (III) all 3 three-indicator models are estimated by ADF and the combined one by ML; (IV) all the three-indicator submodels are estimated by MLand the combined one by ADF; (V) the Žrst 2 three-indicator models are estimated by ML, the third three-indicator modelbyADF, and the combinedmodelbyADF; (VI) allthe models are estimated by ADF.

Parameter estimates and the corresponding test statistics are reported in Table 1. Because all the parameter estimates are signiŽcant, we omit the standard errors to save space. Notice that with three-indicators the one-factor model is a saturated model, and all the estimation approaches (I–VI) in (b) lead to the same set ofestimates for factor loadings and error variances. Estimation approaches I–III also lead to the same set of estimates of factorcorrelations, and sodo the estimation approaches IV–VI. As can be seen from Table 1, there exist various discrepancies between the parameter estimates by the different approaches. The largest discrepancy occurs with the estimates for w99 between the simultaneous estimation approach (a) and the model segregation approach (b). If model (23) is correct, all the estimates are consistent for w99. Then we have to regard the discrepancy as the result of small sample size. Actually, all the parameter estimates are statistically signiŽcant. How good the model (23) is at describing the relationship among the nine variablesis re•ected bythe test statistics in Table 1. Here the test statistic reported for the simultaneous estimation process by MLis the Satorra and Bentler (1988) rescaled statistic; the statistic for the simultaneous estimation process by ADF is that proposed by Browne (1984); and the statistics for conditions I–VI are calculated according to (22). Because different P matrices are used in (12), the TR statistics for different conditions may not be equal even though the parameter estimates are equal. Here the differences among conditions I–IIIand those among conditions IV–VIare quite small, but this maynot always be the case. The smallest statistic is Satorra and Bentler’s TSB = 49.37, while the largest one is the rescaled statistic TR = 72.94 corresponding to condition IV. All of the statistics are signiŽcant at the 0.01 level, implying that the overall model (23) is not adequate in explainingthe association among the nine variables. This mayalsoexplain whythere exists a substantial difference among the different estimates of w99.

Even though there is no substantial difference in the conclusions regarding model

 

 

 

 

 

Fitting structural equation models

53

Table 1. Parameter estimates and test statistics in Example 1

 

 

 

 

 

 

 

 

 

 

 

 

v

ML

ADF

I

II

III

IV

V

VI

 

 

 

 

 

 

 

 

 

f21

0.54

0.90

0.52

0.52

0.52

0.71

0.71

0.71

f31

0.52

0.68

0.44

0.44

0.44

0.65

0.65

0.65

f32

0.34

0.61

0.28

0.28

0.28

0.37

0.37

0.37

T

49.37

57.92

49.29

49.29

49.29

72.94

72.44

72.44

 

 

 

 

 

 

 

 

 

g

ML

ADF

I–VI

 

 

 

 

l11

4.68

4.41

4.09

l21

2.30

1.29

2.45

l31

5.77

4.48

6.30

l42

2.92

2.82

2.92

l52

3.86

4.02

3.86

l62

6.58

6.72

6.56

l73

15.68

16.33

16.21

l83

16.71

15.88

18.08

l93

25.96

29.70

22.72

w11

25.92

18.47

31.05

w22

14.49

14.27

13.77

w33

35.90

42.11

29.45

w44

2.86

2.45

2.86

w55

6.75

5.98

6.72

w66

20.03

15.57

20.13

w77

319.86

299.64

302.71

w88

161.58

75.41

113.94

w99

697.94

467.14

855.25

(23) by the different procedures, there is a difference in the speed of convergence. For example, with default initial values in the EQS set-up (Bentler, 1995) the simultaneous estimation process with ML needs 82 iterations to converge. However, for the same estimation methods and default initial values the 3 three-indicator submodels need respectively 7, 6 and 9 iterations to converge, while the combined model for the estimation of factor correlation needs only 2 iterations. So convergences with the segregated models are much easier.

This example illustrates that one can Žt a model in parts according to the procedures developed in this paper. Even though various discrepancies exist among the different approaches, the conclusion regarding model (23) and its parameters is basically the same. That is, all the parameters are necessary in the parameterization in (23) but the model is not adequate in explaining the relationship among the variables.

Example 2

Because model (23) is statisticallynot adequate in explaining the relationship among the

54 Ke-Hai Yuan and Wai Chan

nine observed variables, Jo¨ reskog (1969) proposed a factor model with

1.0 f23 1.

L =

0

0

0

0

l42

l52

l62

0

0

0

1

,

F =

0f21

 

B

l11

l21

l31

0

0

0

0

l81

l91

C

¢

 

1.0

f12

0

 

0

0

0

0

0

0

l73

l83

l93

 

 

B 0

f32

1.0 C

 

@

 

 

 

 

 

 

 

 

 

A

 

 

@

 

A

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

(24)

Because there are double loadings for x8 and x9 in (24), it can be shown that the model based on (x7, x8, x9) is not identiŽable if l81 and l91 are included in the model; and the estimates for l73, l83 and l93 are not consistent if l81 and l91 are ignored. However, it can be easilyshown that the factor loadings l11, l21, l31, l42, l52, l62 and the error variances for the Žrst six variables can stillbe consistentlyestimated byŽtting the two three-indictor submodels. In addition to the simultaneous estimation by the MLand ADF approaches, the following four model segregation approaches are also investigated: (I) both the two three-indicator submodels and the combined model are estimated by ML; (II) the two three-indicator models are estimated by ADF, the combined model by ML; (III) both the three-indicator submodels are estimated by ML, and the combined one by ADF;

Table 2. Parameter estimates and test statistics in Example 2

v

ML

ADF

I

II

III

IV

 

 

 

 

 

 

 

l81

6.17

6.66

5.64

5.64

6.19

6.19

l91

20.82

23.92

19.20

19.20

24.42

24.42

l73

19.95

19.99

19.17

19.17

22.15

22.15

l83

13.98

12.93

14.63

14.63

11.38

11.38

l93

16.62

15.69

17.75

17.75

16.97

16.97

f21

0.48

0.53

0.49

0.49

0.58

0.58

f32

0.17

0.19

0.17

0.17

0.21

0.21

w77

167.41

115.58

198.06

198.06

46.76

46.76

w88

193.39

128.07

182.91

182.91

139.86

139.86

w99

605.78

613.56

637.24

637.24

694.33

694.33

T

24.73

25.03

27.32

27.32

32.29

32.29

 

 

 

 

 

 

 

g

ML

ADF

I–IV

 

 

 

 

l11

5.03

5.23

4.09

l21

2.17

2.25

2.45

l31

5.53

5.39

6.30

l42

2.90

2.63

2.92

l52

3.85

3.66

3.86

l62

6.51

6.10

6.56

w11

22.46

17.42

31.05

w22

15.04

14.71

13.77

w33

38.60

38.48

29.45

w44

2.88

2.89

2.86

w55

6.62

5.59

6.72

w66

20.25

17.69

20.13

Fitting structural equation models

55

(IV) both the two three-indicator submodels and the combined model are estimated by ADF.

Parameter estimates and the corresponding test statistics by different methods are reported in Table 2. As in Example 1, all these estimates are statisticallysigniŽcant. Even though the differences among the different approaches for estimates of w99 become smaller, there is a substantial difference between estimates of w77. With regard to test statistics, the smallest one is given by the Satorra and Bentler statistic, while the largest one is the rescaled statistic TR for conditions III and IV. None of them is statistically signiŽcant at the 0.5 level, implying that model (24) is reasonable in explaining the relationship among the nine observed variables.

Even though model (24) Žts the data better, various discrepancies still exist among the estimates by different procedures. Similarly, the various test statistics are still in agreement regarding the model’s adequacy. This example also points out the fact that only identiŽable submodels can be used in the Žrst stage to obtain gˆ .

Example 3

The Health Outcomes of Women survey concentrates on the longitudinal effect of partner’s abuse on the health of women from low-income families. The four variables measured at time 1 in x = (x1, x2, x3, x4)¢ are psychological abuse, partner’s threats, partner’s violence and partner’s sexual aggression, together constituting a partner’s violence factor. There are Žve dependent variables in y = ( y1, . . . , y5)¢. Variables y1 and y2 are measures of self-esteem and stress, together forming an overall stress factor at time 1. The time 2 measurements y3 to y5 are measures of physical health, quality of health and health perception, constituting a health condition factor. In LISREL( Jo¨ reskog & So¨ rbom, 1993) notation, the substantive theory, that family violence at time 1 leads to stress on women at time 1 and that these together predict health condition in the subsequent wave, can be expressed as

 

x = Lx y + d,

y = Ly h + «

 

 

 

(25a)

and

 

 

 

 

 

 

 

 

 

h = Bh + Gy + z,

 

 

 

(25b)

where

¡1 l1 l2 l3 ¢¢

Ly = Á0

04

1

l5

l6

! ,

Lx =

 

 

Áb21

1

l

0

0

0

¢

 

B =

0 !,

G =

 

¡g11 g21 ¢¢.

 

 

0

0

 

 

 

 

 

and «, d and z are vectors of uncorrelated errors. Our interest here is in evaluating the model structure implied by (25). The problem with this data set is that the MLE of the error variance ( vd2 ) of x2 is negative (a Heywood case). As discussed in the earlier sections, we Žrst estimated the one-factor model x = Lx y + d; however, the error variance estimate for x2 based on this submodel is still negative. Further experiment shows that the submodel implied by x1, x2 and x4, which is just a three-indicator, onefactor model, leads to a positive estimate for vd2. By treating this as a Žxed parameter in the ML estimation process, the Heywood case is avoided. Table 3 reports the parameter estimates by different estimation methods, together with the corresponding

56

Ke-Hai Yuan and Wai Chan

 

 

 

Table 3. Parameter estimates and test statistics in Example 3

 

 

 

 

 

 

 

v

ML

ADF

ML-ML

 

 

 

 

 

 

l1

0.74

0.53

0.57

 

l2

0.33

0.22

0.27

 

l3

0.24

0.10

0.24

 

l4

± 0.71

± 0.64

± 0.70

 

l5

1.52

1.18

1.53

 

l6

1.49

1.16

1.61

 

g11

± 0.15

± 0.25

± 0.17

 

g21

± 0.07

± 0.16

± 0.09

 

b21

0.44

0.48

0.37

 

f

3.25

2.35

4.20

 

w11

0.70

0.79

0.67

 

w22

0.41

0.62

0.36

 

vd 1

2.35

0.54

1.40

 

vd 3

0.31

0.01

0.37

 

vd 4

0.60

0.05

0.54

 

v«1

0.59

0.37

0.58

 

v«2

0.33

0.37

0.33

 

v«3

1.02

0.51

1.08

 

v«4

2.64

2.21

2.75

 

v«5

0.35

0.29

0.25

 

T

79.21

55.06

90.02

 

 

 

 

 

 

g

ML

ADF

ML-ML

 

 

 

 

 

 

vd 2

± 0.06

0.10

0.48

test statistics. The notation ML-MLis for the model segregation approach in which both the three-indicator model and the combined model are estimated by the normal theory based maximum likelihood. Notice that we only Žxed vd2 in estimating the combined model; the other parameters of the three-indicator model are re-estimated. Of course, we can also use the ADFmethod in the simultaneous estimation approach, which also avoids the Heywood case, as can be seen from Table 3. All the test statistics are statistically signiŽcant when referring to the chi-square distribution with 24 degrees of freedom. As in the previous examples, there are some differences in the parameter estimates.

This example demonstrates that when an improper solution exists one can use the method developed in this paper to obtain a set of reasonable solutions.

Because of our limited approach to problematic data sets, the three examples based on the two data sets are used for the purpose of showing how to apply the proposed procedures in practice. As a matter of fact, the traditional methods work quite well with Examples 1 and 2, and the ADF method overcomes the Heywood case in Example 3. However, there are problematic data where traditional methods maynot solve the issues as discussed in the previous sections, and the procedure developed may apply. Actually, due to issues such as non-convergence or there being only a single indicator,

Fitting structural equation models

57

problematic data sets are seldom reported. We hope the procedure developed can help future researchers in putting more of these data sets into the literature.

Finally, we observe that one can use standard SEM software (e.g., AMOS, EQS, LISREL) to obtain parameter estimates gˆ and uˆ , but one has to use the procedures in Section 3 to obtain correct standard errors of uˆ and the procedure in Section 4 for an overall model evaluation.

6. A simulation example

The three examples in the previous section indicate that there may be various discrepancies among the parameter estimates and test statistics obtained by different procedures. These discrepancies are generallycaused bytwo sources of errors: sampling error, which occurs when the overall model is correctly speciŽed; and systematic error plus sampling error, which occurs when the overall model is misspeciŽed. For all the three examples in the previous section the submodels for estimating g are correct because they are saturated. However, if the overall model is misspeciŽed the estimates based on the submodel and those based on the overall model will not converge to the same set of values. Even though the overallmodel in Example 2 is not signiŽcant, we still do not know whether model (24) is correctly speciŽed. These considerations lead us to present the following simulation example. Let l = (0.60, 0.70, 0.80)¢,

L =

0 0

l 0 1,

F =

0 0.30

1.00

0.50 1,

 

l

0

0

 

B

1.00

0.30

0.40

 

B 0

0

l C

 

0.40

0.50

1.00 C

 

@

 

A

 

@

 

 

A

and W be a diagonal matrix such

that S = LFL¢ + W is

a correlation matrix. Let

f ,N(0, F) and e ,N(0, W) be independent; then x = Lf + e follows N(0, S). With the normal data all the three estimating functions in (9) are optimal. Thus V12 = 0 and the covariance matrix of uˆ (gˆ ) is given by (5). As in Example 1, factor loadings and error variances are estimated by 3 three-indicator submodels. To save space, in addition to reporting the parameter estimates and test statistics corresponding to the simultaneous estimation procedure by MLand ADF, we only report the results of estimating (I) all the three-indicator submodels and the combined model by the normal theory based maximum likelihood (ML-ML), (II) and all the three-indicator submodels and the combined model by the ADF procedure (ADF-ADF). These two procedures correspond to conditions (I) and (VI) in Example 1.

The results of three replications of the above simulation conditions with sample size 200 are given in Tables 4–6. There is little discrepancy between parameter estimates based on either of the simultaneous estimation processes and those based on the corresponding model segregation approach in any of these replications. Actually, because the overall model is correctly speciŽed, the discrepancy among the different parameter estimates by the simultaneous and segregation approaches is only caused by sampling errors, which is re•ected in Tables 4–6. On the other hand, the discrepancies in Tables 1–3 may be caused by systematic errors in addition to sampling errors. For normal data, parameter estimates in the simultaneous procedures are more efŽcient, as re•ected by the slightly larger standard errors from the model segregation approaches. There are also a few exceptions due to the Žnite-sample effect.

Furthermore, there is little difference among the different test statistics. In replication 1, the test statistic TR for condition I is only slightly greater than the corresponding

58 Ke-Hai Yuan and Wai Chan

Table 4. Comparison of simultaneous estimation with segregated estimation, replication 1

 

 

ML

 

 

ML-ML

 

 

ADF

 

ADF-ADF

 

 

 

 

 

 

 

 

 

 

 

 

 

v

v0

ˆ

SD

 

ˆ

SD

 

ˆ

SD

 

ˆ

SD

v

 

v

 

v

 

v

f21

0.300

0.252

0.089

0.244

0.090

0.265

0.085

0.269

0.087

f31

0.400

0.435

0.079

0.430

0.080

0.436

0.074

0.453

0.074

f32

0.500

0.438

0.081

0.430

0.083

0.437

0.070

0.410

0.072

T

 

TML = 23.832

 

 

TR = 24.374

 

TADF = 27.204

 

TR = 27.585

 

 

 

 

 

 

 

 

 

 

 

 

 

g

g0

SD

 

SD

 

SD

 

SD

 

 

 

 

 

 

 

 

 

 

l11

0.600

0.739

0.077

0.737

0.078

0.735

0.070

0.737

0.075

l21

0.700

0.786

0.074

0.768

0.075

0.738

0.068

0.768

0.077

l31

0.800

0.807

0.076

0.828

0.078

0.810

0.075

0.828

0.086

l42

0.600

0.655

0.079

0.668

0.081

0.692

0.071

0.668

0.078

l52

0.700

0.618

0.075

0.651

0.078

0.650

0.070

0.651

0.078

l62

0.800

0.772

0.077

0.733

0.080

0.739

0.061

0.733

0.077

l73

0.600

0.532

0.070

0.517

0.073

0.534

0.060

0.517

0.066

l83

0.700

0.603

0.070

0.614

0.075

0.622

0.064

0.614

0.072

l93

0.800

0.851

0.075

0.852

0.086

0.803

0.076

0.852

0.091

w11

0.640

0.612

0.083

0.616

0.085

0.583

0.074

0.616

0.083

w22

0.510

0.470

0.077

0.497

0.081

0.463

0.066

0.497

0.077

w33

0.360

0.510

0.083

0.476

0.088

0.464

0.092

0.476

0.115

w44

0.640

0.637

0.086

0.619

0.089

0.599

0.075

0.619

0.086

w55

0.510

0.598

0.079

0.556

0.082

0.562

0.073

0.556

0.085

w66

0.360

0.398

0.086

0.458

0.090

0.348

0.072

0.458

0.096

w77

0.640

0.615

0.071

0.630

0.075

0.543

0.060

0.630

0.073

w88

0.510

0.524

0.068

0.511

0.077

0.497

0.060

0.511

0.068

w99

0.360

0.288

0.087

0.287

0.114

0.347

0.073

0.287

0.115

TML, and TR for condition II is slightly greater than TADF. With replication 2, the TR for condition I is slightly smaller than TML; the TR corresponding to condition II is also slightly smaller than TADF. With replication 3, the TR is slightly greater than TML in condition I; while TR for condition II is a little greater than TADF. In contrast to the test statistics in Tables 1–3, the statistics in Tables 4–6 have much smaller discrepancies. This implies that, when the overall model is correctly speciŽed, the rescaled statistics are more like those based on simultaneous estimation approaches. Even though gˆ or

ˆ

 

 

 

 

 

 

 

 

u(gˆ ) in the model segregation approach is not fully efŽcient, their effect on the test

statistics is not substantial.

performed with SAS IML. Let z1 ,N3(0, I) and

 

 

The

above simulation was

z

2

,N

(0, I) be independent;

then x = LF1/2z

1

+ W1/2z

2

satisŽes the speciŽed

 

9

 

 

 

covariance structure of the factor model. The three replications were obtained by setting the initial SEED= 1234567 and the normal variables in z1 and z2 were generated in sequence by the function NORMAL(SEED). We hope these details will help readers who wish to verify the results in Tables 4–6.

 

 

 

 

 

 

 

 

Fitting structural equation models

59

Table 5. Comparison of simultaneous estimation with segregated estimation, replication 2

 

 

 

 

 

 

 

 

 

 

 

 

 

 

ML

 

 

ML-ML

 

 

ADF

 

ADF-ADF

 

 

 

 

 

 

 

 

 

 

 

 

 

v

v0

ˆ

SD

 

ˆ

SD

 

ˆ

SD

 

ˆ

SD

v

 

v

 

v

 

v

f21

0.300

0.303

0.086

0.293

0.085

0.286

0.085

0.300

0.083

f31

0.400

0.447

0.080

0.443

0.080

0.362

0.076

0.403

0.071

f32

0.500

0.540

0.074

0.524

0.074

0.542

0.075

0.547

0.074

T

 

TML = 13.796

 

 

TR = 13.760

 

TADF = 16.698

 

TR = 16.317

 

 

 

 

 

 

 

 

 

 

 

 

 

g

g0

SD

 

SD

 

SD

 

SD

 

 

 

 

 

 

 

 

 

 

l11

0.600

0.560

0.072

0.544

0.073

0.500

0.068

0.544

0.072

l21

0.700

0.709

0.073

0.714

0.077

0.701

0.071

0.714

0.077

l31

0.800

0.793

0.074

0.800

0.080

0.758

0.067

0.800

0.074

l42

0.600

0.617

0.072

0.614

0.075

0.657

0.071

0.614

0.076

l52

0.700

0.759

0.080

0.724

0.083

0.716

0.068

0.724

0.077

l62

0.800

0.855

0.077

0.890

0.084

0.873

0.061

0.890

0.076

l73

0.600

0.576

0.074

0.574

0.076

0.526

0.063

0.574

0.070

l83

0.700

0.762

0.076

0.749

0.083

0.787

0.083

0.749

0.092

l93

0.800

0.747

0.067

0.760

0.076

0.710

0.061

0.760

0.075

w11

0.640

0.623

0.074

0.641

0.076

0.610

0.065

0.641

0.079

w22

0.510

0.461

0.075

0.454

0.083

0.418

0.077

0.454

0.090

w33

0.360

0.378

0.082

0.367

0.094

0.351

0.070

0.367

0.085

w44

0.640

0.606

0.075

0.611

0.078

0.570

0.072

0.611

0.081

w55

0.510

0.634

0.089

0.686

0.097

0.634

0.072

0.686

0.092

w66

0.360

0.405

0.087

0.343

0.109

0.365

0.074

0.343

0.102

w77

0.640

0.672

0.078

0.674

0.081

0.695

0.059

0.674

0.072

w88

0.510

0.550

0.082

0.570

0.096

0.503

0.088

0.570

0.104

w99

0.360

0.338

0.066

0.318

0.085

0.340

0.060

0.318

0.085

7. Discussion

We propose a model segregation procedure for dealing with various issues in covariance structure analysis. Properties of parameter estimates can be characterized by means of estimating equations with nuisance parameters. Overall model evaluation can be performed using a rescaled statistic. Examples illustrate several applications of this procedure in practical data analysis.

As illustrated in the examples in Section 5, substantial discrepancies mayexist among model parameter estimates by different approaches. If the overall model is correctly speciŽed, then any identiŽable submodel will lead to consistent parameter estimates. Discrepancy among the parameter estimates is only due to small sample size; this is illustrated in Section 6. However, when the overall model is misspeciŽed, the parameter estimates from the overall model and those from a submodel may not converge to the same quantity. So, with a large sample size, substantial differences between parameter estimates by different approaches may imply a misspeciŽcation in the overall model.

The potential of the proposed procedure to solve some challenging problems in SEM

60 Ke-Hai Yuan and Wai Chan

Table 6. Comparison of simultaneous estimation with segregated estimation, replication 3

 

 

ML

 

 

ML-ML

 

 

ADF

 

ADF-ADF

 

 

 

 

 

 

 

 

 

 

 

 

 

v

v0

ˆ

SD

 

ˆ

SD

 

ˆ

SD

 

ˆ

SD

v

 

v

 

v

 

v

f21

0.300

0.181

0.092

0.177

0.092

0.197

0.077

0.208

0.077

f31

0.400

0.361

0.086

0.359

0.087

0.426

0.084

0.426

0.085

f32

0.500

0.470

0.078

0.467

0.080

0.455

0.073

0.472

0.070

T

 

TML = 20.138

 

 

TR = 20.488

 

TADF = 23.858

 

TR = 25.754

 

 

 

 

 

 

 

 

 

 

 

 

 

g

g0

SD

 

SD

 

SD

 

SD

 

 

 

 

 

 

 

 

 

 

l11

0.600

0.574

0.076

0.582

0.077

0.548

0.080

0.582

0.087

l21

0.700

0.701

0.076

0.682

0.078

0.674

0.074

0.682

0.081

l31

0.800

0.744

0.076

0.757

0.079

0.768

0.063

0.757

0.075

l42

0.600

0.636

0.072

0.637

0.073

0.612

0.065

0.637

0.069

l52

0.700

0.730

0.072

0.738

0.075

0.688

0.065

0.738

0.086

l62

0.800

0.760

0.072

0.752

0.074

0.726

0.071

0.752

0.074

l73

0.600

0.582

0.073

0.617

0.076

0.522

0.066

0.617

0.074

l83

0.700

0.653

0.077

0.671

0.081

0.683

0.072

0.671

0.083

l93

0.800

0.834

0.076

0.790

0.082

0.794

0.062

0.790

0.075

w11

0.640

0.663

0.081

0.653

0.081

0.643

0.074

0.653

0.080

w22

0.510

0.476

0.081

0.503

0.084

0.435

0.072

0.503

0.084

w33

0.360

0.405

0.083

0.385

0.091

0.355

0.075

0.385

0.085

w44

0.640

0.575

0.073

0.574

0.075

0.594

0.060

0.574

0.066

w55

0.510

0.471

0.074

0.460

0.080

0.451

0.060

0.460

0.076

w66

0.360

0.403

0.074

0.415

0.079

0.387

0.063

0.415

0.073

w77

0.640

0.620

0.076

0.579

0.079

0.537

0.066

0.579

0.077

w88

0.510

0.649

0.084

0.626

0.089

0.574

0.072

0.626

0.087

w99

0.360

0.342

0.088

0.413

0.098

0.368

0.064

0.413

0.095

does not implythat allthe issues discussed can be solvedbythis procedure in practice. For example, when a variable contains an improper error variance estimate and no submodel containing this variable is identiŽable, the procedure developed here is not applicable. In addition to such a limitation, there are also many unknown aspects with the proposed procedure. For example, how the statistic TR behaves in various conditions, especiallyits Type I and Type II errors, may merit a large-scale Monte Carlo study. More experience of this procedure with practical data is also needed for its better application.

AŽnal remark on applying the model segregation approach to overcome Heywood cases is needed. As discussed in van Driel (1978), there are several causes of negative estimates of error variances. When theyoccur with a large enough sample size, it is quite likely that the overall model is misspeciŽed. In such a case, Žxing these error variances at some positive value gˆ may avoid Heywood cases, but it may also excuse a researcher from Žnding a better model. Our recommendation in this regard is that, when parameter estimates under a segregation approach and a simultaneous approach are signiŽcantly different or the corresponding test statistics lead to quite different conclusions regard-