
Словари и журналы / Психологические журналы / p1British Journal of Mathematical and Statistical Psychology
.pdf
1
British Journal of Mathematical and Statistical Psychology (2002), 55, 1–15
© 2002 The British Psychological Society
www.bps.org.uk
A goodness of Ž t test for sparse 2p contingency tables
David J. Bartholomew1* and Shing On Leung2
1London School of Economics and Political Science, UK
2Faculty of Education, University of Macau, Macau
When a model is Ž tted to data in a 2p contingency table many cells are likely to have very small expected frequencies. This sparseness invalidates the usual approximation to the distribution of the chi-squared or log-likelihood tests of goodness of Ž t. We present a solution to this problem by proposing a test based on a comparison of the observed and expected frequencies of the second-order margins of the table. A x2 approximation to the sampling distribution is provided using asymptotic moments. This can be straightforwardly calculated from the expected cell frequencies. The new test is applied to several previously published examples relating to the Ž tting of latent variable models, but its application is quite general.
1. Introduction
We propose a goodness of Žt test for data in 2p contingency tables. Our main interest is in Žtting latent variable models for binary response data, but the method is more general, covering any model, such as the latent class model, which predicts cell frequencies. The test is based on a comparison of the observed and expected frequencies in the two-way margins.
Suppose there are p binary variables which take values 0 or 1. We represent these by the random vector X¢ = (X1, X2, . . . , Xp ). Observed values of these random variables are denoted by x¢ = (x1, x2, . . . , xp ). The vector x is known as a response pattern, and there are 2p different possibilities. A typical data set will consist of a list of the response patterns and their frequencies. In the language of contingency tables, x deŽnes a cell of the table and its associated frequency is the cell frequency.
Latent trait and latent class models for binary data predict the cell frequencies, and
* Requests for reprints should be addressed to David J. Bartholomew, The Old Manse, Stoke Ash, Suffolk, IP23 7EN, UK.

2 David J. Bartholomew and Shing On Leung
we require a means of testing whether the Žt is adequate. The obvious way is to compare the observed and expected frequencies for each response pattern by the usual chi-squared or log-likelihood ratio tests, but in practice this is often difŽcult. The reason is that if 2p is large compared with N, the sample size, many expected frequencies will be too small for the x2 approximation to the sampling distribution to be valid. For example, in a typical case with p = 10 and N = 200 there are 210 = 1024 possible response patterns (cells). On average, the expected frequency will be 0.2 and there will therefore be many much less than this. The difŽculty may be partially overcome by grouping response patterns, but this often has to be so drastic as to make the number of degrees of freedom zero or negative.
There have been several studies of this problem as it arises in latent variable and loglinear modelling (for example, Collins, Fiddler, Wugalter, & Long, 1993; Reiser & VandenBerg, 1994; Langeheine, Pannekoek, & van de Pol, 1996; Reiser, 1996; van der Heijden, Hart, &Dessens, 1997; Bartholomew &Tzamourani, 1999; Reiser & Lin, 1999). These authors generally agree that the usual test may be invalid and propose various ways of dealing with the problem. Bartholomew and Tzamourani (1999), for example, propose a bootstrap approximation supplemented by various diagnostic checks based on residuals. The approach which comes closest to our own is that followed by Reiser (1996) and continued in Reiser and Lin (1999). They propose tests based on Žrstand second-order margins and give a review of the reasons for doing so. We shall compare our approach with their’s in Section 5.
2. The test statistic
In this paper we propose a new, global, test based on the two-way margins of the contingency table. The two-way margins give the frequencies of responses for each of the 12 p(p ± 1) pairs of variables. Thus for variables i and j they will give the frequencies of the responses (1, 1), (1, 0), (0, 1) and (0, 0). If one of these frequencies is known the others are determined from the one-way margins which are regarded as Žxed since they provide no information about pairwise associations. Let nij be the observed frequency for the response (1, 1) on variables i and j. Then our test statistic is deŽned as
|
p p |
(n ± Npij)2 |
|
|
Y = |
Xi= 1 j=Xi+ 1 |
ij |
, |
(1) |
Npij(1 ± pij) |
where Npij is the expected value of nij. The (i, j)th term in Y is a measure of the departure of the bivariate distribution of that pair of variables from that predicted by the model. Since nij is a binomial random variable with parameters N and pij, the individual terms in (1) are distributed approximately as x2 with one degree of freedom. However, the terms are not independent, because the nij are not independent, and so it is not true that Y has a x2 distribution with 12 p(p ± 1) degrees of freedom. In the next section we
show that the distribution of Y can be closelyapproximated by that of a linear function of x2.
As we have deŽned it, the test is for the case where the pij —and hence the model parameters—are known. In practice, this is rarely the case and we shall then have to replace them by sample estimates. We shall investigate the effect that this further approximation has on the distribution.

A goodness of Ž t test for contingency tables |
3 |
3. The approximation to the sampling distribution of Y
Preliminary simulation studies of the sampling distribution of Y for latent trait models showed that its form is similar to the central x2 distribution, especiallyin the upper tail. Since it is possible to derive the exact moments of Y, this suggested Žnding a linear function of x2 which has the same Žrst few moments as Y. This is the basis of our approximation. In the next two subsections we show how to obtain the exact and asymptotic moments of Y: the latter provide a useful approximation to the former.
3.1. The exact moments of Y
Since E(nij ± Npij )2 = Var(nij) = Npij(1 ± pij), it follows that E(Y) = 12 p(p ± 1) whatever the model. The higher moments depend on the moments and cross-moments of the
nij. The derivation is straightforward but tedious, and proceeds as follows.
We introduce a second subscript to identify individual sample members and let |
|
||||||||
|
1 if the hth sample member responds positively to the ith variable, |
|
|||||||
xih = |
»0 otherwise. |
|
|
|
|
|
|
|
|
Then |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
N |
|
|
|
|
and |
|
|
|
nij = |
Xh= 1 xihxjh |
(2) |
|||
|
|
|
|
|
|
|
|
|
|
|
|
pij = Pf xih = 1, xjh = 1g . |
(3) |
||||||
For the second moment, Y2 can be written as |
|
|
|
||||||
|
Y2 = |
Xi<j |
Xk<l |
(nij ± |
Npij)2 |
(nkl ± Npkl)2 |
(4) |
||
|
|
. |
|
. |
|||||
|
Npij (1 ± pij) |
Npkl(1 ± pkl ) |
Since only the nij are random variables, the denominator is a constant. E(Y2) will
depend on the expectation of n2ijn2kl, n2ijnkl, nij n2kl, nijnkl , n2ij , n2kl, nij and nkl. Further details are set out in the Appendix.
The third moment can be obtained by the same method, but the algebra is much heavier. Because of its complexity we obtained numerical values using a Fortran program which Žrst calculates the raw moments and cross-moments of the ns and then substitutes them into the expression for E(Y3). As a partial check on the veryheavy algebra, we also estimated the moments from 10 000 random samples from each of three models which were generated for checking the approximations in Section 3.2 below. Some results are given in Table 1. The entries for N=100, 500 and 1000 in the columns headed ‘Exact’ were calculated from the formulae in the Appendix for the second moment and using the algorithm for the third moment referred to above; the values for N = ¥ were derived by the method given in the next section. The three models are: an ‘independence model’ which supposes the Xs are mutuallyindependent; and two latent trait models (see Section 4.1 below) which are typical of cases found in practice.
The parameter values used are as follows. In the case of the oneand two-factor
models the constant (intercept) parameters ai0 were set equal to zero for all i |
. For the |
||
one-factor model we used, for p = 5, a1 = (0.2, 0.4, 0.6, 0.8, 1.0)¢ |
and, for |
p = 10, |
|
a1 = (0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0)¢. For |
the two-factor |
model we used, |
|
for p = 5, a1 = (0.4, 0.8, 1.2, 1.0, 1.0)¢ and a2 = (1.0, |
1.0, 1.0, 0.4, 0.8)¢; and, for p = 10, |

4 |
David J. Bartholomew and Shing On Leung |
|
|
|
|
|
|
||
Table 1. Exact and simulated moments of Y |
|
|
|
|
|
|
|||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Number of variables (p) |
|
|||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
5 |
|
|
|
|
10 |
|
|
|
|
|
|
|
|
|
|
Model |
|
Moment |
N |
Exact |
Simulated |
|
Exact |
Simulated |
|
|
|
|
|
|
|
|
|
||
Independence |
m2(Y) |
100 |
40 |
40 |
|
328 |
328 |
||
|
|
|
500 |
41 |
40 |
|
331 |
328 |
|
|
|
|
1000 |
41 |
42 |
|
331 |
335 |
|
|
|
|
¥ |
41 |
– |
331 |
– |
||
|
|
m3(Y) |
100 |
423 |
397 |
|
6 281 |
6 576 |
|
|
|
|
500 |
433 |
394 |
|
6 397 |
5 726 |
|
|
|
|
1000 |
434 |
452 |
|
6 412 |
6 681 |
|
|
|
|
¥ |
435 |
– |
6 426 |
– |
||
|
|
|
|
|
|
|
|
||
One-factor |
m2(Y) |
100 |
46 |
45 |
|
637 |
607 |
||
|
|
|
500 |
47 |
46 |
|
642 |
636 |
|
|
|
|
1000 |
47 |
47 |
|
643 |
639 |
|
|
|
|
¥ |
47 |
– |
643 |
– |
||
|
|
m3(Y) |
100 |
599 |
594 |
|
33 186 |
29 189 |
|
|
|
|
500 |
613 |
606 |
|
33 978 |
34 672 |
|
|
|
|
1000 |
615 |
613 |
|
34 078 |
32 498 |
|
|
|
|
¥ |
617 |
– |
34 177 |
– |
||
|
|
|
|
|
|
|
|
||
Two-factor |
m2(Y) |
100 |
62 |
62 |
|
915 |
914 |
||
|
|
|
500 |
63 |
66 |
|
922 |
908 |
|
|
|
|
1000 |
63 |
62 |
|
923 |
903 |
|
|
|
|
¥ |
63 |
– |
924 |
– |
||
|
|
m3(Y) |
100 |
1 126 |
1 130 |
|
66 610 |
65 629 |
|
|
|
|
500 |
1 153 |
1 245 |
|
68 230 |
64 624 |
|
|
|
|
1000 |
1 156 |
1 094 |
|
68 433 |
65 393 |
|
|
|
|
¥ |
1 159 |
– |
68 637 |
—— |
||
|
|
|
|
|
|
|
|
|
|
a1 = (0.4, 0.8, 1.2, 1.6, 2.0, 1.0, 1.0, 1.0, 1.0, 1.0)¢ and a2 = (1.0, 1.0, 1.0, 1.0, 1.0, 0.4, 0.8, 1.2, 1.6, 2.0)¢. For the independence model the ai1 are zero and the ai0 were chosen so
that the Žrst-order margins are the same as those for the one-factor model. These values are typical of those encountered in practice.
The agreement between the exact and simulated moments was generally good, and where there were discrepancies they were not systematic.
3.2. The asymptotic moments of Y
In order to obtain an approximation which can be implemented without the cumbersome calculations required to obtain the exact moments, we now show that asymptotic values

A goodness of Ž t test for contingency tables |
5 |
can be obtained relatively easily. This method may also be extended to obtain higher moments of Y and moments for other quantities of a similar form not discussed here.
We may write
1p(p± 1) |
|
|
|
|
|
|
|
|
Y = 2 Xi= 1 |
ui2 = u¢u, |
|
|
|
|
|
|
|
where u is a vector whose elements are the quantities (n |
± Np |
|
)/ |
Np |
|
(1 ± p |
) in |
|
some convenient order. Asymptotically ui _N(0, 1) for all i.ij |
|
ij |
p |
|
ij |
|
ij• |
|
Let S = E(uu¢) be the correlation matrix of the us. Let z = A¢u, where A is such as to |
||||||||
make the zs the principal components of the us. Then E(zz¢) = |
E(A¢uu¢A) = A¢SA. Now |
S may be written AL A¢, where L is a diagonal matrix of eigenvalues and A is the
eigenvector matrix. Hence E(zz¢) = |
L. |
|
|
|
|
|
|
|||
Now u¢u = z¢A¢Az = z¢z. So |
|
|
|
|
|
|
|
|
||
|
|
|
1p(p± 1) |
|
|
1p(p± 1) |
|
|
|
|
|
|
Y = |
2 |
Xi= 1 |
zi2 = |
2 Xi= 1 |
li vi2, |
|
|
|
where vi = zi / |
li and thus vi _N(0, 1). It is now an easy matter to obtain the moments |
|||||||||
of Y because |
the s are independent. Thus |
|
( ) = |
l = tr S since |
2 |
= 1 (x2 with |
||||
|
p v• |
|
|
|
E Y |
i |
Evi |
|
||
one degree of freedom); and |
|
Xli2Var(vi2) = |
P |
|
|
|||||
|
|
Var(Y) = |
2 X li2 |
|
|
= 2tr(S2).
Similarly, the asymptotic third moment of Y is the sum of the cubes of the ls multiplied by the third moment of a x2 variable with one degree of freedom, which is 8tr(S3). The element of S in the row labelled (i, j) and the column (k, l) is easily shown to be
p •
(pijkl ± pijpkl)/ DijDkl, where Dij = pij(1 ± pij ). The probabilities appearing in this expression are fourthand second-order marginal probabilities respectively and can
be calculated from the expected cell frequencies which are part of the standard output of the programs for Žtting latent variable and other models. Hence the asymptotic moments are easilyobtained by raising S to the appropriate power or by computing the eigenvalues. Numerical values are given in Table 1, which shows that the exact moments rapidly approach their limits.
3.3. The chi-squared approximation
We have alreadynoted that the simulated distributions appear to be similar in form to x2 in the upper tail so that it is reasonable to approximate the distribution of Y by treating it as a linear function of a x2 random variable. Thus we assume that
Y ,a + bxc2. |
(5) |
As we have expressions for the Žrst three moments of Y we can choose a, b and c so that the approximating distribution has the same Žrst three moments as Y.
Equating moments, it is easily shown that
b = m3(Y) , 4m2(Y)
c = m2(Y) ,
2b2
a = m1(Y) ± bc,
where m1(Y), m2(Y) and m3(Y) are the central moments of Y.

6 |
David J. Bartholomew and Shing On Leung |
|
|
|
|
|
Tail area probabilities are then calculated as follows: |
(6) |
|||
|
Pf Y > yg >Pnxc2 > |
b |
a |
o. |
|
|
|
y ± |
|
|
This we shall call the three-moment approximation. The range of the approximating distribution is (a, ¥) rather than the true range of (0, ¥). In practice this is of no importance because, in testing goodness of Žt, we are only interested in the upper tail. However, a simpler approximation, with the correct range, is obtained by setting a = 0. If we now determine b and c so that the Žrst two moments agree, we have
b = |
m2(Y) |
, |
c = |
m1(Y) |
. |
2m1(Y) |
b |
This is the two-moment approximation. Each approximationcan be made using either the exact moments or the asymptotic moments. Thus, altogether we have four approximations which are compared in Table 2. We have done this byestimating the 10%, 5%and 1% points of the exact distributions from the simulated distributions based on 10 000 replications. The values of a, b and c were calculated once for each cell of the table using the exact moments. We then estimated the probability for each approximation using (6). The three models are the same as those used for the calculations in Table 1.
All the approximations would be adequate for most practical purposes. The threemoment approximation is usuallythe best, especiallyat the smaller P-values, but for the one-and two-factor models there are cases where the two-moment approximations are slightly better. The difference between the exact and asymptotic moments is negligible and we therefore recommend the approximation based on asymptotic moments which will be used in the examples that follow. It should be borne in mind that the exact P- values are subject to some uncertainty, having been computed from 10 000 simulations.
As we noted in Section 2, the approximation has been derived and tested on the assumption that the model parameters are known. When the parameters are replaced by estimators there will be an effect on both the statistic and the sampling distribution. Asymptotically this will be negligible because the substitution introduces an error of order N± 1/2. However, it would be useful to know how large the sample size needs to be before the approximation can be ignored. Afull investigation of this point will require a major simulation study, but a preliminary calculation has been made for the case N = 100 and p = 10 using the same parameter values as in the simulation of Section 3.1. The sampling distribution of the estimated Y was simulated using 1067 replications. The percentages exceeding the 10%, 5%and 1%points of Y (as estimated from 10 000 simulations) were 9.37%, 4.87%and 0.84%, respectively. This suggests that, even with a sample size as small as 100, the procedure we have proposed will be adequate.
Areferee has pointed out that using the distribution of Y as an approximation to that of Yˆ is likelyto be inadequate if the model Žtted is poorly identiŽed. To take an extreme case, if the model is just identiŽed, the observed and expected frequencies will be equal and Yˆ will therefore be zero and its distribution will be concentrated at a single point. The distribution of Y, on the other hand, will not be so constrained. This is true but will not be relevant for the problems for which our test is designed. Yˆ is speciŽcally intended for sparse tables, and sparseness is only likely to arise when 2p is of at least the same order as the sample size. In such cases it will also be far in excess of the number of parameters to be estimated. This is why we chose p = 10 for the preliminary calculations given above; for models like those Žtted here 210 greatly exceeds the number of parameters.

A goodness of Ž t test for contingency tables |
7 |
4. Applications
4.1. Sexual attitudes data
We Žrst illustrate the test by carrying further the analysis of an example on sexual attitudes given in Bartholomew and Knott (1999, Example 2, p. 143). There a latent class model was Žtted. Here we Žt a latent trait model. That is, we investigate whether the associations among the responses can be explained by a common dependence on one or more latent variables. Let
Pf Xi = 1 | zg = pi(z), |
i = 1, 2, . . . , p, |
where z is a q-vector of latent variables assumed, without loss of generality, to have zero means and unit variances. The logit/normit model then speciŽes that
logit pi(z) = ai0 + ai1z1 + ai2z2 + . . . , i = 1, 2, . . . , p, (7)
where the zs are mutually independent and normally distributed. We investigate the Žt of this model for q = 1 and 2 using the methods and software available in Bartholomew and Knott (1999). Some results are given in Table 3.
The table gives that part of the analysis needed to apply the Y test. There are p questions and hence 12 10(10 ± 1) = 45 two-way margins as listed in the Žrst column. The columns headed ‘Contribution’ are the individual terms in (1) and the column totals are, therefore, the values of Y for the two models. Before carrying out the formal test it is immediately clear that the two-factor model is an excellent Žt, with a value of Y well below its expectation of 45. The value of Y for the one-factor model is larger, and almost half its value arises from the last row where the contribution is 49.37.
Using the x2 approximation we obtain the results given in Table 4 for the asymptotic threeand two-moment approximations. We have given only the moments and not the intermediate calculations required to obtain them. These have been deferred to Section 4.2 where the smaller value of p (= 5) allows a more economical presentation of the details.
With a p-value of 0.065 the model with p = 10 is only marginally signiŽcant but, having regard to the the extremely large contribution for the (9, 10) margins we consider that the two-factor model is more appropriate. This is consistent with the results obtained in de Menezes and Bartholomew (1996) for the same data. These indicated that the two questions on adoption by homosexual couples raise a quite different issue than the remaining items. A one-factor Žt would suggest a single dimension of variability according to a traditional/permissive scale of attitude to sexual behaviour. The present analysis conŽrms that this is not sufŽcient to explain attitudes to adoption. If the one-factor model is Žtted to items 1–8, excluding the adoption questions, the right-hand part of the table shows that the agreement is good, as we would expect.
4.2. Social life feelings and the workplace industrial relations survey
Here we return to two examples analysed in Bartholomew (1998), where further details will be found. The sparsity problem was not as great in these cases; for the social life feelings (SLF) data, N = 1490 and p = 5; for the workplace industrial relations survey (WIRS) data, N = 1005 and p = 6. Nevertheless, there were a number of very small expected frequencies and some uncertainty about the interpretation of the tests. The new test enables us to throw some new light on these unresolved questions.
For the SLF data the usual goodness of Žt tests (x2 and G2) gave values of 32.87 and

Table 2. Exact (simulated) and approximate signiŽ cance levels (%) for the Y test
|
|
|
|
|
|
|
|
|
Number of variables |
|
|
|
|
|||
|
|
|
|
|
5 |
|
|
|
|
|
|
|
10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||||
|
|
|
|
Three-moment Ž t |
|
Two-moment Ž t |
|
|
Three-moment Ž t |
|
Two-moment Ž t |
|||||
|
|
True tail |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Model |
N |
probability |
Exact |
Asymptotic |
Exact |
Asymptotic |
Exact |
Asymptotic |
Exact |
Asymptotic |
||||||
|
|
|
|
|
|
|
|
|
|
|
|
|||||
Independence |
100 |
10 |
10.3 |
10.5 |
|
10.6 |
10.7 |
10.2 |
10.2 |
10.1 |
10.2 |
|||||
|
|
5 |
5.1 |
5.3 |
|
4.8 |
5.0 |
5.2 |
5.2 |
4.9 |
4.9 |
|||||
|
|
1 |
1.1 |
1.1 |
|
0.8 |
0.8 |
1.0 |
1.0 |
0.8 |
0.8 |
|||||
|
500 |
10 |
10.1 |
10.0 |
|
10.2 |
10.2 |
9.9 |
9.9 |
9.9 |
9.9 |
|||||
|
|
5 |
5.1 |
5.1 |
|
4.9 |
4.9 |
5.0 |
5.0 |
4.7 |
4.7 |
|||||
|
|
1 |
1.0 |
1.1 |
|
0.8 |
0.8 |
1.1 |
1.1 |
0.8 |
0.8 |
|||||
|
1000 |
10 |
10.1 |
10.1 |
|
10.3 |
10.3 |
10.1 |
10.1 |
10.0 |
10.0 |
|||||
|
|
5 |
5.1 |
5.1 |
|
4.8 |
4.8 |
4.9 |
4.9 |
4.6 |
4.6 |
|||||
|
|
1 |
1.0 |
1.0 |
|
0.7 |
0.7 |
1.0 |
1.0 |
0.7 |
0.7 |
|||||
One-factor |
100 |
10 |
10.7 |
10.9 |
|
11.2 |
11.4 |
11.0 |
11.0 |
11.8 |
11.9 |
|||||
|
|
5 |
5.7 |
5.8 |
|
5.4 |
5.6 |
6.0 |
6.1 |
5.6 |
5.6 |
|||||
|
|
1 |
1.2 |
1.2 |
|
0.8 |
0.8 |
1.1 |
1.1 |
0.5 |
0.5 |
|||||
|
500 |
10 |
11.0 |
11.0 |
|
11.5 |
11.5 |
11.0 |
11.0 |
11.8 |
11.9 |
|||||
|
|
5 |
5.4 |
5.4 |
|
5.1 |
5.1 |
5.7 |
5.7 |
5.2 |
5.2 |
|||||
|
|
1 |
1.0 |
1.0 |
|
0.7 |
0.7 |
1.0 |
1.0 |
0.5 |
0.5 |
|||||
|
1000 |
10 |
10.8 |
10.8 |
|
11.2 |
11.2 |
10.9 |
10.9 |
11.7 |
11.7 |
|||||
|
|
5 |
5.5 |
5.5 |
|
5.2 |
5.2 |
5.3 |
5.3 |
4.7 |
4.7 |
|||||
|
|
1 |
0.9 |
0.9 |
|
0.6 |
0.6 |
1.0 |
1.0 |
0.5 |
0.5 |
Leung On Shing and Bartholomew .J David 8

|
|
|
|
|
|
|
|
|
Number of variables |
|
|
|
|
|||
|
|
|
|
|
5 |
|
|
|
|
|
|
|
10 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||||
|
|
|
|
Three-moment Ž t |
|
Two-moment Ž t |
|
|
Three-moment Ž t |
|
Two-moment Ž t |
|||||
|
|
True tail |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Model |
N |
probability |
Exact |
Asymptotic |
Exact |
Asymptotic |
Exact |
Asymptotic |
Exact |
Asymptotic |
||||||
|
|
|
|
|
|
|
|
|
|
|
|
|||||
Two-factor |
100 |
10 |
10.4 |
10.5 |
|
11.2 |
11.3 |
10.2 |
10.2 |
11.2 |
11.3 |
|||||
|
|
5 |
5.2 |
5.2 |
|
4.9 |
5.0 |
5.2 |
5.2 |
4.7 |
4.8 |
|||||
|
|
1 |
1.0 |
1.1 |
|
0.7 |
0.7 |
1.0 |
1.1 |
0.5 |
0.5 |
|||||
|
500 |
10 |
10.3 |
10.3 |
|
11.0 |
11.0 |
10.5 |
10.5 |
11.6 |
11.6 |
|||||
|
|
5 |
4.9 |
4.9 |
|
4.6 |
4.6 |
5.2 |
5.2 |
4.7 |
4.8 |
|||||
|
|
1 |
0.8 |
0.9 |
|
0.5 |
0.5 |
1.0 |
1.0 |
0.5 |
0.5 |
|||||
|
1000 |
10 |
10.4 |
10.4 |
|
11.1 |
11.1 |
10.8 |
10.8 |
12.0 |
12.0 |
|||||
|
|
5 |
5.4 |
5.4 |
|
5.2 |
5.2 |
5.5 |
5.5 |
5.2 |
5.2 |
|||||
|
|
1 |
1.1 |
1.1 |
|
0.7 |
0.7 |
1.0 |
1.0 |
0.5 |
0.5 |
tables contingency for test t Ž of goodness A
9

10 |
David J. Bartholomew and Shing On Leung |
|
|
|
|
||||
Table 3. Contribution to |
ˆ |
statistic by individual pairs for the sexual attitudes data |
|
||||||
Y |
|
||||||||
|
|
|
|
|
|
|
|
||
Pairs |
|
|
|
|
One-factor |
|
Two-factor |
||
|
|
Observed |
|
|
|
|
|
||
|
|
|
|
|
|
|
|||
i |
j |
count |
|
Expected |
Contribution |
|
Expected |
Contribution |
|
|
|
|
|
|
|
|
|
||
1 |
2 |
107 |
|
112.60 |
0.31 |
111.60 |
0.21 |
||
1 |
3 |
117 |
|
105.50 |
1.38 |
109.50 |
0.57 |
||
1 |
4 |
29 |
|
18.30 |
6.42 |
20.80 |
3.30 |
||
1 |
5 |
42 |
|
42.10 |
0.00 |
43.40 |
0.05 |
||
1 |
6 |
62 |
|
68.60 |
0.67 |
62.50 |
0.01 |
||
1 |
7 |
73 |
|
77.80 |
0.32 |
72.20 |
0.01 |
||
1 |
8 |
79 |
|
83.10 |
0.22 |
79.10 |
0.00 |
||
1 |
9 |
33 |
|
27.30 |
1.24 |
37.00 |
0.45 |
||
1 |
10 |
22 |
|
15.80 |
2.48 |
21.90 |
0.00 |
||
2 |
3 |
700 |
|
690.00 |
0.41 |
689.80 |
0.42 |
||
2 |
4 |
122 |
|
118.80 |
0.10 |
118.30 |
0.13 |
||
2 |
5 |
288 |
|
273.10 |
1.08 |
270.80 |
1.46 |
||
2 |
6 |
454 |
|
446.70 |
0.20 |
446.20 |
0.23 |
||
2 |
7 |
520 |
|
508.20 |
0.52 |
511.30 |
0.28 |
||
2 |
8 |
540 |
|
543.30 |
0.04 |
545.90 |
0.13 |
||
2 |
9 |
169 |
|
175.90 |
0.32 |
175.10 |
0.25 |
||
2 |
10 |
101 |
|
100.60 |
0.00 |
100.40 |
0.00 |
||
3 |
4 |
136 |
|
115.80 |
3.96 |
121.40 |
1.98 |
||
3 |
5 |
297 |
|
274.30 |
2.51 |
280.50 |
1.31 |
||
3 |
6 |
449 |
|
451.90 |
0.03 |
445.70 |
0.04 |
||
3 |
7 |
515 |
|
510.80 |
0.07 |
507.10 |
0.23 |
||
3 |
8 |
539 |
|
540.00 |
0.00 |
538.90 |
0.00 |
||
3 |
9 |
185 |
|
177.50 |
0.38 |
193.90 |
0.50 |
||
3 |
10 |
108 |
|
103.10 |
0.25 |
111.10 |
0.10 |
||
4 |
5 |
67 |
|
58.70 |
1.24 |
63.00 |
0.27 |
||
4 |
6 |
86 |
|
92.60 |
0.52 |
87.60 |
0.03 |
||
4 |
7 |
97 |
|
101.10 |
0.19 |
96.90 |
0.00 |
||
4 |
8 |
105 |
|
103.80 |
0.01 |
101.30 |
0.15 |
||
4 |
9 |
49 |
|
40.20 |
2.02 |
55.00 |
0.69 |
||
4 |
10 |
34 |
|
26.70 |
2.04 |
36.60 |
0.18 |
||
5 |
6 |
255 |
|
258.10 |
0.05 |
250.90 |
0.09 |
||
5 |
7 |
269 |
|
273.90 |
0.12 |
268.00 |
0.01 |
||
5 |
8 |
265 |
|
272.10 |
0.25 |
268.30 |
0.05 |
||
5 |
9 |
122 |
|
113.40 |
0.72 |
129.00 |
0.44 |
||
5 |
10 |
91 |
|
78.10 |
2.31 |
85.90 |
0.33 |
||
6 |
7 |
502 |
|
495.50 |
0.16 |
493.20 |
0.29 |
||
6 |
8 |
476 |
|
471.10 |
0.09 |
467.00 |
0.31 |
||
6 |
9 |
154 |
|
171.00 |
2.01 |
152.00 |
0.03 |
||
6 |
10 |
99 |
|
106.10 |
0.53 |
97.00 |
0.05 |
||
7 |
8 |
529 |
|
518.80 |
0.39 |
517.70 |
0.47 |
||
7 |
9 |
164 |
|
179.30 |
1.57 |
163.80 |
0.00 |
||
7 |
10 |
102 |
|
108.10 |
0.39 |
102.00 |
0.00 |
||
8 |
9 |
171 |
|
178.00 |
0.33 |
168.80 |
0.03 |
||
8 |
10 |
103 |
|
107.00 |
0.16 |
103.10 |
0.00 |
||
9 |
10 |
112 |
|
59.40 |
49.37 |
106.90 |
0.27 |
||
|
|
Total |
|
|
|
87.40 |
|
|
15.36 |
|
|
|
|
|
|
|
|
|
|