Добавил:

Upload Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Национальный исследовательский университет «Высшая школа экономики»

Предмет:

[НЕСОРТИРОВАННОЕ]

Файл:

R in Action, Second Edition.pdf

Скачиваний:

540

Добавлен:

26.03.2016

Размер:

20.33 Mб

Скачать

☆

<<< < Предыдущая 41 42 43 44 45 46 47 48 49 50 51 5253 / 17353 54 55 56 57 58 59 60 61 62 63 64 65 > Следующая >>>

b Treatment and Improved aren’t independent.

Frequency and contingency tables

151

produces this table:

		Sex Female	Male	Sum
Treatment	Improved
Placebo	None	65.5	34.5	100.0
	Some	100.0	0.0	100.0
	Marked	85.7	14.3	100.0
Treated	None	46.2	53.8	100.0
	Some	71.4	28.6	100.0
	Marked	76.2	23.8	100.0

Contingency tables tell you the frequency or proportions of cases for each combination of the variables that make up the table, but you’re probably also interested in whether the variables in the table are related or independent. Tests of independence are covered in the next section.

7.2.2Tests of independence

R provides several methods of testing the independence of categorical variables. The three tests described in this section are the chi-square test of independence, the Fisher exact test, and the Cochran-Mantel–Haenszel test.

CHI-SQUARE TEST OF INDEPENDENCE

You can apply the function chisq.test() to a two-way table in order to produce a chisquare test of independence of the row and column variables. See the next listing for an example.

Listing 7.12 Chi-square test of independence

>library(vcd)

>mytable <- xtabs(~Treatment+Improved, data=Arthritis)

>chisq.test(mytable)

Pearson’s Chi-squared test data: mytable

X-squared = 13.1, df = 2, p-value = 0.001463

>mytable <- xtabs(~Improved+Sex, data=Arthritis)

>chisq.test(mytable)

Pearson's Chi-squared test	c
data: mytable	c
X-squared = 4.84, df = 2, p-value = 0.0889
X-squared = 4.84, df = 2, p-value = 0.0889

Gender and Improved are independent.

Warning message:

In chisq.test(mytable) : Chi-squared approximation may be incorrect

From the results b, there appears to be a relationship between treatment received and level of improvement (p < .01). But there doesn’t appear to be a relationship c between patient sex and improvement (p > .05). The p-values are the probability of obtaining the sampled results, assuming independence of the row and column variables in the population. Because the probability is small for b, you reject the hypothesis that treatment type and outcome are independent. Because the probability for c isn’t small, it’s not unreasonable to assume that outcome and gender are indepen-

152	CHAPTER 7 Basic statistics

dent. The warning message in listing 7.13 is produced because one of the six cells in the table (male-some improvement) has an expected value less than five, which may invalidate the chi-square approximation.

FISHER’S EXACT TEST

You can produce a Fisher’s exact test via the fisher.test() function. Fisher’s exact test evaluates the null hypothesis of independence of rows and columns in a contingency table with fixed marginals. The format is fisher.test(mytable), where mytable is a two-way table. Here’s an example:

>mytable <- xtabs(~Treatment+Improved, data=Arthritis)

>fisher.test(mytable)

Fisher's Exact Test for Count Data data: mytable

p-value = 0.001393

alternative hypothesis: two.sided

In contrast to many statistical packages, the fisher.test() function can be applied to any two-way table with two or more rows and columns, not a 2 × 2 table.

COCHRAN–MANTEL–HAENSZEL TEST

The mantelhaen.test() function provides a Cochran–Mantel–Haenszel chi-square test of the null hypothesis that two nominal variables are conditionally independent in each stratum of a third variable. The following code tests the hypothesis that the Treatment and Improved variables are independent within each level for Sex. The test assumes that there’s no three-way (Treatment × Improved × Sex) interaction:

>mytable <- xtabs(~Treatment+Improved+Sex, data=Arthritis)

>mantelhaen.test(mytable)

Cochran-Mantel-Haenszel test data: mytable

Cochran-Mantel-Haenszel M^2 = 14.6, df = 2, p-value = 0.0006647

The results suggest that the treatment received and the improvement reported aren’t independent within each level of Sex (that is, treated individuals improved more than those receiving placebos when controlling for sex).

7.2.3Measures of association

The significance tests in the previous section evaluate whether sufficient evidence exists to reject a null hypothesis of independence between variables. If you can reject the null hypothesis, your interest turns naturally to measures of association in order to gauge the strength of the relationships present. The assocstats() function in the vcd package can be used to calculate the phi coefficient, contingency coefficient, and Cramer’s V for a two-way table. An example is given in the following listing.

Listing 7.13 Measures of association for a two-way table

>library(vcd)

>mytable <- xtabs(~Treatment+Improved, data=Arthritis)

>assocstats(mytable)

<<< < Предыдущая 41 42 43 44 45 46 47 48 49 50 51 5253 / 17353 54 55 56 57 58 59 60 61 62 63 64 65 > Следующая >>>

Соседние файлы в предмете [НЕСОРТИРОВАННОЕ]

#
05.08.2019741.83 Кб0psihologia.rtf
#
02.06.2015162.69 Кб76Psyh_final_ver.docx
#
02.06.2015141.74 Кб44Psyh_final_ver.docx
#
26.03.2016226.3 Кб23public_corporation.doc
#
26.03.2016451.53 Кб7pud_finansovyy-menedjment_318476.pdf
#
26.03.201620.33 Mб540R in Action, Second Edition.pdf
#
26.03.2016296.21 Кб17Radaev_Kak_napisat_akademicheskiy_text.pdf
#
26.03.20163.76 Mб4Raeff_Modernity.pdf
#
26.03.20162.12 Mб19raigorodskii_d_ya_hrestomatiya_psihologiya_lich.pdf
#
02.06.2015494.59 Кб6raschet_SRK_smorodin.doc
#
02.06.201563.98 Кб4referat_IOGP_3.docx