Ординатура / Офтальмология / Английские материалы / Principles Of Medical Statistics_Feinstein_2002
.pdf
16.7. Individual answers. As for the Supreme Court, as might be expected, it did not take a clear stand on the scientific issue. Instead, after acknowledging that the Frye rule might be too rigid, the Court said that local judges could use their own wisdom to evaluate scientific evidence and to decide what might be admissible, even if it did not comply with the Frye rule. Because the Court offered no guidelines for the local decisions, judges may have to start enrolling in appropriate courses of instruction.
Chapter 17
17.1.
17.1.1.With increased use of screening tests (such as cervical pap smears, mammography, and routine white blood counts) and more definitive technologic tests (such as ultrasound, CT scans, and MRI), many cancers are now detected that would formerly have been unidentified during the life of the patients. If incidence rates are determined not from death certificates, but from tumor registries or other special repositories of diagnostic data, the incidence rates will inevitably increase. The cancers that would formerly be found only as “surprise discoveries” at necropsy (if necropsy was done) will now be found and counted among the “vital statistics.”
17.1.2.Many of the cancers identified with the new screening and technologic procedures will be relatively asymptomatic and slow-growing. They will usually have better prognoses, even if left untreated, than the cancers that formerly were diagnosed because they had produced symptoms. Because the customary morphologic classifications do not distinguish these functional behaviors of cancer, the survival rates will rise because the relatively “benign” cancers are now included in the numerators and denominators. The relatively “malignant” cancers, however, may continue to be just as lethal as before. When referred to a community denominator, the cancer death rates may seem unchanged.
17.1.3.Because no unequivocal data (about nutrition, life style, smoking, alcohol, etc.) exist about risk factors for these cancers, it is difficult to choose a suitable public-health intervention that offers the prospect of more good than harm. Many nonmedical scientists — if professionally uncommitted to a particular “cause” or viewpoint — might therefore conclude that basic biomedical research has a better potential for preventing these cancers than any currently known public-health intervention.
17.3.
17.3.1.The denominator is 300, and 4 cases existed on July 1. Prevalence is 4/300 = .0133.
17.3.2.The incidence rate depends on whom you count as the eligible people and what you count
as incidence. Three new episodes occurred during the cited time period. If we regard everyone as eligible, the denominator is 300, and the incidence of new episodes will be 3/300 = .01. If you
insist that the eligible people are those who are free of the disease on July 1, the eligible group consists of 300 − 4 = 296 people. If you insist on counting only new instances of disease, the recurrence in case #3 is not counted as an episode. The incidence would be 2/296 = .0068. If you
allow the denominator to include anyone who becomes disease-free during the interval, only case
1 is excluded. If the numerator includes any new episode of disease, there are 3 such episodes in the interval. Incidence would be 3/299 = .01.
17.3.3.Numerator depends on whether you count 7 episodes or 6 diseased people. Denominator
is 300.
17.5.
17.5.1.Adding 0.5 to each cell produces (5.5)(8.5)/(0.5)(.05) = 187.
17.5.2.The main arguments were (1) the control groups were improperly chosen and should have contained women with the same pregnancy problems (threatened abortion, etc.) that might have evoked DES therapy in the exposed group; (2) the ascertainment of previous DES exposure may have been biased; (3) the Connecticut Tumor Registry shows an apparently unchanged annual incidence of clear-cell cancer despite little or no usage of DES for pregnancies of the past few decades.
©2002 by Chapman & Hall/CRC
values on both axes, the statistician guessed that the standard deviations for X and Y would each be about one fourth the corresponding range, i.e., sx ~ 18/4 = 4.5 and sy ~ 70/4 = 17.5, and so sx /sy was guessed as about 4.5/17.5, which is the inverse of the estimated slope, 70 /18. Because r = bsx/sy, the guesswork implies that r will roughly be (70/18)[(18/4)/(70/4)] 1 if the slope is as high as 4. From the dispersion of points, the statistician knows that the slope will not be as high as 4 and that r will not be as high as 1, but the statistician also knows, from Table 19.3, that for 11 points of data, with n = 11, P will be < .05 if r > .6. Believing that r will be close to or exceed .6, the statistician then guessed that the result will be stochastically significant. [A simpler
approach is to note that the bi-median split produces a 2 × |
2 table that is |
|
0 |
5 |
. This is at the |
|
5 |
0 |
|||
extreme of a Fisher test for 10!/(5! 5!) = 252 possibilities. The two-tailed P will be 2/252 = .008.] |
|||||
19.1.2. |
|
|
|
|
|
Σ X = 110; |
|
= 10; Σ X 2 |
= 1430; Sxx = 1430 − (1102 |
/11) = 330 |
X |
||||
Σ Y = 490; |
|
= 44.55; Σ |
Y 2 = 26700; Syy = 26700 − |
(490)2 /11 = 4872.73 |
Y |
Σ XY = 5635; (Σ XΣ Y)/N = 4900; Sxy = 735; b = Sxy/Sxx = 2.23 a = Y – bX = 44.55 – (2.23)(10) = 22.28
The graph of points is shown in Fig. AE19.1, and the line passes through (0, 22.28) and (10, 44.55). If you plotted the alternative line, b′ = Sxy/Syy = 735/4872.73 = .151; a′ = 10 − (.151) (44.55) = 3.28. The line passes through (3.28, 0) and (10, 44.55).
r2 = Sxy2 |
⁄(Sxx Syy )(735 )2 ⁄[(330 )(4872.73 )] = .336; and r = .58 |
||||
t = |
(r ⁄ |
1 – r2 ) n – 2 = (.58/ |
1 |
– .336 ) |
9 = 2.14 |
Because |
the |
required t.05 is 2.26 |
for |
P ð .05 |
at ν = 9, the result just misses being |
stochastically significant. On the other hand, because the investigator clearly specified an advance direction for the co-relationship, an argument can be offered that a one-tailed P value is warranted. If so, t.1 is 1.833, and “significance” is attained.
The visual guess of 4 for the slope was too high because the lowest and highest values for Y do not occur at the extreme ends of the range for X. The guess of 1/4 = .25 for the estimated sx /sy was quite good, however, because Sxx /Syy = 330/4872.73 = .068 and sx /sy =
Sx x/ Syy = .26.
19.3. Official Answer: YES. If one-tailed P values are permissible for comparisons of two groups, they should also be permissible for comparisons of two variables.
19.5.
19.5.1. The top figure (for firearm homicide) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
shows too much scatter to have a correlation as |
|
|
SERUM |
|
|
|
|
|
|
|
|
|
|||
high as .913. The lower figure (for firearm |
80 |
|
OMPHALASE |
|
|
|
|
|
|
|
|||||
suicide) looks like its data could be fit well with |
|
|
|
|
|
|
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||
two horizontal lines, one at about 5.5 for low |
60 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
values of X, and the other at about 6.5 for higher |
|
|
|
|
|
|
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||
values of X. In fact, the upper figure might also |
40 |
|
|
|
|
|
|
X |
|
|
|
|
|
||
be relatively well fit with two horizontal lines, |
|
|
|
|
|
|
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||
one through the lower values of X and the other |
20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
through higher values. |
|
|
|
|
|
|
|
|
|
|
|
|
|
||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
||
19.5.2. Neither of the two graphs shows a |
0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|||
relationship “strong enough” to suggest that the |
|
|
|
|
|
|
|
|
|
|
|
|
|
||
0 |
4 |
8 |
12 |
16 |
20 |
|
|||||||||
correlations have r values as high as .64 and .74. |
|
|
|
|
|
IMMUNOGLOBIN ZETA |
|
|
|
||||||
19.5.3. An excellent example of a correlation that |
FIGURE AE.19.1 |
|
|
|
|
|
|
|
|||||||
got r = − .789 and P < .05 for only 7 data points. |
|
|
|
|
|
|
|
||||||||
Graph |
(and regression |
line) for data in Exercise |
|||||||||||||
|
|||||||||||||||
In view of what happens for values beyond x Š |
19.1. |
|
120, however, the relationship doesn’t seem |
||
|
||
convincing. |
|
19.5.4.Does this look like a good correlation? Nevertheless, it achieved r = .45.
©2002 by Chapman & Hall/CRC


S
33.333/15
4
81
64 
1740 
n
81
n
64 
3,410,478 
2s
n/2. For SED
n/2
50/2