Добавил:
kiopkiopkiop18@yandex.ru t.me/Prokururor I Вовсе не секретарь, но почту проверяю Опубликованный материал нарушает ваши авторские права? Сообщите нам.
Вуз: Предмет: Файл:
Скачиваний:
0
Добавлен:
28.03.2026
Размер:
4.18 Mб
Скачать

3

Describing data with charts

Learning objectives

When you have finished this chapter you should be able to:

Choose the most appropriate chart for a given data type.

Draw pie charts; and simple, clustered and stacked, bar charts.

Draw histograms.

Draw step charts and ogives.

Draw time series charts.

Interpret and explain what a chart reveals.

Picture it!

In terms of describing data, of seeing ‘what’s going on’, an appropriate chart is almost always a good idea. What ‘appropriate’ means depends primarily on the type of data, as well as on what particular features of it you want to explore. In addition, if you are writing a report, a chart will always give you an ‘impact’ factor. Finally, a chart can often be used to illustrate or explain a complex situation for which a form of words or a table might be clumsy, lengthy or otherwise

Medical Statistics from Scratch, Second Edition David Bowers

C 2008 John Wiley & Sons, Ltd

30

CH 3 DESCRIBING DATA WITH CHARTS

Figure 3.1 Pie chart: children receiving Malathion in nit lotion study, percentage by hair colour. Data in Table 2.1

inadequate. In this chapter I am going to examine some of the commonest charts available for describing data, and indicate which charts are appropriate for each type of data.

Charting nominal and ordinal data

The pie chart

You will all know what a pie chart is, so just a few comments here. Each segment (slice) of a pie chart should be proportional to the frequency of the category it represents. For example, Figure 3.1 is a pie chart of hair colour for the children receiving Malathion in the nit lotion study in Table 2.1. I have chosen to display the percentage values, which are often more helpful. A disadvantage of a pie chart is that it can only represent one variable (in Figure 3.1, hair colour). You will therefore need a separate pie chart for each variable you want to chart. Moreover a pie chart can lose clarity if it is used to represent more than four or five categories.

Exercise 3.1 The two pie charts in Figure 3.2 are from a study to investigate the types of stroke in patients with asymptotic internal-carotid-artery stenosis (Inzitari et al. 2000). They show the types (in percentages) of disabling and non-disabling ipsilateral strokes, among two categories of patients: those with < 60 per cent stenosis, and those with 60–99 per cent stenosis. What is the most common type of stroke in each of the two categories of stenosis? What is the second most common type?

Exercise 3.2 Sketch a pie chart for the patient satisfaction data in Table 2.4.

CHARTING NOMINAL AND ORDINAL DATA

31

<60% Stenosis

60–99% Stenosis

 

19.2%

3.3%

27.5%

5.0%

 

 

 

10.0%

 

5.9%

12.6%

 

 

 

 

 

 

 

16.6%

 

5.0%

 

 

 

 

 

 

 

 

 

27.5%

 

 

42.4%

25.5%

 

 

Disabling cardioembolic

Disabling Iacunar

Disabling Iarge-artery

 

Nondisabling cardioembolic

Nondisabling Iacunar

Nondisabling Iarge-artery

 

Figure 3.2 Pie charts showing the types (by percentages) of disabling and non-disabling ipsilateral strokes, among two categories of patients, those with < 60 per cent stenosis, and those with 60– 99 per cent stenosis. Reproduced from NEJM, 342, 1693–9, by permission of New England Journal of Medicine

The simple bar chart

An alternative to the pie chart for nominal data is the bar chart. This is a chart with frequency on the vertical axis and category on the horizontal axis. The simple bar chart is appropriate if only one variable is to be shown. Figure 3.3 is a simple bar chart of hair colour for the group of children receiving Malathion in the nit lotion study. Note that the bars should all be the same width, and there should be (equal) spaces between bars. These spaces emphasise the categorical nature of the data.

Figure 3.3 Simple bar chart of hair colour of children receiving Malathion in nit lotion study (data in Table 2.1)

32

CH 3 DESCRIBING DATA WITH CHARTS

Exercise 3.3 Use the data in Table 1.8 to sketch a simple bar chart, showing the hair colour of the children receiving d-phenothrin.

Exercise 3.4 Draw a simple bar chart for the patient satisfaction data in Table 2.4. In Exercise 3.2, you drew a pie chart for this data. Which chart do you think works best? Why?

The clustered bar chart

If you have more than one group you can use the clustered bar chart. Suppose you also know the sex of the children receiving Malathion in the above example. This gives us two sub-groups, boys and girls, with the data shown in Table 3.1.

There are two ways of presenting a clustered bar chart. Figure 3.4 shows one possibility, with hair colour categories on the horizontal axis. This arrangement is helpful if you want to compare the relative sizes of the groups within each category (e.g. redheaded boys versus redheaded girls).

Table 3.1 Frequency distribution of hair colour by sex of Malathion children in nit lotion study

 

 

Frequency

 

 

 

Hair colour

Boys

Girls

 

 

 

Blonde

4

11

Brown

29

20

Red

1

3

Dark

14

13

 

 

 

Figure 3.4 Clustered bar chart of hair colour by sex for children in Table 3.1

CHARTING NOMINAL AND ORDINAL DATA

33

Alternatively, the chart could have been drawn with the categories boys and girls, on the horizontal axis. This format would be more useful if you wanted to compare category sizes within each group. For example, red haired girls compared to dark haired girls. Which chart is more appropriate depends on what aspect of the data you want to examine.

Exercise 3.5 Use the data in Table 3.1 to sketch a clustered percentage bar chart showing the hair colour of children receiving Malathion and d-phenothrin. There are two possible formats. Explain why you chose the one you did.

An example from practice

The clustered bar chart in Figure 3.5 is from a study describing the development of the APACHE II scale, used to assess risk of death, and used mainly in ICUs (Knaus et al. 1985). APACHE II has a range of 0 (least risk of death) to 71 (greatest risk). Data was available on two groups of patients, one group admitted to ICU for medical emergencies, the second admitted directly to ICU following surgery. The bar chart shows the percentage death rate (vertical axis), against

APACHE II AND HOSPITAL DEATH

Noroperative and Postoperative Patients

 

100.0%

 

 

 

 

 

 

 

 

90.0%

 

 

 

 

 

 

 

 

80.0%

 

 

 

 

 

 

 

 

70.0%

 

 

 

 

 

 

 

Rate

60.0%

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Death

50.0%

 

 

 

 

 

 

 

40.0%

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

30.0%

 

 

 

 

 

 

 

 

20.0%

 

 

 

 

 

 

 

 

10.0%

 

 

 

 

 

 

 

 

0.0%

 

 

 

 

 

 

 

 

0–4

5–9

10–14

15–19

20–24

25–29

30–34

35+

Apache II Score

 

Nonoperative

 

Postoperative

Figure 3.5 Clustered bar chart of APACHE II scores. Data on two groups of patients, one group admitted to ICU for medical emergencies, the second admitted directly to ICU following surgery. The vertical axis is death rate (per cent). Reproduced from Critical Care Medicine, 13, 818–29, courtesy of Lippincott Williams Wilkins