ACTIVITY 1
To analyse data for meaningful patterns and measures
[LO 5.3]
- Now we need to gather information about the heights of the learners in your class. Fasten a measuring tape like the one dressmakers use to the side of the door, so that it is perfectly vertical. If you can’t find a tape, you can use some other way – maybe making small marks very accurately every centimetre on the wall, using rulers.
- Each learner takes off her shoes and stands with her heels and back tightly against the wall. Someone who is tall enough holds a ruler or piece of cardboard flat on her head to see exactly how tall she is. It is a good idea to take the measurement in centimetres and not in millimetres. Write the answer on her hand (or on a piece of paper).
- We do our first calculation in an interesting way: When everyone has been measured, all the pupils stand in line in the order of their heights.
- From this line of pupils we get the first measurement of the average of the class. Write down the height of the pupil who is exactly in the centre of the line (equally far from the beginning as from the end). This number is called the median. There are as many learners shorter than she is, as there are taller than she is. Note: if there are an even number of learners in the class, then of course there will not be a middle person. In that case we take the two middle persons, add their heights and divide the answer by two.
- Write down the median height for your class. If you are in a class with both boys and girls, work out the medians for the boys and girls separately
- Next make a frequency table for the heights and use tallies to count how many of each height you have in the class.
Go back to the table of ages of siblings and find the median age of the boys and girls separately. Your table is likely to be very big, but here is a smaller example of what you should do:
- See whether you agree that the median height for this group is 162 cm.
- If you study the numbers in the last row (they give the frequencies of the different heights) you will see that 164 cm is the height that occurs most often as there are six learners who are 164 cm tall. This number is called the mode. We can think of it as the most popular height.
- The next calculation is the one that gives us the value that we usually call the average. Its proper name is the arithmetic mean, or just mean. You may already know how to calculate it: you add all the values and then divide the answer by the number of values. For the table above you divide 6156 by 38 to get a mean height for the class of 162 cm.
- .We can make a table of these values:Use the table of ages of siblings again and calculate the mode and mean for boys and girls separately and then fill these values in on a table like the one alongside
Table 1
| Median |
162 cm |
| Mode |
164 cm |
| Mean |
162 cm |
- These values (mode, median and mean) are together called measures of central tendency. They are all different kinds of averages. That is why, when we use the word average to refer to the arithmetic mean, we are not being perfectly accurate. From now on, you can use the word mean where you would have said average before.
- Use the heights for your class and complete the calculations.
- Now carefully study the frequency table for the heights of another class of 38 learners:
Table 2
| cm |
158 |
159 |
160 |
161 |
162 |
163 |
164 |
165 |
166 |
| Total |
1 |
4 |
6 |
6 |
5 |
4 |
7 |
4 |
1 |
- Calculate the three measures of central tendency for this class as well.
- Compare the heights of the learners in the two classes and write a short essay about the similarities and differences you found.
ACTIVITY 2
To extract more information from data
[LO 5.3]
Table 3
| Median |
162 cm |
| Mode |
164 cm |
| Mean |
162 cm |
As you see from the heights of the learners in the previous example, the two classes have different heights but the three averages are exactly the same.
We can tell a little more about the data by using measures of dispersion. These tell us more about how the values are distributed.
- The first is the range, calculated by taking the highest value and subtracting the lowest value from it. Do this for both classes. As you can clearly see, the first class has a range of 13 cm and the second class has a range of 8 cm.
- The second measure of dispersion is the mean deviation. This is calculated firstly determining how far each value deviates (or differs) from the mean (which we have already calculated). Then we calculate the mean of these deviations to give the mean deviation.
- We make another table from the data for the second class, which has all the heights and the deviation of each value from the mean:
Table 4
| 158 |
159 |
159 |
159 |
159 |
160 |
160 |
160 |
160 |
160 |
160 |
161 |
161 |
161 |
161 |
161 |
161 |
162 |
162 |
| 4 |
3 |
3 |
3 |
3 |
2 |
2 |
2 |
2 |
2 |
2 |
1 |
1 |
1 |
1 |
1 |
1 |
0 |
0 |
Table 5
| 162 |
162 |
162 |
163 |
163 |
163 |
163 |
164 |
164 |
164 |
164 |
164 |
164 |
164 |
165 |
165 |
165 |
165 |
166 |
| 0 |
0 |
0 |
1 |
1 |
1 |
1 |
2 |
2 |
2 |
2 |
2 |
2 |
2 |
3 |
3 |
3 |
3 |
4 |
- The total of all these deviations is 68. Dividing by 38 we get 1,79 when rounded to two places.
- Do the same calculation for the other class.
- Now you can calculate the two measures of dispersion for your own class.
Measures of dispersion are very useful when you want to compare two sets of data, like the heights of learners in two different classes. There are other measures of dispersion, but they are not taught in this course.
- At this stage you are very good at tabulating data, calculating values to describe the data as well as making some inferences about the data
ACTIVITY 3
To use new skills to investigate and compare some test marks
[LO 5.3]
- Compare the test marks for the same test obtained by two groups of learners, shown in the table below. You have to use all the skills you have learnt so far in this learning unit, to see whether you can say whether one group did better than the other. This is not a simple question, and you are not easily going to see an answer without some careful work and concentrated thinking.
Table 6
| Group A |
82 |
78 |
57 |
91 |
29 |
80 |
85 |
49 |
82 |
67 |
99 |
68 |
83 |
12 |
87 |
86 |
38 |
81 |
58 |
79 |
| Group B |
72 |
82 |
74 |
84 |
81 |
84 |
76 |
12 |
2 |
71 |
70 |
93 |
13 |
90 |
80 |
73 |
91 |
70 |
99 |
88 |
ACTIVITY 4
To represent data in ways that make it easier to understand their meaning
[LO 2.2, 2.6, 5.4]
In the work on graphs you saw that a graph gives a much better picture of the meaning of data.
- Now you will be learning more about different kinds of graphical representation of data. This means mainly that you make the meaning of data visible without always having to do intricate calculations.
1 Line graphs
- You have already been shown that when you have plotted a number of points (for instance from a table) on graph paper, the points may lie in a straight line, which you can draw.
- But it is not always correct to join them with a line. Think back to the stepped graphs.
- Sometimes the points will not lie in a straight line, but if they are joined they form, zig-zag line. This is often called a broken-line graph. But – is it always sensible to join the points?
- Below is a part of the frequency distribution of ages of siblings we had before.
Table 7
| Ages |
<1 |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
11 |
12 |
13 |
| Total sisters |
8 |
6 |
10 |
9 |
3 |
7 |
9 |
4 |
6 |
2 |
6 |
5 |
9 |
7 |
| Total brothers |
6 |
3 |
9 |
7 |
2 |
4 |
9 |
5 |
6 |
8 |
6 |
7 |
4 |
7 |
| Total |
14 |
9 |
19 |
16 |
5 |
11 |
18 |
9 |
12 |
10 |
12 |
12 |
13 |
14 |
2 Bar graphs
The next bar graph shows the girls and boys separately, but because the bars are stacked the top of the bar shows the total of boys and girls as well.
The same information can be drawn as bar graphs in a number of different ways; here is another one, with the bars next to each other, for you to study. Write a few sentences on the different bar graphs and how (in your own opinion) they represent the information most usefully.
3 Histograms
- The publishers of a certain youth magazine wanted to find out how old their readers were. They asked the ages of everybody who bought the magazine at a stationery kiosk in a shopping mall. The categories shoppers could choose from, were:
- Under 5
- From 5 to 8, but not yet 8
- 8 years old
- 9 years old
- More than 9 but less than 10 years and 6 months
- More than 10½ years, but not yet 11
- More than 11 years, but not yet 11½
- More than 11½ years, but not yet 12
- More than 12 years, but not yet 12½
- More than 12½ years, but not yet 13
- More than 13 years, but not yet 13½
- More than 13½ years, but not yet 14
- More than 14 years, but not yet 15
- Between 15 and 16
- Between 16 and 18
- Between 18 and 20
- Under 25
- 25 to 60
- This is the first part of the table completed from their data:
Table 8
| Age |
| 0 |
5 |
8 |
9 |
10 |
10,5 |
11 |
11,5 |
12 |
12,5 |
13 |
13,5 |
14 |
15 |
16 |
18 |
20 |
| <5 |
<8 |
<9 |
<10 |
<10,5 |
<11 |
<11,5 |
<12 |
<12,5 |
<13 |
<13,5 |
<14 |
<15 |
<16 |
<18 |
<20 |
<25 |
|
| Frequency |
0 |
2 |
2 |
3 |
1 |
2 |
2 |
4 |
2 |
3 |
5 |
2 |
1 |
2 |
2 |
0 |
1 |
- Below is the histogram drawn from the data in the table. A histogram is very similar to a bar graph, but the bars are not separated, and the width of the bars depends on the size of the intervals.
- In the table we can see that the age intervals are not all the same; the first interval is five years, the next three years, etc. Fill in the missing horizontal axis labels yourself.
- In the learner height data, all the intervals were 1 cm, which makes a bar graph a good and easy choice.
- It is easy to make a mistake and draw a bar graph when you should be making a histogram because the intervals vary – be sure to check the interval lengths every time.
4 Pie charts
- Pie charts have this name because they look like sliced pies!
- Study the following examples.
- The table shows the eating habits of learners attending a certain high school. They were asked to complete a small questionnaire, from which the data in the table was compiled.
Table 9
| Had no breakfast at home |
Breakfasted at home |
Brought breakfast to eat at school |
Had lunch after going home |
Brought lunch to school |
Bought lunch from tuck shop |
Brought extra snacks to school |
| 82 |
357 |
141 |
54 |
406 |
120 |
227 |
- There are 580 learners at the school. Can you confirm this from the figures in the table?
- A pie chart was made from the breakfast information, and another from the lunch figures. Decide which is which, then fill in the descriptions on the correct slice of each pie chart.
- As you can see, the slices are not all the same size. The sizes are proportional to the number of learners represented in each slice. The way to get them in the right proportions is to calculate the size of the angle at the tip of each slice. For example, 82 580 × 360 = 51°, rounded. This is the angle at the tip of the slice representing the proportion of learners who don’t eat breakfast before coming to school. The formula is: angle size = value total number × 360. Do the calculations for all five the other slices, and confirm by measurement that the slices are the right size!
- Of course, in the end the slices have to add up to 360°.
5 Scatter plots
- This graph consists only of the plotted points. It links two sets of information on one graph, making comparisons easy. Let’s look at an example.
- The table shows the marks obtained in Science and Maths for a group of 22 learners.
Table 10
| Pupil: |
A |
B |
C |
D |
E |
F |
G |
H |
I |
J |
K |
L |
M |
N |
O |
P |
Q |
R |
S |
T |
U |
V |
| Science |
75 |
45 |
28 |
66 |
58 |
81 |
23 |
69 |
60 |
48 |
72 |
37 |
47 |
90 |
57 |
88 |
45 |
56 |
62 |
40 |
53 |
48 |
| Maths |
65 |
31 |
40 |
67 |
52 |
75 |
34 |
95 |
70 |
66 |
58 |
40 |
45 |
84 |
75 |
70 |
55 |
61 |
53 |
55 |
72 |
49 |
For each learner, the coordinates of the point are (Science mark ; Maths mark). Learner A is (75;65). Can you find the dot? Learner B is (45;31), etc. The squares on the paper are not shown, so that the dots can be seen more clearly.
If the dots lie in a pattern (as these do) roughly from bottom left-hand corner to top right-hand corner, then it means there is a connection between the marks a learner gets for the two subjects.
- Those learners, whose marks don’t correlate, are clear from the graph. Find the two circled points. For example, the point (69;95) of learner H, is a little bit higher than the rest of the points. This tells us that the learner has a better Maths than Science mark, but learner B (45;31) does much better in Science than in Maths. If everybody got exactly the same mark for both Science and Maths, then their points would make a very clear pattern. What do you think that pattern would look like?