Study Guide
Study Guide
Grades 7, 8 and 9
Foreword
Statistics South Africa (Stats SA) is committed towards building statistical capacity and
promoting statistical literacy within the organization, other government departments and
within schools and the public as a whole.
The Census@School (C@S) 2009 project was undertaken nationally in all provinces. Data
were collected from a sample of 2 500 schools selected from the Department of Basic
Education’s database of approximately 26 000 registered schools (EMIS database). The
main objective of the project is to raise awareness of the national census, how it gathers
data, and its benefits to society. In addition, data collected should be used to provide
contextual material for teachers and learners to use for teaching and learning of data
handling, and promoting statistical literacy relating to a variety of subjects.
The first series of Mathematics Study Guides on Data Handling and Probability for the
Senior Phase (Grades 7–9) using the 2009 C@S data has been developed. This milestone
has been achieved through the collaboration with and support from the national and
provincial Departments of Basic Education with regard to the C@S projects.
EDITOR
Jackie Scheiber
CONTENTS
Chapter 1 Collecting Data 1
Chapter 2 Organising and Summarising Data 21
Chapter 3 Mode, Mean, Median 33
Chapter 4 Representing Data 47
Chapter 5 Interpreting Data 71
Chapter 6 Relative Frequency and Probability 93
Chapter 7 Chapter 1 Answers 115
Chapter 2 Answers 118
Chapter 3 Answers 120
Chapter 4 Answers 123
Chapter 5 Answers 128
Chapter 6 Answers 131
C E N S U S @ S C H O O L
Chapter
1
Collecting data
In this chapter you will:
x Define data
x Learn about the Investigative Cycle
x Learn about methods used for collecting data
x Learn about the difference between samples and populations
x Select and justify appropriate methods for collecting data
x Learn about the design and use of questionnaires
I n statistics you often work with tables of data. Statistical results are also often
shown in graphs rather than just in words and numbers. These graphs make it
easier to interpret and analyse information. In this chapter you will learn about
collecting data.
What is data?
Data can be numbers, words, measurements, observations
NOTE
or even just a description of things.
Data is a collection Statistics is the name given to the study of data. It involves:
of facts such as 1. Collecting data
values or 2. Sorting data
measurements. 3. Displaying data in diagrams or charts
4. Analysing the results
5. Coming to conclusions.
9 If, for example, someone tells you that 55% of the learners at their school are
boys while only 5% of the teaching staff is male, they have given you some
statistical information.
9 What these percentages (which are statistics) tell you is that more than half of
the children at the school are boys while only a very small percentage (5%) of
the teachers are men.
1
C E N S U S @ S C H O O L
The South African Census@School activities have been developed using the
investigative cycle. Statisticians use this cycle and so can you.
Data
9 The cycle starts with a problem, and
goes on to planning and so on, but it
does not have to end at the
Plan Analysis
conclusion.
9 The conclusion could just be the
beginning of a new problem that you
find to investigate. That’s why it is a
cycle. Problem
Conclusion
We are going to go through each of the stages in the PPDAC cycle so that you will
get an idea of the stages in the cycle and how they fit together.
Stage 1: Problem
The problem section is where we make decisions about
o what data to collect
o who to collect it from
o why it is important.
Some questions that can help you to think about the problem and to develop a
statistical question of your own are:
x How do I go about answering this question?
x What do I need to know?
x How will I find the information that I need?
x What will I do with the information that I collect?
x Who will find this information useful?
x Is this information relevant to the problem?
2
C E N S U S @ S C H O O L
Stage 2: Plan
The planning section is about how you will gather the data.
Questions that can help you to think about planning your data
gathering project are:
x How will I gather this data?
x What data will I gather?
x How am I going to record this information?
For example: at a certain pet shop there are 205 goldfish, 6 puppies, 15 kittens, 37
budgies, 17 hamsters and 4 cockatiels. The different types of pets and the number
of each type is data.
o You need to make predictions and then test them. You might think to
yourself “There are mostly kittens and puppies at the pet shop”.
o After that you can reflect on the difference between your prediction and the
result.
o You need to plan things such as the sample size and method of data
collection. You think about whether you will count all of the pets (the whole
pet shop population), or if the shop is big, maybe only the pets in every
second cage (a sample of the pets). But to make this decision you need to
plan.
Stage 3: Data
The data section is how the data is managed and organised.
Raw Data is data which has been collected but not yet sorted out in any way.
For example: In order to organise the raw data about the animals in the pet shop, you
could make a table with the different types of pets in a list down one side and then
make a tick for every pet you count.
3
C E N S U S @ S C H O O L
You may record their data in any format as long as it is clear and easy to work with.
9 A table is usually the best format. Tables are the most common
organisational tool that statisticians use.
9 You could count the number of ticks to see how many of each pet type there
are and you can record this information in a table.
9 Then you’ll think about how to present the data – probably with a graph
show that you can give the “picture” of what the information you have
found looks like.
Stage 4: Analysis
The analysis section is about exploring the data and reasoning
with it.
Some questions that can help you think about how to analyse
your data are:
x What do I notice?
x Why do I think it looks like this?
x Are there relationships between some of the variables I
have investigated?
We can:
9 Read the data – take information directly off a graph
9 Read between the data – interpret the data
9 Read beyond the data – extend, predict or infer
9 Read behind the data – connect the data to the context.
For example:
o Look at the table of information and graphs that you may have drawn of the pet
shop statistics and see what it tells you.
o You might realise “Kittens and puppies may be bigger and easier to see from the
pet shop window, but there are far more fish for sale in this pet shop than
kittens and puppies put together”.
o You might wonder about which pets are most popular (reading beyond), which
pets are the easiest/hardest to look after (reading behind) or simply compare
the data you see in the graph (reading the data).
4
C E N S U S @ S C H O O L
Stage 5: Conclusion
The conclusion section is about answering the questions in
the problem section and providing reasons based on the
analysis of these questions.
For example: Based on the pet shop data, you could conclude that fish are the
most popular pet and easiest for the pet shop to keep since they have the most fish.
The five stages (Problem; Plan; Data; Analysis; Conclusion) take you through the
full “investigative research cycle”.
The example on the next page illustrates how the five stages of the research cycle
can be used:
5
C E N S U S @ S C H O O L
)
Example 1.1
The Census@School researchers wanted to investigate whether
South African learners had access to libraries, computers and the
internet at school. Explain how you can use the investigative cycle
to do this.
Solution
1) PROBLEM:
The general question that C@S wanted to investigate was something like – ‘I
wonder how accessible libraries, computers and the internet are to the learners
in South Africa?’
2) PLAN:
C@S had to plan how to collect the data. The project collected data for many
questions, including questions about the accessibility of libraries, computers and
the internet. They had to plan things such as:
a) How to gather this data – C@S decided to take questionnaires into schools.
Two questionnaires were developed, one for Grades 3 – 7 and one for
Grades 8 –12.
b) What data to gather – the questionnaire included questions about how
easy/difficult it is for learners to get to libraries.
c) How to record this information – the learners would write their answers on
the questionnaires which could be collected.
3) DATA:
C@S had to think about the way in which they would organise the data.
This would involve:
a) How to record the data – C@S entered all of the data onto a table.
b) How to present the data – a bar graph representing the accessibility of
libraries, computers and the internet was presented in the C@S report.
4) ANALYSE data:
What the bar graph showed was that more learners have greater access to
libraries than to computers and the internet.
Fewer learners have access to the internet than to libraries or computers.
5) CONCLUSION:
a) The analysis showed that, of the three, libraries are the most accessible and
the internet is the least accessible. Computers are somewhere in the middle.
b) Because learners do not have to pay money to take books out of a library,
more learners can access a library. Computers cost a lot and are not always
available. Access to the internet costs even more money and so it is even
less common.
c) There is a relationship between ownership of a computer and access to the
internet, although internet cafes do provide internet access to people who
do not own computers.
6
C E N S U S @ S C H O O L
Exercise 1.1
Make up your own example of a statistical investigation, like the pet shop example
used on the previous pages to illustrate the stages in the investigative cycle.
If you can’t think of an example of your own try one of the following:
x What is the range of ages of learners in your class or school?
x What is the range of learner heights in your class or school?
7
C E N S U S @ S C H O O L
To be able to make conclusions from data, we need to know how the data were
collected; that is, we need to know the method(s) of data collection.
8
C E N S U S @ S C H O O L
)
Example 1.2
You have been asked to investigate what the favourite sport is of
the learners in your school. Think about how you are going to do
it, and then answer the following questions about the methods you
are going to use:
9
C E N S U S @ S C H O O L
Exercise 1.2
You have been asked to investigate what the favourite tuck shop food is of the
learners in your class. Think about how you are going to do it, and then answer the
following questions about the methods you are going to use:
1) What data do you want to collect?
2) How do you plan to collect you data? Explain how you would go about it.
3) If you collected data initially as raw data, how would you record this data?
4) What form of data collection could you use?
a) Could you have used a census? Explain why or why not.
b) Could you have used a sample survey? Explain why or why not.
c) Could you have used an experiment? Explain why or why not.
d) Could you have used an observational study? Explain why or why not.
10
C E N S U S @ S C H O O L
A Population
A population is the whole group of people (or things)
that you want to speak about or investigate.
Examples:
a) In the example about what the favourite foods from the tuck shop are,
o If you wanted to find out about the favourite foods of the learners in your class
then your class would be the population
o If you wanted to find out about the favourite foods of the learners in the whole
school then all of the learners in the school would be the population.
b) In the example about the pet shop, all of the pets in the shop would be the
population.
9 Normally a population is quite large, but the size of your population depends on
your statistical research.
9 The larger your population (for example you might want give information relating to
“male South Africans") the more impossible it becomes to ask each member of that
population the questions you want to ask. You then need to develop some way to
help you with selection of individuals from your population, and to give a generalised
result based on information obtained from your sample.
A Sample
x When you want to collect information from a certain
population, which can sometimes be large, you could
take a smaller group, which could represent the whole
population.
x Such a group, used to represent the population, is called
a sample.
Examples:
a) If your favourite food survey was only to be done in your class you would not have
to find a sample.
o If the survey needed to be extended to the whole school, you will not have time
to ask everyone in the school the questions that you want to ask.
o What you could do is ask a selection of learners whose answers you will use to
make conclusions about the whole school.
11
C E N S U S @ S C H O O L
o You would try to ask learners from each grade in the school, with a good spread
across the school, to make your sample representative of the whole school
population.
b) In the example of the pet shop, counting pets in every second cage would be a way
of sampling.
)
Example 1.3
Explain the difference between a population and sample by using
the investigation to find out about the favourite sport of the
learners
a) In your class
b) In your school.
Solution
a) If I wanted to find out about the favourite sports that the learners in my class
like to play, the class is the population for my research. I can ask every learner in
my class and then talk about the sports they like. I don’t have to take a sample
of the learners in the class, because the population is small.
b) If I want to find out about the favourite sports that the learners in my school like
to play, the population for my research is every learner in the whole school. This
is harder to do! That could be quite a big number. I would rather ask a sample of
the learners in the school what their favourite sport is. Then I could use this
information to decide about the favourite sport of the learners in the school.
This means that I would ask a smaller group of learners selected from the whole
school population. But to be sure that what I say about the school is realistic I
would have to be careful to choose a sample that represents the full spread of
learners in my school. This sample could consist of:
x Some (a percentage) from each grade
x Some (a percentage)from each class in each grade
12
C E N S U S @ S C H O O L
Exercise 1.3
13
C E N S U S @ S C H O O L
Random samples
You need to be able to distinguish between samples and populations.
One way of selecting a sample is by using random
sampling.
9 If each member of the population has an equal
chance of being chosen for a sample, then you
have a random sample.
9 Taking a random sample makes sure that the
sample represents the population.
9 One way of selecting a random sample is to place all the names in a hat or box; to
mix the names up; and to then take the names out of the hat or box without looking.
The mixing up of the names is important or else the names which are placed last in
the hat will have a better chance of selection than those which were placed first into
the hat. This will lead to bias in your data.
For example
If you want to find out what the favourite tuck shop food is of all the learners in the
school, be sure not just to ask your friends (who might all have similar tastes) or people
only from one class.
o You need to think of a way of getting the information from as wide a selection of
learners as possible. That way, what you hear from your sample (your selection
carefully chosen to be representative) will reflect what is going on in the whole
school.
For example
Two ways you could select a random sample of learners from your school are:
1) You could cut up all of the school class lists and then draw 30 names out of the
hat. This might not give you a random sample if the hat is not well enough
shaken.
2) You could take each class list and select every 10th name on the lists. This could
be the easiest way to get a representative random sample.
14
C E N S U S @ S C H O O L
Exercise 1.4
15
C E N S U S @ S C H O O L
Exercise 1.5
Jane wants to find out what is the favourite sport of South African children. She spends a
week asking every learner she sees at her school what their favourite sport is. She tallies
up the information and decides that Athletics is the favourite sport.
My school
Favourite sport Boys Girls Total
Athletics 34 25 59
Soccer 39 5 44
Netball 2 30 32
Softball 15 12 27
Cricket 16 4 20
No Favourite sport 10 11 21
Other 8 6 14
She decides to check up on her finding. She goes to the South African Census@School
website and finds the following table:
Eastern Cape
Favourite sport Male Female Total
Soccer 6 931 775 7 706
Netball 135 5 626 5 761
Athletics 1 957 2 058 4 015
No Favourite sport 722 1 409 2 131
Rugby 1 276 44 1 319
Unspecified 595 476 1 071
Volleyball 293 435 728
Cricket 450 59 509
Dance Sport 80 387 467
Softball 185 281 465
16
C E N S U S @ S C H O O L
)
Example 1.4
Give an example of each of the different kinds of questions you
could ask when doing research at your school.
Solution
1) Particular things – What was the highest score in the Grade 7 maths final exam at
my school?
2) Comparative things – Are there more 14 year olds or 15 year olds in Grade 8 at my
school?
3) Relationships – Is there a relationship between the amount of money spent at the
tuck shop and the age of the learner at my school?
17
C E N S U S @ S C H O O L
Exercise 1.6
Give another example of each of the different kinds of questions you could ask
when doing research at your school.
1) A particular question
2) A comparative question
3) A relationship question
When you decide to collect data, you need to decide whether you are working with
a sample or a population.
)
Example 1.5
Where would you find your data to answer the question you posed in
Example 1.4?
Solution
1) Particular question – What was the highest score in the Grade 7 maths final exam at
my school?
I could find out about this by speaking to the teachers and getting the mark lists for
all of the Grade 7 classes at my school.
2) Comparative question – Are there more 14 year olds or 15 year olds in Grade 8 at
my school?
I could find out about this by asking the Grade 8 learners at my school about their
age.
3) Relationship question – Is there a relationship between the amount of money spent
at the tuck shop and the age of the learner at my school?
I could ask a sample of learners who shop at the tuck shop about their age and the
amount that they spend. Then I would be able to see if there is a relationship. I
would have to be sure to ask a big enough representative sample.
18
C E N S U S @ S C H O O L
Exercise 1.7
Where would you find your data to answer the questions you posed in
Exercise 1.6?
Write down the question and then explain where you would find
your data.
19
C E N S U S @ S C H O O L
)
Example 1.6
1) Give an example of each of the following two types of questions
that could be asked in a questionnaire.
a) a yes/no response type question.
b) a multiple choice response type question.
2) Why are these types of questions easy to analyse?
Solution
1) a) Do you eat red meat? Circle yes/no
b) What is your favourite meat? (Circle the letter that represents your choice)
i) Beef
ii) Lamb
iii) Chicken
iv) I don’t eat meat
2) These types of questions easy to analyse because the responses are limited to the
options given on the questionnaire and so you don’t have to worry or think about
what to do with all sorts of different, uncommon answers that some people might
give.
Exercise 1.8
Pick one of your research questions that you posed in Exercise 1.6.
Design a 10 question questionnaire to collect data for your research.
9 Your questionnaire should have six yes/no questions and four multiple choice
questions.
9 If you want to design questionnaires for ALL of your questions, go right ahead.
The more you do, the better you’ll get at doing it.
9 If you had some difficulty making up questions, choose one of the following
questions:
1) What types of transport do the learners in my school use to go to school?
2) How far do the learners in my school live from the school?
Many of the questions that have been raised in this chapter were researched by the
C@S team. Results for South Africa in 2009 are available in the Census at School
Results 2009 report which can be found at the Stats SA website www.statssa.gov.za
You will find out lots more about working with data in the chapters that follow.
20
C E N S U S @ S C H O O L
Chapter
2
Organising and Summarising Data
In this chapter you will:
x Learn to recognise continuous and discrete data
x Organise and record data using tallies
x Draw an ungrouped and grouped frequency table
x Organise data using a stem and leaf diagram
Types of data
Data can consist of two types of data: numerical data or
NOTE
non-numerical data. The data you collect in a survey or
x Data can be questionnaire may be varied – it may be about the colour of
numerical or non- learners’ eyes, their mode of transport to school, an opinion (e.g.
numerical. which chocolate do you prefer). It may also be numerical (e.g.
how many learners come to school by bus or taxi in the morning).
In this book we will be working with numerical data that is either discrete or
continuous as well as non-numerical or qualitative data.
x Qualitative data consist of descriptions using words.
Examples are: the colour of hair, the colour of eyes, shoe sizes.
x Discrete and continuous data both consist of numerical values.
o Discrete data is information that is collected by counting exact amounts.
Examples are: the number of children in a family; the number of children
with birthdays in October or the number of houses with electricity.
o Continuous data is collected by measurement and the values form part of a
continuous scale like a number line.
21
C E N S U S @ S C H O O L
Organising data
When you first look at numerical data, all you may see is a jumble of information.
You need to sort or summarise the data and record it in a way that puts order into it
so that it makes more sense.
9 Data in this form are called raw data. Raw data haven’t been organised in
any way.
9 One of the most common ways of sorting data is by making a list. Data is
easy to sort into lists that are either numerical or alphabetical.
Exercise 2.1
Given below are the heights (in centimetres) of 90 Grade 8 boys in an Eastern Cape
school as recorded in the 2009 Census@School:
165 148 158 150 160 165 150 156 155 164 162 160 158 148 158
140 146 160 148 152 139 165 148 160 156 158 170 155 160 148
155 158 179 170 158 161 155 160 163 178 138 172 170 156 160
160 171 140 160 170 175 148 170 177 155 167 154 160 170 155
136 179 150 167 148 160 164 167 157 165 163 140 162 178 160
170 163 162 165 175 165 152 147 180 148 170 165 167 165 165
Tally tables
9 You can also organise and summarise data using a tally table.
A tally is a mark which shows how often something happens.
9 For each score, a vertical stroke is entered in the appropriate row, with a
diagonal stroke being used to complete each group of five strokes.
22
C E N S U S @ S C H O O L
)
Example 2.1
Organise the list of heights (in centimetres) of 90 Grade 8 boys in
an Eastern Cape school given in Exercise 2.1 into a tally table.
Solution
Height (in cm) Tallies Frequency
136 1
137 0
138 1
139 1
140 3
141 1
142 0
143 0
144 0
145 0
146 1
147 1
148 8
149 0
150 3
151 0
152 2
153 0
154 1
155 6
156 3
157 1
158 6
159 0
160 12
161 1
162 3
163 3
164 2
165 9
166 0
167 4
168 0
169 0
170 8
171 1
172 1
173 0
174 0
175 2
176 0
177 1
178 2
179 2
180 1
23
C E N S U S @ S C H O O L
Frequency Tables
9 A frequency table is usually given without the tallies. The word “frequency”
may not appear in the table. Instead, the frequency column might be headed
“number of learners” or something like that. Always make sure that you
identify which numbers are the “values” and which ones are the “frequencies”.
Exercise 2.2
Conduct a simple survey of the learners in your class to ask about the month of
their birthday.
9 Stem-and-leaf diagrams can be used both with discrete data and with
continuous data (rounded off to the nearest whole number).
9 Stem-and-leaf diagrams retain the original data information, but present it in a
compact and more easily understandable way.
9 When entering a number like 56 onto a stem and leaf diagram, the tens digit (5)
forms the stem and the units digit (6) forms the leaf.
25
C E N S U S @ S C H O O L
)
Example 2.2
The members of your class got the following marks in a
mathematics test:
1) Organise the data using a stem and leaf diagram.
2) Write down 3 conclusions you can reach about the marks.
stem leaf
3
4
5
6
7
Step 2 : Write down the leaves directly from the data without worrying about
the order:
stem leaf
3 2, 9, 6, 3, 9
4 5, 5, 4, 7, 0, 7, 8, 5
5 6, 9, 4, 4, 2, 0, 2, 7, 5, 3, 7, 6, 5, 8, 5, 1
6 5, 9, 6, 1, 3, 2, 5
7 8, 7, 2, 1
26
C E N S U S @ S C H O O L
Step 3 : Rewrite the leaves in ascending order. This makes the table easier to
read.
stem leaf
3 2, 3,6, 9, 9
4 0, 4, 5, 5, 5, 7, 7, 8
5 0, 1, 2, 2, 3, 4, 4, 5, 5, 5, 6, 6, 7, 7, 8, 9
6 1, 2, 3, 5, 5, 6, 9,
7 1, 2, 7, 8
KEY: 3/2 = 32
2) We can read the following information from this stem and leaf diagram:
a) The lowest mark is 32
b) The highest mark is 78
c) More learners got marks in the 50s than in any of the other number ranges
d) More learners got marks in the 40s than in the 60s
e) The most common marks were 45 and 55 as three learners got 45 and three
got 55.
f) Four learners achieved marks of more than 69.
NOTE:
9 The leaf is the digit in the place furthest to the right in the number.
9 The stem is the digit or digits that remain when the leaf is dropped.
9 If the list of numbers included numbers like 120 ; 134 ; 127 then 12 and 13
would be the stems and 0, 4 and 7 would be the leaves.
9 If the list of numbers included a single digit number like 2, 3 or 9, then the stem
would be 0 and the leaves would be 2, 3 and 9.
Exercise 2.3
Given below are the heights (in centimetres) of 90 Grade 8 boys in an Eastern Cape
school as recorded in the 2009 Census@School:
165 148 158 150 160 165 150 156 155 164 162 160 158 148 158
140 146 160 148 152 139 165 148 160 156 158 170 155 160 148
155 158 179 170 158 161 155 160 163 178 138 172 170 156 160
160 171 140 160 170 175 148 170 177 155 167 154 160 170 155
136 179 150 167 148 160 164 167 157 165 163 140 162 178 160
170 163 162 165 175 165 152 147 180 148 170 165 167 165 165
27
C E N S U S @ S C H O O L
)
Example 2.3
The heights of 10 boys and 10 girls in Grade 7 were randomly
selected.
a) Draw a back-to-back stem-and-leaf diagram to illustrate the data
b) Write down at least 2 conclusions about the heights.
Heights of a random selection of 20 learners in Grade 7 in cm
Boys 171; 156; 154; 160; 145; 147; 149; 160; 150; 147
Girls 162; 155; 155; 155; 164; 170; 149; 156; 161; 164
Solution
a) STEP 1 : Decide on the values of the stems and draw up the table.
GIRLS’ HEIGHT BOYS HEIGHTS
in centimetres in centimetres
14
15
16
17
28
C E N S U S @ S C H O O L
)
Example 2.4
Organise the heights (in centimetres) of 90 Grade 8 boys in an
Eastern Cape school as recorded in the 2009 Census@School in a
grouped frequency table:
Solution
o The heights range from 139 cm to 180 cm.
o We can organise these heights
o 130 < h ч 140 means that the heights
into the following class intervals:
are more than 130 cm, but less than
130 < h ч 140
140 cm or equal to 140 cm.
140 < h ч 150
150 < h ч 160 o 140 < h ч 150 means that the heights
160 < h ч 170 are more than 140 cm, but less than
170 < h ч 180 150 cm or equal to 150 cm.
o The following grouped frequency table shows the heights of the learners:
Height in
Frequency
centimetres
130 < h ч 140 6
140 < h ч 150 13
150 < h ч 160 31
160 < h ч 170 30
170 < h ч 180 10
29
C E N S U S @ S C H O O L
NOTE:
9 The groups do not overlap at all.
Notice that the groups were written 130 < h ч 140 and then 140 < h ч 150.
In other words, you have to be careful that a particular height (e.g. 140 cm) does
not appear in two different intervals.
9 The data in this example – the heights of the Eastern Cape boys – is continuous
data rounded off to the nearest centimetre.
Exercise 2.4
Mathematics Marks
32 ; 56 ; 45 ; 78 ; 77 ; 59 ; 65 ; 54 ; 54 ; 39 ; 45 ; 44 ; 52 ; 47 ; 50 ; 52 ;
51 ; 40 ; 69 ; 72 ; 36 ; 57 ; 55 ; 47 ; 33 ; 39 ; 66 ; 61 ; 48 ; 45 ; 53 ; 57
a) Use intervals 1,50 < x d 1,55 ; 1,55 < x d 1,60 ; etc to draw a grouped
frequency table for Lerato’s data.
b) How many learners are taller than 1,75m?
30
C E N S U S @ S C H O O L
12 20 13 15 22 3
6 24 20 15 9 12
5 6 8 30 7 12
14 25 2 6 12 20
18 3 18 8 9 20
4) The metro bus company in Durban did a survey to find out how many learners
used a particular bus to come to school in town. They counted the number of
learners on the bus each time it arrived in town. The numbers are given below:
11 25 60 58 55 16 23 2 44 26
49 8 14 24 7 16 47 5 30 34
9 33 10 21 1 56 32 19 6 1
21 42 9 35 25 55 37 52 15 7
31 25 6
a) Draw a grouped frequency table to organise the data. You will need to think
of appropriate group intervals.
b) Why do you think the bus company might want to know the numbers of
learners on the bus?
5) A survey was conducted to find the colour of eyes of South African learners and
the following results were recorded and organised in a table:
31
C E N S U S @ S C H O O L
Chapter
3
Mode, Mean, Median
In this chapter you will:
x Learn about measures of central tendency: Mode, Mean, and Median
x Learn about measures of dispersion: Range and Extreme values
W hen you have a data set, it is possible to summarise the data with one single
number (also called a ‘statistic’). These single numbers are called either
Measures of Central Tendency or Measures of Dispersion or Spread.
Mode
9 The mode is the number in your data that occurs most often. You can also say
the mode is the value that has the highest frequency.
9 Sometimes two scores (or numbers) occur equally often and then the data set
has more than one mode. If there are two modes we say that the data set is
bimodal. If there are more than two modes, we say that the data set is multi-
modal.
9 Other times there might not be any number that occurs more often than any
other number. This means that a data set may have no mode.
33
C E N S U S @ S C H O O L
)
Example 3.1
For each of these sets of scores, find the mode.
a) 1, 7, 9, 4, 3, 5, 9, 3, 2, 9
b) 3, 7, 9, 4, 3, 5, 9, 3, 2, 9
c) 1, 2, 3, 4, 5, 6, 7, 8, 9
Solution
a) Order the scores from lowest to highest: 1, 2, 3, 3, 4, 5, 7, 9, 9, 9
It is now easy to see which score is the mode:
The score that occurs most, is 9, the mode of the data set.
Exercise 3.1
A random sample of ten learners was selected from the 2009
Census@School data base.
1) The following data set contains the ages of ten learners in years:
9, 15, 9, 15, 17, 15, 11, 18, 15, 19
a) Order the ages from the smallest to the biggest.
b) How many learners are 9 years old?
c) What is the highest age?
d) How many learners are older than 15?
e) How many learners are 15 years old?
f) Which age occurs most often?
g) What is the mode?
2) The following data set contains the heights of the ten learners in centimetres.
138, 161, 121, 170, 170, 165, 142, 160, 140, 182
a) Rank the heights from the lowest to the highest.
b) Which is the smallest height?
c) How many learners are taller than 160 centimetres?
d) How many learners are 170 centimetres tall?
e) What height is the mode?
34
C E N S U S @ S C H O O L
3) Learners were asked how many people lived in their home. The data set below
shows their answers: 10, 14, 6, 8, 3, 2, 6, 7, 7, 4
a) Rank the answers from smallest to largest.
b) How many learners are in the data set?
c) What is the fewest number of people in a home?
d) What is the most number of people in a home?
e) How many homes have 6 people?
f) How many homes have 7 people?
g) What is the mode of the people in the home?
h) If you only look at the first 7 learners in the data set, what will the mode be?
i) If you only look at the first 5 learners in the data set, what will the mode be?
4) Grade 9 learners had to say which sport is their favourite sport at school. The
frequency of the learners who selected a particular sport is represented in the
bar graph below. Study the graph and answer the questions.
35
C E N S U S @ S C H O O L
36
C E N S U S @ S C H O O L
Mean
9 The mean is also known as the arithmetic mean or average.
9 To determine the mean you add all the scores together and then divide the sum
by the number (n) of scores.
sum of all the scores
9 We can use the formula: Mean = to calculate the mean.
number of scores
9 The “sum of all the numbers” is calculated by adding all the numbers together.
)
Example 3.2
Find the mean of the following set of scores:
1, 7, 9, 4, 3, 5, 9, 3, 2, 8
Solution
The sum of all the scores = 1 7 9 4 3 5 9 3 2 8 50
There are 10 scores in the data set, therefore to calculate the mean, you divide the
sum (50) by the number of scores or numbers (10).
37
C E N S U S @ S C H O O L
Exercise 3.2
1) The following data set consists of the ages (in years) of 5 learners:
9, 15, 11, 18, 19
a) What is the sum of the ages of the learners?
b) What is mean of the ages?
2) The data set below contains the ages of a few learners in years.
9, 15, 9, 15, 17, 15, 11, 18, 15, 19
a) The ages of how many learners are in this data set?
b) What is the sum of the ages of the learners?
c) What is mean of the ages?
d) What is the mean of the first 5 ages?
38
C E N S U S @ S C H O O L
Median
9 The median of a data set is the number right in the middle of an ordered data
set.
9 If there is an odd number of scores, the median is the middle number.
9 If there is an even number of scores, then two numbers will be the middle numbers.
In that case, you add the two numbers together and divide by 2.
)
Example 3.3
What is the median of each of the following sets of numbers?
a) 1, 7, 9, 4, 3, 5, 8, 3, 2
b) 2, 5, 7, 9, 10, 4, 12, 1, 15, 3
Solution
a) First order (or rank) the numbers in the data set from the smallest to the
greatest number: 1, 2, 3, 3, 4, 5, 7, 8, 9
There are 9 items in the data set.
The middle item is the number 4.
So the median = 4
b) First order (or rank) the scores from the smallest to the greatest number.
The data set looks as follows: 1, 2, 3, 4, 5, 7, 9, 10, 12, 15
There are 10 scores in this data set.
There is not only one score in the middle, but there are two, namely 5 and 7.
In this case, you add the two numbers in the middle and divide the sum by two
(2) to calculate the median:
57 12
Median = 6
2 2
NOTE: The median of this data set is 6, although the 6 did not actually appear in
the data set.
39
C E N S U S @ S C H O O L
Exercise 3.3
1) The following data set contains the ages, in years, of a few learners.
9, 15, 11, 18, 19
a) How many ages of learners are in this data set?
b) Is the number of ages an odd or an even number?
c) Rank the ages from the smallest to the biggest.
d) What is the median of the ages?
3) The following data set contains the heights of Grade 9 learners in metres.
1,81; 1,53; 1,6; 1,5; 1,28; 1,65; 1,75; 1,58; 1,13; 1,68; 1,77
a) Rank the heights of the learners from the smallest to tallest.
b) How many heights of learners are in this data set?
c) What is the median height in metres of the learners?
4) Grade 8 learners were asked how long it took them to travel to school.
The following data set contains these times (in hours).
0,50; 0,58; 0,67; 0,75; 0,50; 0,83; 1,00; 1,33; 0,58
a) Rank the travelling times from the shortest to the longest.
b) How many travelling times are in this data set?
c) What is the travelling time of the
(i) 3rd learner?
(ii) 7th learner?
(iii) 8th learner?
d) Which learner(s) travelled 30 minutes to get to school?
e) What is the position of the learner who travels 0,75 hours to get to school?
f) What is the position of the modal travelling time?
g) What is the modal travelling time (in hours)?
40
C E N S U S @ S C H O O L
5) a) A data set has 121 ordered data values. In which position will the
median be?
b) A data set has 502 ordered data values. Explain how you will determine
the median’s position and how you will calculate the median.
6) Thembi makes the following statement. “The median of a data set with 40
ordered data values is in the 20th position.”
Is this a valid statement? Explain your answer.
9 If you do not know the song or the melody, write a song yourself to assist you in
remembering the difference between the three measures of central tendency.
41
C E N S U S @ S C H O O L
Range
The difference between the highest value and the lowest value in a data set is called
the range.
Range = highest value – lowest value
)
Example 3.4
Solution
The lowest value is 1 and the highest value is 15.
So the Range = highest value – lowest value
= 15 – 1
= 14
42
C E N S U S @ S C H O O L
Exercise 3.4
1) The following data set contains the ages of a few learners: 9, 15, 11, 18, 19
a) Write the ages in order.
b) What is the highest age?
c) What is the lowest age?
d) What is the difference between the highest age and the lowest age?
e) What is the range of this data set?
4) Learners were asked how many minutes they travelled to school every day.
They answered as follows: 10, 45, 11, 5, 20, 10, 5, 24
a) What is the shortest travelling time?
b) What is the longest travelling time?
c) What is the range of travelling times?
43
C E N S U S @ S C H O O L
Extreme values
9 Extreme values are unexpected large or small values in a data set.
)
Example 3.5
The lengths (in centimetres) of the right feet of a group of 8-year
old learners are: 13, 18, 18, 19, 20, 20, 20, 21, 22, 31
What do you notice about this data set?
Solution
If you look at the foot lengths carefully, you can see that the lengths of the middle
eight of the foot lengths range from 18 cm to 22 cm.
13, 18, 18, 19, 20, 20, 20, 21, 22, 31
o The first learner has a very small right foot of only 13 cm.
o The largest right foot is 31 cm long.
o The very small value and the very big value are unexpected. They are called
extreme values.
44
C E N S U S @ S C H O O L
Exercise 3.5
1) A group of learners was asked to say how many minutes it took them to travel
to school every day.
They answered as follows: 45, 5, 20, 50, 13, 15, 18, 15, 90, 5, 15, 25, 15, 4
a) How many values are in the data set?
b) Rank the travelling times.
c) Identify the extreme value(s) and say why you think they are extreme.
2) In the Census@School survey, learners had to say how many children in their
home were still at school. The results are given in the following table:
Household Boys Girls
1 1 1
2 1 1
3 3 1
4 1 2
5 1 1
6 2 0
7 1 2
8 2 1
9 0 0
10 0 1
11 1 2
12 12 1
13 1 3
14 3 3
15 1 4
45
C E N S U S @ S C H O O L
3) The heights (in centimetres) of Grade 9 learners are given in the table below:
169 181 145 159 160
171 165 109 168 170
173 176 140 178 155
150 170 162 146 151
a) Rank the heights from shortest to tallest.
b) How many heights are in the data set?
c) What is the modal height?
d) What is the median of the heights?
e) What is the mean of the heights of the learners?
f) What is the range of the heights?
g) Identify an extreme value in the data.
h) Delete the extreme value from the data and recalculate the mean of the
data to 1 decimal place. What do you notice about the original mean and the
new mean?
i) Did the mode and the median change when you deleted the extreme value?
4) Explain the difference between a data set with no mode and a data set with a
mode of 0? Give your own example.
46
C E N S U S @ S C H O O L
Chapter
4
Representing Data
In this chapter you will:
x Draw graphs using pictograms
x Learn to draw 3 different types of bar graphs:
x Learn to draw Pie Charts, Histograms, Broken Line Graphs and Scatter plots
x Learn about scales on a graph
x Learn to choose an appropriate graph to represent given a set of data.
G raphs are often used to display data. Many people find graphs simpler to
understand than a table. They are also more attractive and interesting to look
at. A graph can show data clearly without lots of words or figures. This helps
you to see any patterns and to compare things easily.
PICTOGRAPHS
NOTE A pictograph gives you a quick impression of the
A pictograph uses given information.
simple pictures or
symbols to show data
Pictographs are often used in newspapers, magazines, books and on television because
comparing data in a pictograph is easy; just compare how many pictures each item has.
47
C E N S U S @ S C H O O L
EXAMPLE 4.1
)
The 2009 Census@School was run at Anele’s school.
The results were analysed and it was found that the favourite sport
of the boys in her school was soccer, rugby, cricket and athletics.
Anele drew up a table to show the percentage of the boys that
liked each sport.
Draw a pictograph to represent this data
SOLUTION:
In order to draw a pictograph, we need a title and a key. Title
x The title tells you what Favourite Sports – Males
the pictograph is about. Soccer -------
Rugby ---
x The key shows you what
each little picture “stands Cricket ----
for” or “represents”. Athletics -
x If each - = 2% of the No Favourite Sport -
males, then KEY: - = 2% of males
= half of 2%
= 1% of males Key
x So to represent 7% who
like cricket, we use three
full pictures and 1 half picture like this: Cricket - - -
48
C E N S U S @ S C H O O L
Drawing a pictograph
x Choose a simple picture or symbol that is easy to draw. Try to make it ‘look like’
the data in some way.
x Always give a key.
9 You must say what each picture stands for.
9 Each picture may represent more than 1 piece of data. Choose an
easy number to work with.
9 Part of a picture represents part (or a fraction) of that number
x Draw your pictograph on squared paper. This will help you to keep the pictures
in line.
9 Try to draw each picture the same size and space them out evenly.
9 Estimate fractions of a picture where necessary
x You could also use different pictures for different kinds of data.
EXAMPLE 4.2
)
The Census@School questionnaire also asks learners about the
type of dwelling or house in which they live. Anele drew a
pictograph to show the number of learners in her school living in
each type of dwelling.
SOLUTION
a) The pictograph uses 7 full houses plus half a house to represent the
percentage of the learners living in Formal Dwellings.
This means that 10% + 10% + 10% + 10% + 10% + 10% + 10% + 5%
= 75% of learners live in Formal Dwellings.
b) It also uses one full house plus half a house to represent the percentage of
the learners living in Traditional Dwellings.
This means that 10% + 5% = 15% of the learners live in Traditional Dwellings.
49
C E N S U S @ S C H O O L
EXERCISE 4.1
2) The following results were obtained in David’s school after the Census@School
was run at his school. Draw a pictograph to show the results.
2) They are not always a very accurate way of showing data because:
9 the data is often simplified before drawing;
9 fractions of pictures are estimated; etc.
50
C E N S U S @ S C H O O L
Bar Graphs
The length of the bar stands for the size of the data
NOTE
it represents. This makes the data easy to compare. Just
A bar graph uses bars
compare the lengths of the bars. The bars can be drawn
to display data. horizontally or vertically, and gaps are left between the bars.
Example 4.3
)
Draw a bar graph to illustrate the facilities and services at the
schools that took part in the 2009 Census@School
Facilities and services at school Percentage
Maths teacher 68,6
Electricity 65,9
Running water 60,5
Telephone 69,8
Library 24,6
Computer 53,0
Email 14,7
Internet 14,5
SOLUTION:
x First, you need a title for the graph.
For this graph the title is “Facilities and services at school”.
x Next you need two axes and a label for each axis.
In this example we want a vertical bar graph, so the bars will go up the page.
Since the percentage values will indicate the height of the graph, this will be on
the vertical axis.
o Vertical axis – Percentage
o Horizontal axis –Type of facility or service.
x Determine the scale on the vertical axis.
The highest percentage value is 68,6 %, so you can choose the vertical axis to go
up to 80. The interval of the scale will be 10 units.
x Finally, draw in each bar corresponding to the value of the category.
So, for the “maths teacher” category we draw a graph that is tall enough to
represent 68,6%.
Keep in mind that often we have to approximate the height of the bar since
some scales do not make drawing exact heights possible.
51
C E N S U S @ S C H O O L
52
C E N S U S @ S C H O O L
o GRAPH 1 and GRAPH 2 contain the same information. They show the birthdays
per quarter of a random sample of 80 learners in Grade 8 in Kgomotso’s school.
o Look at how the scale of the vertical axis makes a difference in the way you
interpret the information displayed on the graphs.
learners have a birthday in the 1st quarter and 17 learners have a birthday in
the 4th term.
NOTE: Direct comparison can only be made when the scale begins at zero.
53
C E N S U S @ S C H O O L
x On some bar graphs the scale is very easy. Each ‘small space’ stands for 1
unit.
0 1 2 3 0 1 2 3 4 5 6
0 5 10 15 0 10
x Some scales are not so easy. Each small space may stand for more than 1, or
for a fraction
0 100 0 20 0 1 2
o Changing the interval of the scale also affects the way a bar graph looks.
54
C E N S U S @ S C H O O L
Exercise 4.2
SABC1 78
SABC 2 44
SABC 3 75
MNET 33
e TV 23
2) 180 learners who took part in the 2009 Census@School were randomly selected
from the sample of schools. The following table was drawn showing the
provinces in which they were born. Draw a bar graph to illustrate this
information
NUMBER OF
PROVINCE
LEARNERS
Gauteng 39
Western Cape 25
Mpumalanga 25
Kwa-Zulu Natal 22
Northern Province 19
North West 14
Northern Cape 13
Free State 12
Eastern Cape 9
Outside of South Africa 2
55
C E N S U S @ S C H O O L
o If you look at separate bar graphs, it is not very easy to compare them.
o We can combine two bar graphs and put the bars next to each other. This
makes it much easier to compare the two data sets.
EXAMPLE 4.4
)
The 2009 Census@School asked learners what their favourite sport
was. The following table gives the percentage girls and percentage
boys who reported that the following sports were their favourites.
56
C E N S U S @ S C H O O L
Here is another type of compound bar graph representing the data in Example 4.4:
9 Notice that the bars are not drawn alongside each other, but instead
are drawn on both sides of the vertical axis. This is useful because it helps to
make a quick visual comparison across each data category (sport).
9 You can see that 61,6% of boys like soccer, in comparison with just
6% of the girls. In contrast the netball seems to be the most popular sport for
girls; 50,5% of the girls chose this as their favourite, while only 0,8 % of the boys
chose netball.
57
C E N S U S @ S C H O O L
Exercise 4.3
Study the table showing the differences in the access to goods and services
between the learners in the 2001 Census@School and the 2009 Census@School.
Draw a compound bar graph to show the percentage of learners’ access to goods
and services in the 2001 Census@School and the 2009 Census@School.
58
C E N S U S @ S C H O O L
EXAMPLE 4.5
)
A random sample of learners was taken from the results of the
2009 Census@School, and the following table was drawn.
Favourite Soccer / No
Netball Rugby Cricket Athletics Aerobics
Sport Football Favourite
Girls 4 14 18 1 1 4 2
Boys 19 7 2 5 5 1 1
Solution:
i. Use the data to draw the bars for the girls just as you would draw a single
bar graph.
o So, for soccer you would draw a bar with a length of 4 units.
ii. Draw all the rest of the bars to represent the girls.
iii. Now, for each category, draw the bar for the boys directly above the bars for
the girls, touching the bar for the girls. Shade this in a different colour from
the girls’ bar.
o This means that for soccer, you would draw a bar of length 19 units
above the girls’ bar you drew earlier. So the total height for bar for the
soccer category will be 4 + 19 = 23 units.
FAVOURITE SPORTS
25
Number of learners
20
15
10 B
5
G
0
Soccer No Netball Rugby Cricket Athletics Aerobics
favourite
59
C E N S U S @ S C H O O L
Exercise 4.4
In the 2009 Census@School, 600 learners from Grades 3 to 7 were asked which
subject was their favourite.
60
C E N S U S @ S C H O O L
Pie Charts
NOTE The whole circle stands for the whole amount of
A pie chart is a circular data being dealt with.
diagram used to Each slice stands for part of the data. Its size represents the size of
display data. that part of the data.
9 A pie chart is particularly suitable if you want to illustrate how the “whole” of some
data is divided up into different parts, and what portion of the whole each part
represents.
9 We can write this portion as a fraction, as a decimal fraction, or as a percentage of
the whole.
5) Label each slice carefully. If it is difficult to fit the full name of each group on
each slice, label each with a letter and use a key to say what each stands for.
6) Give the pie graph a suitable title.
61
C E N S U S @ S C H O O L
EXAMPLE 4.6
)
The eye colours of a random sample of 120 learners who took part
in the 2009 Census@School were as follows. Draw a pie chart to
display this data.
Eye colour Frequency
Blue 2
Brown 104
Green 4
Other 10
TOTAL 120
Solution:
Calculate the angles of the pie chart as follows
1) The “whole” is 120 students
2) There are 4 different parts; that is, 4 eye colours.
3) Divide 360° by the “whole” : 360° y 120 = 3°
4) Multiply this answer by the number of items for each part:
62
C E N S U S @ S C H O O L
EXERCISE 4.5
1) Thirty learners were asked how they travel to school, and the
results were recorded in the given table.
a) Copy the table
b) Work out the angle at the centre of the circle for each learner by calculating
360° y 30
c) Fill in the rest of the table
d) Draw a pie chart to show this information
63
C E N S U S @ S C H O O L
2. 90 people were asked which month they were born in. Here are the results:
3. A survey was done of 120 learners from Grade 8, to find out what their favourite
subject was at school.
It was found that 30 prefer History, 40 prefer Geography and 50 prefer Maths.
a) Illustrate this information by drawing:
i) A table to illustrate this data.
ii) A bar chart
iii) A pie chart
b) Which one of the two graphs do you think represents this information
better? Give a reason for your answer.
64
C E N S U S @ S C H O O L
Histograms
NOTE Bars are drawn corresponding in height to the
A histogram is a graph frequency of each group.
of grouped data and The intervals or groups are shown along the horizontal axis
has no gaps between The frequency (or how often something happens) is shown along
the bars. the vertical axis.
EXAMPLE 4.7
)
The heights of the heights of 150 learners in Grade 7 at Makhosi’s
school are recorded in the following table.
Frequency /
Height (cm)
number of learners
115 ч h < 120 6
120 ч h < 125 6
125 ч h < 130 29
130 ч h < 135 30
135 ч h < 140 35
140 ч h < 145 26
145 ч h < 150 12
150 ч h < 150 6
Solution:
x Draw the x-axis. Label this axis “Heights of learners”.
o In this example, the x-axis would have 8 intervals since the table has 8
intervals.
o Use the squares on the graph paper to make each interval the same size.
x Draw the y-axis. This will show the frequency.
o Label it “Number of learners”.
x Draw bars on the histogram with heights corresponding to the frequency of that
interval.
o So the first interval; 115 ч h < 120 will have a height of 6 units.
65
C E N S U S @ S C H O O L
EXERCISE 4.6
1) In the 2009 Census@School, the learners Time in minutes frequency
from Grades 3 to 7 at Ruth’s school were 0 ч m < 10 45
asked how long in minutes, it takes for 10 ч m < 20 48
them to travel to school. 20 ч m < 30 35
30 ч m < 40 14
The table shows the results from a sample 40 ч m < 50 6
of 150 learners.
50 ч m < 60 1
60 ч m < 70 0
Draw a histogram to illustrate this data.
70 ч m < 80 1
66
C E N S U S @ S C H O O L
Scatter Plots
NOTE Each axis represents a different variable.
We plot values of one quantity against corresponding values of
A scatter plot is used to
display two sets of
another quantity.
data in order to find a
relationship between A scatter plot shows trends
them.
For example …
Joe has a job selling ice cream. When will he sell the most ice cream?
o Joe says he will sell the most ice cream on a hot day and less ice cream on a
cooler day.
o He thinks he will hardly sell any ice cream on a very cold day.
o Joe calls this common sense. In maths it is called a correlation.
d
l 45
o
s 40
35
m
a 30
e
rc 25 Notice: The points on the
e 20
c
i 15 graph more or less form a
fo 10
re 5 straight line. We say there
b 0
m 0 10 20 30 40
is a positive linear
u
N Temperature correlation between the
number of ice creams sold
and the temperature.
67
C E N S U S @ S C H O O L
EXAMPLE 4.8
)
Use these results from a group of randomly selected learners who
participated in the 2009 Census@School to draw a scatter plot to
see if there is a correlation between age in years and grade
number.
Age in years Grade In school
13 8
16 10
16 11
11 6
14 7
13 6
18 12
10 4
9 3
10 5
Solution:
Draw the graph as follows:
x Draw two axes. Plot the age in years along the horizontal axis and the Grade
along the vertical axis.
x Choose an appropriate scale for each axis.
Here you can make both intervals 1 unit.
x Plot each set of points.
So the first point is Age = 13 and Grade = 8.
Similarly plot each set of point given. Here is the completed graph:
This point
corresponds to the
first set of data:
Age 13, Grade 8
CONCLUSION: There is a correlation between the age of a learner and the grade
they are in. As the age increases, the grade number increases.
68
C E N S U S @ S C H O O L
Exercise 4.7
1) 12 learners measure their heights and hand spans and tabulated their results.
Copy the given set of axes and draw a scatter plot to see if there is a relationship
between height and hand span.
Hand span in cm 26 25 20 19 25 18 22 27 23 24 23
Height in cm 177 175 165 164 172 160 180 164 175 171 169
2) A group of 14 learners measure their heights and foot sizes and tabulated their
results. Copy the given set of axes and draw a scatter plot to see whether foot
size increases with height.
Foot Height
(cm) (cm)
27 174
24 158
24 163
26 175
23 140
23 126
22 153
25 160
24 160
20 131
23 156
25 165
24 158
20 120
69
C E N S U S @ S C H O O L
Chapter
5
Interpreting Data
In this chapter you will:
x Look at bar graphs, pie charts and line graphs and interpret the information.
x Consider how information on bar graphs, pie charts and line graphs can be misleading
x Know more about errors when collecting and interpreting data
S o far in this workbook you have looked at different ways to collect, organise and
present information or data. Remember that the whole point of collecting data
is to help you understand more about the world you live in. In particular, by using
the Census@School data you find out about different schools and the learners who
attend them.
71
C E N S U S @ S C H O O L
)
Example 5.1
Study the bar graph below taken from the 2009 South African
Census@School results. It shows the modes of transport to school
from home of the learners who participated in the Census@School.
Solution
1) In order to find out what the graph is about, we look at the title. The title tells
us that the graph shows how learners get to school each day.
2) The axes are clearly labelled.
The horizontal axis shows the different forms of transport: walk, car, train, bus,
bicycle, scooter, taxi and other.
The vertical axis shows percentage.
3) The scale is clear and easy to read.
You cannot read exact percentages for the number of learners walking to
school, but it is easy to estimate that nearly 80% (or over 70%) walk to school.
4) You can immediately see that most of the learners in the census walk to school
from home. The other learners mostly use car, bus or taxis to get to school.
72
C E N S U S @ S C H O O L
)
Example 5.2
The following graph (Graph B) illustrates the same data as the bar
graph on the previous page (Graph A).
Compare the two graphs. Why is it more difficult to read
information off Graph B than Graph A?
Solution
1) There is no heading to the graph so it is difficult to tell what the graph is about.
2) The vertical axis has no label. It might be showing actual numbers or
percentages. It is not clear.
3) The cones instead of bars do not give a clear ‘picture’ of what the information is
saying. Three dimensional (3D) shapes like these are usually more difficult to
read than two dimensional (2D) shapes like bars.
73
C E N S U S @ S C H O O L
Exercise 5.1
1) The following table shows the birth months of approximately 45 000 South
African learners who took part in the 2009 Census@School. The bar graph that
follows has been drawn to illustrate some of the information.
74
C E N S U S @ S C H O O L
3) Graph A and Graph B show the number of cars sold each year by a particular car
dealer.
75
C E N S U S @ S C H O O L
)
Example 5.3
In the graph shown (taken from the South African Census@School
2009 results) the favourite subjects of male and female learners in
Grades 8 to 12 were compared.
1) What was the favourite subject of both the boys and the girls?
2) What was the next favourite subject of both the boys and the girls?
3) What percentage of the boy learners and the girl learners say that Technology is
their favourite subject?
Solution
1) Languages were the favourite subject of both boys and girls
2) The next favourite for both the boys and the girls is Mathematics
3) 3,6% of boys chose Technology as their favourite subject, while only 1,5% of the
girls chose it.
76
C E N S U S @ S C H O O L
Exercise 5.2
2) The bar graph below shows the favourite subject of the learners in Grades 3 to
7. Compare this graph to the graph showing the favourite subjects of the
learners in Grades 8 to 12 on the previous page (page 74).
a) What subject is the favourite for Grades 3 to 7 and which subject is the
favourite for Grades 8 to 12?
b) List the favourite subjects of the Grade 3 to 7 girls from most favourite to
least favourite
c) What was the least favourite subject for all Grade 3 to 7 learners?
77
C E N S U S @ S C H O O L
3) The two graphs below show information about the favourite sport for boy and
girl learners in the 2001 and 2009 Census@School.
a) Write at least 3 sentences describing the differences in the favourite sport
for boy and girl learners.
b) Have the favourite sports changed at all from 2001 and 2009? How can you
tell?
78
C E N S U S @ S C H O O L
79
C E N S U S @ S C H O O L
)
Example 5.4
The following pie chart is taken from the 2009 South African
Census@School results. It shows the distances that learners
travelled from home to school each day.
Solution
1) The title is “Travel distance to school from home” and tells us that the pie chart
shows the distance travelled by learners to get to school each day.
80
C E N S U S @ S C H O O L
Exercise 5.3
1) Look at the two pie graphs below and then answer the following questions
about them:
a) What do the graphs represent?
b) Which food type is eaten most by males?
c) Which food type is eaten least by males?
d) Which food type is eaten most and eaten least by females?
e) What is the difference between the dairy consumption of males and the
dairy consumption of females?
f) Write down any other observations you can make about the data
represented in the pie charts.
81
C E N S U S @ S C H O O L
2) Look at the table and graphs shown below that give the gender by grade of
learners who participated in Census@School in 2009 and then answer the
questions on the next page.
GENDER
GRADE TOTAL
Boys Girls Unspecified
Grade 3 4 969 4 983 148 10 100
Grade 4 4 180 4 121 85 8 386
Grade 5 5 126 5 213 84 10 423
Grade 6 5 825 6 128 75 12 028
Grade 7 7 510 7 913 105 15 528
Grade 8 9 654 10 782 118 20 554
Grade 9 9 847 10 547 183 20 577
Grade 10 9 521 9 974 127 19 622
Grade 11 7 521 9 158 96 16 775
Grade 12 6 027 7 862 123 14 012
Total 70 180 76 681 1 144 148 005
82
C E N S U S @ S C H O O L
a) Is the table, the bar chart or the pie charts easier to read? Explain your answer.
b) In which grade were there most learners?
c) Did more boys or more girls participate in the survey? How do you know?
d) Can you tell how many learners participated in the census by looking at the bar
chart or the pie graph?
e) Look carefully at the graph below. In what way is it different to the bar graph on
the previous page? Does it show the same information or different
information?
f) By looking at any of the four graphs for this question, what conclusions can you
make about the gender by grade of the learners who participated in the census?
83
C E N S U S @ S C H O O L
Line Graphs
9 Line graphs are often used to show how a quantity varies.
9 Some important things to help you interpret line graphs are:
1. The title – what is the line graph about?
2. The axes – check the labels of the axes.
3. The scales on the axes – do the scales start at zero? What else do the scales
tell you?
)
Example 5.5
The following line graph is taken from the 2009 South African
Census@School results. It shows the average height of learners by
age and gender. Notice that the heights of the boys and the girls
are compared using two lines on one graph.
Solution
1) The graph shows the average height by grade and gender
2) The vertical axis shows the age of learners and the horizontal axis shows the
grade.
3) Each division on the vertical axis represents 10 cm. The scales on the axes are
easy to read. From the information show you cannot tell actual heights but you
give a general comment about the ages.
4) You can easily see that the average height of boy and girl learners is similar
Grades 3 to 7. However, boys grow faster from Grade 8 to 12 and the average
height of boys in these grades is greater.
84
C E N S U S @ S C H O O L
)
Example 5.6
The following two line graphs show the same information, but look
different.
Solution
o The first graph clearly shows that most learners in Grade 3 are 7 years old, but
there are learners as old as 12 in Grade 3.
o In the second graph the information is much more squashed up and the graph
suggests that there are no learners older than 10 in Grade 3.
85
C E N S U S @ S C H O O L
Exercise 5.4
1) What do you think the person who drew these two graphs wants you to
believe? How would you correct the graphs so that they don’t mislead?
86
C E N S U S @ S C H O O L
Histograms
A bar graph is a graph of ungrouped data and a histogram is a graph of grouped
data.
9 For a histogram, the bars must touch each other, with no spaces between the
bars.
9 Bars must begin and end at the boundaries of the intervals.
For example, if one interval starts just after 1 and ends at 5, the second interval
must start just after 5 and end at 10.
We write this as 1 < x ч 5 and 5 < x ч 10
87
C E N S U S @ S C H O O L
Exercise 5.5
1) The heights of one hundred Grade 12 boys were measured and recorded in the
following table.
Interval Percentage
80 < h ч 100 0
100 < h ч 120 2
120 < h ч 140 1
140 < h ч 160 5
160 < h ч 180 50
180 < h ч 200 40
200 < h ч 220 2
TOTAL 100
a) How wide (in centimetres) is the interval on the horizontal axis in Graph A
and how wide is the interval on the horizontal axis in Graph B?
b) Which of the two graphs best represents the spread of heights of the Grade
12 boys? Give reasons for your answer.
88
C E N S U S @ S C H O O L
2) Two histograms were drawn to show the ages of the boys and the girls who
participated in 2009 in an online Census@School questionnaire.
a) Use the two graphs to answer the following questions:
(i) What was the age of the oldest boys who participated?
(ii) What was the age of the youngest boys who participated?
(iii) What was the age of the oldest girls who participated?
(iv) What was the age of the youngest girls who participated?
(v) What age was the mode for the boys?
(vi) What age was the mode for the girls?
b) Write down at least two other observations you can make about the
information in the two graphs.
89
C E N S U S @ S C H O O L
Sampling
When you take a sample, you need to make sure that you don’t introduce bias
without thinking, e.g. you might only ask a certain group of people - only white
people or only Indian people. Or you might ask only those people who volunteer to
answer your questions.
Misleading Conclusions
Suppose someone tests Battery A and one Battery B. The batteries are each put in a
torch which is switched on. The time taken for the torch to stop working is
measured.
The conclusion is then made that Battery A lasts for up to twice as long as Battery B.
However, this conclusion is misleading. If one Battery A lasts longer than one
Battery B, we cannot use this fact to make a conclusion about all “Battery A’s”.
Dirty data
9 In a large set of data there are almost certainly going to be errors.
9 Errors may arise in many different ways. They could be caused by human errors
(such as someone misreading or miscounting data), to errors due to poor
sampling.
The following list shows some of the ways that errors can creep into lists of data:
x Mistyping:
A correct value might have been obtained when measuring or counting, but
the incorrect number was written down. This can happen when digits are
interchanged, e.g. writing 1 765 instead of 1 756, or when digits are written
twice, e.g. 772 instead of 72.
x Mistaken answer:
In an interview the respondent may misunderstand a question and so give an
incorrect answer, e.g. giving a yearly income instead of a monthly income.
x Mistaken measurement:
Many people like to write down ‘nice’ numbers, so instead of writing 31 they
might write 30.
90
C E N S U S @ S C H O O L
Exercise 5.6
o In this exercise we will look at a data set and find out where errors might have
crept in.
o It has been adapted from a worksheet found at
http://www.censusatschool.ie/en/resources/strand1statistics/118-cleaning-up-
your-data.
o First read the information below carefully.
1) Study the Dirty Spreadsheet on the next page. Find examples of where each of
the following characters tampered with (changed) the data.
a) Pointy Pete
b) Obvious Olive
c) Silly Samantha
d) Devious Dave.
2) The tampered data needs to be corrected. Explain how you will correct the
work of:
a) Pointy Pete
b) Obvious Olive
c) Silly Samantha
d) Devious Dave.
3) Investigate the Spreadsheet further and list any other dirty data on the
spreadsheet.
91
C E N S U S @ S C H O O L
Dirty Spreadsheet
Boy or Date of Height Foot Length Favourite Distance to School
Row 1 Grade
Girl Birth in cm in cm Subject in km
Row 2 Boy 12/04/91 5 143 26 Art 1-2km
Row 3 Girl 31/02/92 4 132 22 Science less than 2 km
Row 4 Girl 14/01/91 5,00 14,2 2,3 PE/Sport 2,5423 km
Row 5 Boy 07/09/89 6 136 25 Art 1-2km
Row 6 Boy 13/12/91 4 128 24 PE/Sport 1-2km
Row 7 Boy 14/03/01 5 140 67 PE/Sport less than 1 km
Row 8 Girl 06/05/89 7 142 24 Art 3-5km
Row 9 Girl 15/08/90 6 138 21 Art 85km
Row 10 Boy 20/02/90 6 192 23 PE/Sport 1-2km
Row 11 Girl 19/05/90 6 140 20 Maths 1-2km
Row 12 Neither 29/06/92 7 48 21 Going Home 3 000km
Row 13 Boy 09/10/91 4 128 21 English less than 1 km
Row 14 Girl 18/12/90 5 135 21 Geography less than 1 km
Row 15 Girl 18/07/91 0,5 13,7 20 Art 3-5km
Row 16 Boy 03/06/34 4 129 21 Art less than 1 km
Row 17 Girl 13/02/89 7 148 23 Art 1-2km
Row 18 Girl 15/09/88 7 150 22,5 PE/Sport 1-2km
Row 19 Girl 07/08/89 7 140 24 Art less than 1 km
Row 20 Boy 08/06/89 7 142 24 Computing less than 1 km
Row 21 Boy 31/11/87 11 1 520 22 Computing 5-10km
Row 22 Both 16/07/88 8 142 26 Japanese 2-3 km
Row 23 Girl 28/04/88 8 145 26,5 PE/Sport 1 mile
Row 24 Boy 25/03/92 4,1 132,1 2,4,5 Maths less than 1 km
Row 25 Boy 26/02/92 4 130 21 PE/Sport less than 1 km
Row 26 Girl 08/07/99 6 142 22 Art 2-3 km
Row 27 Boy 23/05/90 6 151 25,5 Maths 2-3 km
Row 28 Boy 01/03/87 9 162 25 PE/Sport less than 1 km
Row 29 Girl 07/08/91 6 150 23 History 2 roads
Row 30 Girl 03/03/92 4 135 21 English less than 1 km
92
C E N S U S @ S C H O O L
Chapter
6
Relative Frequency and
Probability
93
C E N S U S @ S C H O O L
9 Rather than using words to describe the chance of an event happening, you can
give probability as a number between 0 and 1.
9 This number can be written as a fraction, percentage or decimal.
x If it is impossible for an event to happen, the probability is 0.
x If an event is certain to happen, the probability is 1.
x All other probabilities are greater than 0, but less than 1.
Example 6.1
94
C E N S U S @ S C H O O L
Exercise 6.1
1) An eight-sided die (like the one alongside) is thrown.
a) List all the possible outcomes.
b) List all the favourable outcomes for the following
events:
Event A: Getting a 2
Event B: Getting an odd number
Event C: Getting a number bigger than 4.
M A T H S
a) List all the possible outcomes.
b) List all the favourable outcomes for the following events:
Event G: Getting a T
Event H: Getting a vowel
Event J: Getting a consonant
95
C E N S U S @ S C H O O L
5) The table below is from the South African Census@School 2009 Report.
It shows what percentage of learners use particular modes of transport to
school from home.
96
C E N S U S @ S C H O O L
)
Example 6.2
In an experiment a drawing pin is dropped
100 times.
It lands with its point up 37 times.
What is the relative frequency of the drawing pin landing point up?
Solution
Number of times an outcome occurs in an experiment
Relative frequency =
Total number of trials in the experiment
Number of times the drawing pin lands point up
=
Number of times the drawing pin is dropped
37
=
100
= 0,37 or 37%
)
Example 6.3
50 motor cars are observed passing the school gate.
14 of these motor cars are red.
What is the relative frequency of a red motor car passing the school
gate?
Solution
Number of times an outcome occurs in a survey
Relative frequency =
Total number of observations in the survey
97
C E N S U S @ S C H O O L
Exercise 6.2
3) The results of games of chess played by four children at a chess club are shown
in this table.
b) Suppose there are 10 coloured counters in the bag. How many of these
counters do you think are yellow? Explain how you decided on your answer.
99
C E N S U S @ S C H O O L
Calculating probability
9 In the previous section we looked at how you could estimate probabilities by
doing experiments or making observations.
)
Example 6.4
An ordinary die is rolled. What is the
probability of getting:
a) a 6?
b) an odd number?
c) a 2 or 3?
Solution
The possible outcomes on an ordinary die are: 1; 2; 3; 4; 5; and 6.
The total number of possible outcomes is 6.
100
C E N S U S @ S C H O O L
Exercise 6.3
4) The letters of the word PROBABILITY are written on separate cards of the
same size.
The cards are shuffled and dealt, face down, onto a table.
A card is selected at random.
a) What are the possible outcomes?
b) What is the probability that the card shows:
(i) The letter T?
(ii) The letter B?
101
C E N S U S @ S C H O O L
5) A card is randomly taken from a full pack of 52 playing cards with no jokers.
102
C E N S U S @ S C H O O L
103
C E N S U S @ S C H O O L
12) Below is a data table showing the birth months of South African learners. It is
taken from the 2009 Census@School report.
Month Frequency
January 3 568
February 3 592
March 3 922
April 3 490
May 3 888
June 3 981
July 3 597
August 3 876
September 4 476
October 3 502
November 3 253
December 3 809
TOTAL 44 954
104
C E N S U S @ S C H O O L
)
Example 6.5
Makgoshi does the following experiment with a bag containing
2 red counters and 8 blue counters.
EXPERIMENT
x Take a counter from the bag at random.
x Record the colour then put the counter back in the bag.
x Repeat this for 100 trials.
Makgoshi calculated the relative frequency of getting a red counter after every 10
trials and recorded the results in a table.
Number of throws 10 20 30 40 50 60 70 80 90 100
Relative Frequency 0,3 0,15 0,12 0,24 0,2 0,24 0,21 0,2 0,19 0,21
Solution
number of red counters 2 1
a) P(red counter) = 0,2
total number of counters 10 5
b) As the number of trials increases, the relative frequency varies, but eventually
gets close to 0,2. In other words, it gets closer to the calculated probability.
105
C E N S U S @ S C H O O L
Exercise 6.4
106
C E N S U S @ S C H O O L
)
Example 6.6
Solution
1
The probability of getting heads =
2
Predicted frequency of heads
= probability of getting a head × number of times the coin is tossed
1
= × 10
2
=5
)
Example 7
A bag contains 20 counters of different colours.
A counter is randomly taken out, its colour noted and replaced.
The probability of getting a blue counter is 0,4.
How many blue counters are in the bag?
Solution
Number of blue counters
= probability of getting a blue counter × number of counters
= 0,4 × 20
=8
107
C E N S U S @ S C H O O L
Exercise 6.5
1) There are 50 cars in a school car park. Five of the cars are black.
a) What is the probability that the first car to leave the car park will be black?
b) If the probability that a red car is the first to leave is 0,2, how many red cars
are there in the car park?
3) Mr Sithole buys 300 calculators from a supplier which he intends to sell at his
school.
He is warned that the probability of the calculators being faulty is 0,02.
How many of the calculators could he expect to be faulty?
108
C E N S U S @ S C H O O L
)
Example 6.8
A fair coin is tossed twice.
a) Identify all the possible outcomes
b) Find the probability of getting two heads.
Solution
(ii) Use a two-way table.
(i) List the outcomes systematically. 1st throw
1st throw 2nd throw H T
Head (H) Head (H) H HH TH
throw
2nd
H Æ TH
T
T Æ TT
109
C E N S U S @ S C H O O L
a) All three of these methods show that there are four possible outcomes, namely:
HH, HT, TH and TT.
b) There is only one favourable outcome in the event of getting two heads, so
1
P(H;H) =
4
110
C E N S U S @ S C H O O L
)
Example 6.9
Two dice are thrown together.
1) What is the probability of getting a ‘double 6’?
2) What is the probability of getting any ‘double’?
Solution
o We first need to find how many possible outcomes there are.
o We can list them using a two-way table.
First die
1 2 3 4 5 6
1 1;1 2;1 3;1 4;1 5;1 6;1
2 1;2 2;2 3;2 4;2 5;2 6;2
Second die
1. There is only one favourable outcome for the event “getting double 6”, namely
(6;6)
1
So P(6;6) =
36
2. There are six favourable outcomes for the event “getting any double”, namely
(1;1), (2;2), (3;3), (4;4), (5;5) and (6;6)
6 1
So P(any double) =
36 6
111
C E N S U S @ S C H O O L
Exercise 6.6
112
C E N S U S @ S C H O O L
7) Each day a blue car (B), a yellow car (Y) and a red car (R) are parked one behind
the other in a narrow driveway. The order in which they park in the driveway is
not fixed and varies from day to day.
a) List all the possible orders in which the three cars could be parked.
b) What is the probability that, on any particular day, the red car is the first in
the driveway?
113
C E N S U S @ S C H O O L
Chapter
7
Answers
CHAPTER 1 – COLLECTING DATA
Note
This chapter is aimed at getting you to think about doing research using data and the many things
you need to think about when you do such research.
115
C E N S U S @ S C H O O L
116
C E N S U S @ S C H O O L
117
C E N S U S @ S C H O O L
118
C E N S U S @ S C H O O L
2) a)
Heights in metres Frequency
1,50 < x ч 1,55 1
1,55 < x ч 1,60 4
1,60 < x ч 1,65 5
1,65 < x ч 1,70 6
1,70 < x ч 1,75 6
1,75 < x ч 1,80 3
1,80 < x ч 1,85 4
1,85 < x ч 1,90 1
TOTAL 30
b) 3 + 4 + 1 = 8 learners are taller than 1,75 m
3) a)
Stem Leaves
0 2; 3; 3; 5; 6; 6; 6; 7; 8; 8; 9; 9
1 2; 2; 2; 2; 3; 4; 5; 5; 8; 8
2 0; 0; 0; 0; 2; 4; 5
3 0
KEY: 1/2 = 12 hours
4) a)
Number of learners on the bus Number of buses
0 < l ч 10 12
10 < l ч 20 6
20 < l ч 30 9
30 < l ч 40 6
40 < l ч 50 4
50 < l ч 60 6
TOTAL
b) The bus company might want to know the numbers of learners on the bus to know whether
one bus was enough or if maybe more buses were necessary
119
C E N S U S @ S C H O O L
2) a) 121, 138, 140, 142, 160, 161, 165, 170, 170, 182
b) 121 cm is the smallest height.
c) Five (5) learners are taller than 160 cm.
d) Two (2) learners are 170 cm tall.
e) 170 cm is the mode of the heights.
3) a) 2; 3; 4; 5; 6; 7; 7; 8; 10; 14
b) There are ten (10) learners in the data set.
c) The fewest number of people in a home is two (2).
d) The most number of people in a home is fourteen (14).
e) Two (2) homes have 6 people.
f) Two (2) homes have 7 people.
g) There are two (2) modes in this data set: 6 and 7 people.
h) The first seven (7) values are: 10, 14, 6, 8, 3, 2, 6.
Then you rank the values: 2, 3, 6, 6, 8, 10, 14.
It is easy to see that six (6) occurs most often. The mode is 6 people.
i) The first five (5) values are: 10, 14, 6, 8, 3
Then you rank the values: 3, 6, 8, 10, 14.
Each value only occurs once, so there is NO mode in this data set.
120
C E N S U S @ S C H O O L
143
c) The mean of the ages = 14,3 years
10
9 15 9 15 17 65
d) The mean of the first 5 ages = 13 years
5 5
3) a) The mean height of all the learners
138 161 121 170 170 165 142 160 140 1 367
| 151,89 cm.
9 9
b) The mean height of the first 6 learners
138 161 121 170 170 165 925
= | 154,17 cm
6 6
2) First arrange the data values from small to large. 9, 9, 11, 15, 15, 15, 15, 17, 18
a) 18 years
b) 9 years
c) Range = 18 – 9 = 9 years
121
C E N S U S @ S C H O O L
3) Arrange the data values from small to large: 121, 138, 140, 142, 160, 161, 165, 170, 170, 182
a) 182 cm
b) 121 cm
c) Range = 182 – 121 = 61 cm
4) First order the number of minutes from short to long: 5, 5, 10, 10, 11, 20, 24, 45
a) 5 minutes
b) 45 minutes
c) Range = 45 – 5 = 40 minutes
3) a) 109; 140; 145; 146; 150; 151; 155; 159; 160; 162; 165; 168; 169; 170; 170; 171; 173; 176;
178; 181
b) 20 heights
c) Modal height = 170 cm
d) There are 20 values in the data set. This is an even number of values. The middle two
numbers of this data set are 162 cm and 165 cm.
162 165 327
The median = 163,5 cm .
2 2
e) Remember, you do not have to rank the data to calculate the mean.
3 198
Mean = 159,9 cm
20
f) Range = 181 – 109 = 72 cm
g) 109 cm is an extreme value. It is a lot less than the rest of the heights.
3 198 109 3 089
h) New mean = = 162,6 cm.
19 19
The new mean is larger than the old mean, because the extreme value which was an
unexpected low value, was deleted.
i) The mode is still 170 cm, but the median is now the 10th value and is now 162 cm.
4) In a data set with no mode, all values occur equally often, or there is not one value that occurs
more often than the rest. In a data set where the number 0 occurs most often, 0 will be the
mode of the data set. For example: in the data set: 0; 1; 2; 3; 4; 5; 6; 7 there is no mode, but in
the data set 0; 2; 4; 0; 6; 8; 0, the mode = 0.
122
C E N S U S @ S C H O O L
123
C E N S U S @ S C H O O L
124
C E N S U S @ S C H O O L
1) continued
2)
Month No of people Angle Month No of people Angle
January 7 28q July 6 24q
February 4 16q August 8 32q
March 9 36q September 11 44q
April 8 32q October 7 28q
May 7 28q November 10 40q
June 6 24q December 7 28q
3) a)
NUMBER OF
FAVOURITE SUBJECT
LEARNERS
History 30
Geography 40
Maths 50
TOTAL 120
125
C E N S U S @ S C H O O L
b) Answers will vary. The pie chart show both the difference between the groups and how
each group compares to the whole, so this seems to be a good representation of the data.
126
C E N S U S @ S C H O O L
2)
2) Except for the people who have measurements (20; 120), and (23; 126), the taller people
generally have longer foot lengths.
127
C E N S U S @ S C H O O L
2) a) Grades 3 to 7: Maths;
Grades 8 to 12: Languages.
b) Maths; Languages; Literacy; Arts & Culture; Technology; Numeracy; Life Orientation
c) Life Orientation
128
C E N S U S @ S C H O O L
2) a) The bar graph gives a better visual representation of the data, but the table gives more detail
b) The most learners were in Grade 9
c) There were more girls. We can see this from the table.
d) You will find out approximately how many learners participated in the census by adding up
all the bars. However, the scale makes it impossible to find the exact number represented
by each bar.
You could not use the pie graph to say how many learners participated in the survey. The
pie graph only shows percentages of the whole census.
e) The bar graph shows the percentage of boy and girl learners in each grade as opposed to
the number of learners in each grade shown in the first bar graph.
f) In Grades 3 and 4 there are more boys than girls.
In the rest of the grades there are more girls than boys.
2) a) (i) The oldest boys who participated in the survey were 18 or 19 years old
(ii) The youngest boys were 11 or 12 years old
(iii) The oldest girls were 16 or 17 years old
(iv) The youngest girls were 11 or 12 years old
(v) The mode for the boys is 15 to 16 years
(vi) The mode for the girls is 15 to 16 years
b) Some of the sentences you could write are:
x The boys are older than the girls
x More girls who were 14 or 15 responded than boys of the same age.
129
C E N S U S @ S C H O O L
130
C E N S U S @ S C H O O L
3) a) Possible outcomes: M; A; T; H; S
b) The vowels are a, e, i, o and u. All other letters of the alphabet are called consonants,
Event G: T; Event H: A; Event J: M; T; H; S
5) a) Possible outcomes: Walk/Foot; Car; Train; Bus; Bicycle; Scooter; Taxi; Other
b) Event P: Taxi; Event Q: Train; Bus; Taxi; Event R: Bicycle; Scooter
2) We could calculate the relative frequency after 10 games, or after 30 games. It is better to
calculate the relative frequency after 30 games, as the more times an experiment is repeated,
the more accurate our prediction will be.
a) (i) Relative frequency of Helen’s wins
Number of games won by Helen 21 7
= = 0,7 = 70%
Number of games played 30 10
(ii) Relative frequency of Thandi’s wins
Number of games won by Thandi 9 3
= = 0,3 = 30%
Number of games played 30 10
b) It is unlikely that Thandi will win her next match against Helen as 0,3 is much lower than 0,7.
We cannot however, say for certain that Thandi will not win.
131
C E N S U S @ S C H O O L
b) The most accurate of these relative frequencies is 0,45 because we used the most trials to
calculate this relative frequency.
The number of yellow counters in a bag is 0,45 × 10 = 4,5
As there are no fractions of counters, we can predict that there are 4 or 5 yellow counters in
the bag.
2) a) Possible outcomes: R; R; R; G; G; G; G; G; G; G
b) You are more likely to get a green sweet because there are more of them.
132
C E N S U S @ S C H O O L
c) (i) The number of possible outcomes is: 10; The favourable outcome is: R: R; R;
There are 3 favourable outcomes
Number of favourable outcomes in an event 3
P(R) = = 0,3 = 30%
Total number of possible outcomes 10
(ii) The number of possible outcomes is: 10; The favourable outcome is: G: G; G; G; G; G; G
There are 7 favourable outcomes
Number of favourable outcomes in an event 7
P(G) = = 0,7 = 70%
Total number of possible outcomes 10
3) a) Possible outcomes:
Lose a turn; Lose 5 points; Win 5 points; Go back to start; Have an extra go
b) (i) The number of possible outcomes is: 5; The favourable outcome is: Loses a turn
There is 1 favourable outcome
Number of favourable outcomes in an event 1
P(Loses a turn) = = 0,2 = 20%
Total number of possible outcomes 5
(ii) The number of possible outcomes is: 5;
The favourable outcomes are: Lose 5 points; Win 5 points
There are 2 favourable outcomes
Number of favourable outcomes in an event 2
P(Lose or Win 5 pts) = = 0,4 = 40%
Total number of possible outcomes 5
4) a) Possible outcomes: P; R; O; B; A; B; I; L; I; T; Y
b) (i) The number of possible outcomes is: 11; The favourable outcome is: T
There is 1 favourable outcome
1
P(T) = = 0,091 (correct to three decimal places) = 9% (correct to nearest %)
11
(ii) The number of possible outcomes is: 11; The favourable outcomes are: B; B
There are 2 favourable outcomes
2
P(B) = = 0,182 (correct to three decimal places) = 18% (correct to nearest %)
11
5) a) Number of possible outcomes: 52; Number of favourable outcomes: 26
26 1
P(red card) = = 0,5 = 50%
52 2
b) Number of possible outcomes: 52; Number of favourable outcomes: 13
13 1
P(heart) = = 0,25 = 25%
52 4
c) Number of possible outcomes: 52; Number of favourable outcomes: 1
1
P(Ace of hearts) =
52
= 0,019 (correct to three decimal places) = 2% (correct to nearest %)
6) a) There are 10 possible outcomes.
4 2
b) (i) P(red) = = 0,4 = 40%
10 5
6
(ii) P(white or blue) = = 0,6 = 60%
10
10
(iii) P(red or white or blue) = = 1 = 100%
10
0
(iv) P(green) = = 0 = 0%
10
7) a) There are 12 possible outcomes; Favourable outcomes: 43; 44; 45; 46; 47; 48; 49; 54
There are 8 favourable outcomes: 8
P(at least one 4)
8 2
у 0,667 (correct to three decimal places) у 67% (correct to nearest %)
12 3
133
C E N S U S @ S C H O O L
134
C E N S U S @ S C H O O L
3 4 5 6 7 8 9
4 5 6 7 8 9 10
5 6 7 8 9 10 11
6 7 8 9 10 11 12
b) (i) P(total of 10)
3 1
= = 0,083 (correct to 3 decimal places) = 8% (correct to nearest %)
36 12
(ii) P(greater than 10)
30 5
= = 0,833 (correct to three decimal places) = 83% (correct to nearest %)
36 6
(iii) P(less than 10)
3 1
= = 0,083 (correct to three decimal places = 8% (correct to nearest %)
36 12
c) The probability is equal to 1 because less than 10 and greater than 10 includes all the
possible outcomes.
135
C E N S U S @ S C H O O L
3) a)
Bag A Bag B Outcome
W Æ R; W
R W Æ R; W
R Æ R; R
W Æ R; W
R W Æ R; W
R Æ R; R
W Æ W; W
W W Æ W; W
R Æ W; R
4
b) P(RR or WW) = 0,444 (correct to four decimal places) = 44% (correct to nearest %)
9
4) a)
Stage 1 Stage 2 Outcome
Taxi Æ Lift; Taxi
Lift
Walk Æ Lift; Walk
Taxi Æ Bus; Taxi
Bus
Walk Æ Bus; Walk
Taxi Æ Train; Taxi
Train
Walk Æ Train; Walk
1
b) P(bus; taxi) = 0,167 (correct to four decimal places) = 17% (correct to nearest %)
6
5) a)
First Spin
R10 R5 R5 R50
R10 R20 R15 R15 R60
Second Spin
1
b) (i) P(R20) = = 0,0625 = 6% (correct to nearest %)
16
1
(ii) P(R100) = 0,0625 = 6%
16
2 1
(iii) P(R60) = = 0,125 = 12,5%
16 8
4 1
(iv) P(R15) = 0,25 = 25%
16 4
136
C E N S U S @ S C H O O L
6) a) TWO-WAY TABLE:
1st Spin
R G B Y W
R R; R G; R B; R Y; R W; R
2nd Spin
G R; G G; G B; G Y; G W; G
B R; B G; B B; B Y; B W; B
Y R; B G; Y B; Y Y; Y W; Y
W R; W G; W B; W Y; W W; W
TREE DIAGRAM:
1st Spin 2nd Spin Outcomes
R Æ RR
G Æ RG
R B Æ RB
Y Æ RY
W Æ RW
R Æ GR
G Æ GG
G B Æ GB
Y Æ GY
W Æ GW
R Æ BR
G Æ BG
B B Æ BB
Y Æ BY
W Æ BW
R Æ YR
G Æ YG
Y B Æ YB
Y Æ YY
W Æ YW
R Æ WR
G Æ WG
W B Æ WB
Y Æ WY
W Æ WW
1
b) (i) P(W; W) = 0,04 = 4%
25
(ii) The favourable outcomes here are: RW; GW; BW; YW; WR; WG; WB; WY; WW
9
P(at least one W) = 0,36 = 36%
25
(iii) Here the favourable outcomes are: RR; GG; BB; YY; WW.
5 1
P(both the same colour) = = 0,2 = 20%
25 5
7) a) R; B; Y R; Y; B B; R; Y B; Y; R Y; R; B Y; B; R
2 1
b) P(red is first) = у 0,333 (correct to three decimal places) у 33% (correct to nearest %)
6 3
137