100% found this document useful (1 vote)
121 views70 pages

CH 02

Statistics 01

Uploaded by

Kazi Shahed
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
121 views70 pages

CH 02

Statistics 01

Uploaded by

Kazi Shahed
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 70

CHAPTER 2

ORGANIZING AND
GRAPHING DATA

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
RAW DATA
Definition
Data recorded in the sequence in which
they are collected and before they are
processed or ranked are called raw
data.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.1 Ages of 50 Students

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.2 Status of 50 Students

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
ORGANIZING AND GRAPHING QUANTITATIVE
DATA

 Frequency Distributions
 Relative Frequency and Percentage
Distributions
 Graphical Presentation of Qualitative Data

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
TABLE 2.3 Types of Employment Students
Intend to Engage In

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Frequency Distributions
Definition
A frequency distribution for
qualitative data lists all categories and
the number of elements that belong to
each of the categories.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-1
A sample of 30 employees from large
companies was selected, and these
employees were asked how stressful their
jobs were. The responses of these
employees are recorded below, where
very represents very stressful, somewhat
means somewhat stressful, and none
stands for not stressful at all.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-1
somewhat none somewhat very very none
very somewhat somewhat very somewhat somewhat
very somewhat none very none somewhat

somewhat very somewhat somewhat very none


somewhat very very somewhat none somewhat

Construct a frequency distribution table for these


data.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-1: Solution
Table 2.4 Frequency Distribution of Stress on Job

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Relative Frequency and Percentage Distributions
Calculating Relative Frequency of a
Category

Frequency of that category


Re lative frequency of a category 
Sum of all frequencie s

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Relative Frequency and Percentage Distributions
Calculating Percentage
Percentage = (Relative frequency) · 100

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-2
Determine the relative frequency and
percentage for the data in Table 2.4.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-2: Solution
Table 2.5 Relative Frequency and Percentage
Distributions of Stress on Job

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Graphical Presentation of Qualitative Data
Definition
A graph made of bars whose heights
represent the frequencies of respective
categories is called a bar graph.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.1 Bar graph for the frequency distribution
of Table 2.4

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Graphical Presentation of Qualitative Data
Definition
A circle divided into portions that
represent the relative frequencies or
percentages of a population or a sample
belonging to different categories is called
a pie chart.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.6 Calculating Angle Sizes for the Pie Chart

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.2 Pie chart for the percentage distribution
of Table 2.5.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
ORGANIZING AND GRAPHING QUANTITATIVE
DATA

 Frequency Distributions
 Constructing Frequency Distribution
Tables
 Relative and Percentage Distributions
 Graphing Grouped Data

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.7 Weekly Earnings of 100 Employees of a
Company

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Frequency Distributions
Definition
A frequency distribution for
quantitative data lists all the classes
and the number of values that belong to
each class. Data presented in the form of
a frequency distribution are called
grouped data.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Frequency Distributions
Definition
The class boundary is given by the
midpoint of the upper limit of one class
and the lower limit of the next class.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Frequency Distributions
Finding Class Width

Class width = Upper boundary – Lower boundary

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Frequency Distributions
Calculating Class Midpoint or Mark

Lower limit  Upperlimit


Class midpoint or mark 
2

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Constructing Frequency Distribution Tables
Calculation of Class Width

Largest value - Smallest v alue


Approximate class width 
Number of classes

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.8 Class Boundaries, Class Widths, and
Class Midpoints for Table 2.7

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-3
The following data give the total number
of iPods® sold by a mail order company
on each of 30 days. Construct a
frequency distribution table.

8 25 11 15 29 22 10 5 17 21

22 13 26 16 18 12 9 26 20 16

23 14 19 23 20 16 27 16 21 14

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-3: Solution
The minimum value is 5, and the maximum value is 29.
Suppose we decide to group these data using five classes
of equal width. Then
29  5
Approximate width of each class   4.8
5
Now we round this approximate width to a convenient
number, say 5. The lower limit of the first class can be
taken as 5 or any number less than 5. Suppose we take 5
as the lower limit of the first class. Then our classes will be
5 – 9, 10 – 14, 15 – 19, 20 – 24, and 25 – 29

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.9 Frequency Distribution for the Data on
iPods Sold

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Relative Frequency and Percentage Distributions
Calculating Relative Frequency and Percentage

Frequency of that class f


Relative frequency of a class  
Sum of all frequencie s f
Percentage  (Relative frequency)  100
Example 2-4
Calculate the relative frequencies and
percentages for Table 2.9.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-4: Solution
Table 2.10 Relative Frequency and Percentage
Distributions for Table 2.9

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Graphing Grouped Data
Definition
A histogram is a graph in which classes
are marked on the horizontal axis and the
frequencies, relative frequencies, or
percentages are marked on the vertical
axis. The frequencies, relative
frequencies, or percentages are
represented by the heights of the bars. In
a histogram, the bars are drawn adjacent
to each other.
Prem Mann, Introductory Statistics, 7/E
Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.3 Frequency histogram for Table 2.9.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.4 Relative frequency histogram for Table
2.10.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Graphing Grouped Data
Definition
A graph formed by joining the midpoints
of the tops of successive bars in a
histogram with straight lines is called a
polygon.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.5 Frequency polygon for Table 2.9.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.6 Frequency distribution curve.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-6
The administration in a large city wanted to
know the distribution of vehicles owned by
households in that city. A sample of 40
randomly selected households from this city
produced the following data on the number of
vehicles owned:
5 1 1 2 0 1 1 2 1 1
1 3 3 0 2 5 1 2 3 4
2 1 2 2 1 2 2 1 1 1
4 2 1 1 2 1 1 4 1 3
Construct a frequency distribution table for
these data, and draw a bar graph.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-6: Solution
Table 2.13 Frequency Distribution of Vehicles
Owned
The observations assume only
six distinct values: 0, 1, 2, 3, 4,
and 5. Each of these six values
is used as a class in the
frequency distribution in Table
2.13.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.7 Bar graph for Table 2.13.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
SHAPES OF HISTOGRAMS
1. Symmetric
2. Skewed
3. Uniform or Rectangular

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.8 Symmetric histograms.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.9 (a) A histogram skewed to the right. (b)
A histogram skewed to the left.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.10 A histogram with uniform distribution.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.11 (a) and (b) Symmetric frequency curves. (c)
Frequency curve skewed to the right. (d) Frequency curve
skewed to the left.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
CUMULATIVE FREQUENCY DISTRIBUTIONS

Definition
A cumulative frequency distribution gives
the total number of values that fall below the
upper boundary of each class.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-7

Using the frequency distribution of Table


2.9, reproduced here, prepare a cumulative
frequency distribution for the number of
iPods sold by that company.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-7: Solution
Table 2.14 Cumulative Frequency Distribution of
iPods Sold

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
CUMULATIVE FREQUENCY DISTRIBUTIONS

Calculating Cumulative Relative Frequency


and Cumulative Percentage
Cumulative frequency of a class
Cumulative relative frequency 
Total observations in the data set

Cumulative percentage  (Cumulative relative frequency)  100

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.15 Cumulative Relative Frequency and
Cumulative Percentage Distributions for iPods Sold

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
CUMULATIVE FREQUENCY DISTRIBUTIONS

Definition
An ogive is a curve drawn for the
cumulative frequency distribution by joining
with straight lines the dots marked above
the upper boundaries of classes at heights
equal to the cumulative frequencies of
respective classes.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.12 Ogive for the cumulative frequency
distribution of Table 2.14.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
STEM-AND-LEAF DISPLAYS
Definition
In a stem-and-leaf display of quantitative
data, each value is divided into two portions
– a stem and a leaf. The leaves for each
stem are shown separately in a display.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-8
The following are the scores of 30 college
students on a statistics test:
75 52 80 96 65 79 71 87 93 95
69 72 81 61 76 86 79 68 50 92
83 84 77 64 71 87 72 92 57 98
Construct a stem-and-leaf display.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-8: Solution

To construct a stem-and-leaf display for


these scores, we split each score into two
parts. The first part contains the first digit,
which is called the stem. The second part
contains the second digit, which is called the
leaf. We observe from the data that the
stems for all scores are 5, 6, 7, 8, and 9
because all the scores lie in the range 50 to
98.
Prem Mann, Introductory Statistics, 7/E
Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.13 Stem-and-leaf display.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-8: Solution

After we have listed the stems, we read the


leaves for all scores and record them next
to the corresponding stems on the right
side of the vertical line. The complete stem-
and-leaf display for scores is shown in
Figure 2.14.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.14 Stem-and-leaf display of test scores.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-8: Solution

The leaves for each stem of the stem-and-


leaf display of Figure 2.14 are ranked (in
increasing order) and presented in Figure
2.15.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Figure 2.15 Ranked stem-and-leaf display of test
scores.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-9

The following data are monthly rents paid by


a sample of 30 households selected from a
small city.
880 1081 721 1075 1023 775 1235 750 965 960
1210 985 1231 932 850 825 1000 915 1191 1035
1151 630 1175 952 1100 1140 750 1140 1370 1280

Construct a stem-and-leaf display for these


data.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-9: Solution

Figure 2.16 Stem-


and-leaf display of
rents.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-10
The following stem-and-leaf
display is prepared for the
number of hours that 25
students spent working on
computers during the last
month.

Prepare a new stem-and-leaf display by


grouping the stems.
Prem Mann, Introductory Statistics, 7/E
Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-10: Solution

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
DOTPLOTS
Definition
Values that are very small or very large
relative to the majority of the values in a
data set are called outliers or extreme
values.
Example 2-11
Table 2.16 lists the lengths of the longest
field goals (in yards) made by all kickers in
the American Football Conference (AFC) of
the National Football League (NFL) during
the 2008 season. Create a dotplot for these
data.

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Table 2.16 Distances of Longest Field Goals (in Yards)
Made by AFC Kickers During the 2008 NFL Season

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved
Example 2-11: Solution
Step1

Step 2

Prem Mann, Introductory Statistics, 7/E


Copyright © 2010 John Wiley & Sons. All right reserved

You might also like