Statistics and Data Analysis Guide

The document defines key statistical concepts such as population, sample, quantitative vs qualitative data, discrete vs continuous data, and different sampling methods. It then discusses measures of central tendency like median, quartiles, and interquartile range. Additional concepts covered include standard deviation, variance, different types of distributions, skewness, bivariate distributions, percentiles, deciles, quartiles, and time series analysis methods.

Uploaded by

waheed.abdulr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views3 pages

Statistics and Data Analysis Guide

Uploaded by

waheed.abdulr

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

 Population – Everyone or everything in a study

o Population mean is a parameter

 Sample – small subset/portion of the population
o Sample mean is a statistic
 For example choosing 100 individuals randomly out of 100,000 people
 Quantitative data involves numbers while Quantitive data is descriptive data in form of words
 Discrete data
o Data that can only take on certain values
o Associated with counting
 Continuous Data
o Data that can take on any value
o Associated with measurement
 A stratified random sample aims to represent each group or stratum in the population fairly
o The size of each group in the sample should be proportional to the size of each group in
the population
o Sample members are selected at random from every group in the population
 Quota sampling
o Similar to stratified random only difference is interviewer actually chooses people from
groups

 Median of a dataset is Q2,

 Q1 and Q3 are the medians of start of data till Q2 and median of Q2 till end of data respectively
 Q1 and Q3 come at 25 percent and 75 percent frequency mark in cumulative frequency table
 Interquartile range/IQR = Q3-Q1
 To find whether a point is anomalous, it must lie outside the range [Q1 - 1.5 IQR, Q3 + 1.5 IQR]
 Mean of grouped data set = ∑midpoint * frequency/∑frequency
 Weighted mean = ∑wx / ∑w where w is the weight of each value
 Standard deviation = sqrt( ∑(element – mean)2 / n)
 Variance
o Measure of spread of data—tells you how far your data is spread from the mean
o V = ∑(element – mean)2 / n

 Relative frequency distribution

o f / ∑f = RFD value
 Cumulative relative frequency
o Same as cumulative frequency but deals with relative frequency values
 Frequency density = Frequency/Class Width

 Dot plot
o Data set values on horizontal line and associated frequency as number of dots above
value
o Value with most dots is mode
o Values are in ascending order and have equal
divisions
 Box and Whisker plot
 Stem and Leaf plot
o Always helpful to have a key since conversion can
be different
o Can represent integers and floats
o Stem goes from 1,2,3,4 all the way till n
o Stem value corresponds to digit of highest house in dataset value
 Creating Histogram for ungrouped data:
o Make classes depending on range of data and then make a frequency distribution table
o Plot histogram
 Frequency polygon:
o Need classes, their associated frequency as well as the midpoints of each class
o When plotting, we need a value that is below our lowest midpoint value and a value
higher than our greatest midpoint value
o Coordinates of points will be (midpoint , associated frequency)
o Plot the graph and connect the dots to form a frequency polygon\
 Skewness
o If a frequency distribution is symmetric and unimodal, then median, mode and mean
are equal to one another
o If data is skewed to the right, then:
 mean is greater than median
 Mode is less than median
 Q3-Q2 > Q2-Q1 or max-Q3 > Q1-min
o Vice Versa for skewed to left
 Bivariate Distribution
o Explanatory Variable on X axis and Response variable on y axis
o Graph is usually scatterplot
o Can be described by Direction (Positive/Negative), Strength (Strong/Weak) and Form
(Linear/Non Linear)
 If graph has upward trend, then it is positive and vice versa
 If values are close to each other and spread out very little then it’s a strong
correlation – this only works when form is linear though

 Percentile
o Dividing Data in 100 equal portions
o 70th percentile for example means the data point where 70 percent of data is less than
the value of the data point and 30 percent is greater than it
oLk = (k/100) (n+1) where Lk indicates position of element from beginning of dataset for
kth percentile
o When Lk is a decimal, (for example 4.5)look at both the upper and lower position
elements and take their average (in this instance, take average of 4th and 5th element to
find kth percentile
o To find the percentile of an integer in the dataset:
 use the formula (x + 0.5y) * 100 / n = P
 x = number of data points less than the chosen value
 y= frequency of chosen value
 n= number of data points
o Round of answer to nearest integer to find the percentile of the number
 Decile and Quartile
o Dividing Data in 10 equal Portions = Decile
o Dividing Data in 4 portions = Quartile
o Q1=P25
o Dn= corresponding value of Cumulative Relative Frequency = 0.1 * n
 Where Dn is nth Decile
o If CRF =0.01 * n falls exactly between two distinct data points, then Pn will be the
average of those data points

Time Series

 Method of semi averages

o Divide data set into 2(or any number) portions of equal quantity, take average of all data
points in each portion to reduce it to 2 single data points.
o Draw a linear graph connecting both points and extrapolate it to find future value
o If data set contains uneven number of elements, then ignore middle element
 Moving averages/trend

Data Management
No ratings yet
Data Management
43 pages
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
No ratings yet
Difference Between (Median, Mean, Mode, Range, Midrange) (Descriptive Statistics)
11 pages
Research
No ratings yet
Research
9 pages
Data Management (1) (1) - Compressed
No ratings yet
Data Management (1) (1) - Compressed
46 pages
RESEARCH
No ratings yet
RESEARCH
9 pages
QUALITATIVE DATA Are Measurements For Which There Is No Natural
No ratings yet
QUALITATIVE DATA Are Measurements For Which There Is No Natural
9 pages
Elementary Statistics and Probability Chapter 1 3
No ratings yet
Elementary Statistics and Probability Chapter 1 3
5 pages
1 Review of Statistics
No ratings yet
1 Review of Statistics
24 pages
Business Statistics
No ratings yet
Business Statistics
106 pages
Introduction to Statistics
No ratings yet
Introduction to Statistics
43 pages
Unit 12 - Averages - Measures of Speed
No ratings yet
Unit 12 - Averages - Measures of Speed
4 pages
Note 02
No ratings yet
Note 02
31 pages
MMW Midterm Reviewer
No ratings yet
MMW Midterm Reviewer
6 pages
Data Management (1)
No ratings yet
Data Management (1)
46 pages
COMM 191 Reviewer
No ratings yet
COMM 191 Reviewer
17 pages
Statistics Midterm Review
No ratings yet
Statistics Midterm Review
21 pages
Intro to Mathematical Statistics
No ratings yet
Intro to Mathematical Statistics
42 pages
Descriptive Statistics Week 2: L2 - Graphical Display of Data
No ratings yet
Descriptive Statistics Week 2: L2 - Graphical Display of Data
22 pages
Eng 2015 Prelims Reviewer
No ratings yet
Eng 2015 Prelims Reviewer
11 pages
Statistics Maths Clinic Gr12 Eng
No ratings yet
Statistics Maths Clinic Gr12 Eng
6 pages
Random Variables & Sampling Methods
No ratings yet
Random Variables & Sampling Methods
6 pages
Comm 215.MidtermReview
No ratings yet
Comm 215.MidtermReview
71 pages
Spring Semester, 2020-2021
No ratings yet
Spring Semester, 2020-2021
40 pages
Illustrate Measures of Position
No ratings yet
Illustrate Measures of Position
3 pages
Ap Stats Cram Sheet: Symmetric - When The Left Half Is
No ratings yet
Ap Stats Cram Sheet: Symmetric - When The Left Half Is
7 pages
Chapter 3: Statistics
No ratings yet
Chapter 3: Statistics
3 pages
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
100% (1)
Lesson 1: Engineering Data Analysis First Semester - A.Y. 2021 - 2022
4 pages
Notes Stat
No ratings yet
Notes Stat
6 pages
Bustat Reviewer
No ratings yet
Bustat Reviewer
6 pages
ADS PRINT Ans
No ratings yet
ADS PRINT Ans
4 pages
Statistics for Students
100% (2)
Statistics for Students
25 pages
Probability and Statistics Lectures (Pre Sessional I)
No ratings yet
Probability and Statistics Lectures (Pre Sessional I)
10 pages
Tutoring Session 2023 - Statistics For Business
No ratings yet
Tutoring Session 2023 - Statistics For Business
65 pages
Inferential Statistics Course
No ratings yet
Inferential Statistics Course
46 pages
Intro to Statistics Basics
No ratings yet
Intro to Statistics Basics
53 pages
CAIE A2 Paper 3 Maths
No ratings yet
CAIE A2 Paper 3 Maths
48 pages
Introduction Book 1
No ratings yet
Introduction Book 1
41 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
19 pages
STATS
No ratings yet
STATS
3 pages
Statistics Summary
No ratings yet
Statistics Summary
6 pages
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
No ratings yet
Descriptive Statistics Summary (Session 1-5) : Types of Data - Two Types
4 pages
QM Statistic Notes
No ratings yet
QM Statistic Notes
24 pages
Pointers To Review Statistics
No ratings yet
Pointers To Review Statistics
6 pages
Probability+&+Statistics Formulas
No ratings yet
Probability+&+Statistics Formulas
47 pages
C3 Comm213
No ratings yet
C3 Comm213
6 pages
Math Notes Module 4A
No ratings yet
Math Notes Module 4A
4 pages
Chapter 1
No ratings yet
Chapter 1
23 pages
Ch01 Intro Stat&DataAnalysis
No ratings yet
Ch01 Intro Stat&DataAnalysis
106 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
39 pages
AI Workbook Statistics Applications and Interpretation SL - Study Guide - Alex Barancova - IB Academy 2020 (Ib - Academy) Pages 39-66
No ratings yet
AI Workbook Statistics Applications and Interpretation SL - Study Guide - Alex Barancova - IB Academy 2020 (Ib - Academy) Pages 39-66
28 pages
Statistics Basics for Students
No ratings yet
Statistics Basics for Students
46 pages
Chapter 1 Mathematics
No ratings yet
Chapter 1 Mathematics
2 pages
Ap Stat Exam Rev ch1-13
No ratings yet
Ap Stat Exam Rev ch1-13
120 pages
Da Notes
No ratings yet
Da Notes
4 pages
Bio Statistics
No ratings yet
Bio Statistics
217 pages
Elementary Statisctics Reviewer
No ratings yet
Elementary Statisctics Reviewer
5 pages
5 - Data Summaries and Visualization
No ratings yet
5 - Data Summaries and Visualization
97 pages
Midterms Stats
No ratings yet
Midterms Stats
8 pages
Statistics For Economists-2
No ratings yet
Statistics For Economists-2
31 pages
CS Final Project Game Overview
No ratings yet
CS Final Project Game Overview
2 pages
Elsewhere Unit 2 Exercise 2
No ratings yet
Elsewhere Unit 2 Exercise 2
2 pages
Unit 1 Exercise 2 Elsewhere
No ratings yet
Unit 1 Exercise 2 Elsewhere
2 pages
Mercury Plan
No ratings yet
Mercury Plan
3 pages
Cambridge International AS & A Level: Chemistry
No ratings yet
Cambridge International AS & A Level: Chemistry
17 pages
Real Time Question
No ratings yet
Real Time Question
3 pages
Courtship, Dating, Marriage Guide
No ratings yet
Courtship, Dating, Marriage Guide
5 pages
Thesis Topics in Public Administration in Nigeria
100% (3)
Thesis Topics in Public Administration in Nigeria
9 pages
DLL-arts 4thQ
No ratings yet
DLL-arts 4thQ
4 pages
Paper 5 Essentials Guideline
No ratings yet
Paper 5 Essentials Guideline
5 pages
BPG The Grand Warren 20210518
No ratings yet
BPG The Grand Warren 20210518
2 pages
Cartography PDF
No ratings yet
Cartography PDF
7 pages
Airiver Waterpower 30 HC Instruction Manual - NEW
No ratings yet
Airiver Waterpower 30 HC Instruction Manual - NEW
27 pages
Haffmans CPT: CO Purity Tester
No ratings yet
Haffmans CPT: CO Purity Tester
2 pages
Lecture 3 Chapter 3 ANOVA
No ratings yet
Lecture 3 Chapter 3 ANOVA
58 pages
PK3 2nded ReferenceManual v1.1 Virtual
No ratings yet
PK3 2nded ReferenceManual v1.1 Virtual
70 pages
Semi Detailed Lesson Plan in Science 4
No ratings yet
Semi Detailed Lesson Plan in Science 4
6 pages
Bss Preliminary Assessment 2024-25
No ratings yet
Bss Preliminary Assessment 2024-25
8 pages
Philippine Parental Involvement in Schools
100% (1)
Philippine Parental Involvement in Schools
2 pages
Appc 1.8 Packet
No ratings yet
Appc 1.8 Packet
5 pages
Grade 7 School Register 2021-2022
No ratings yet
Grade 7 School Register 2021-2022
8 pages
Identifying Good Practice: A Survey of College Provision in Leisure, Travel and Tourism
No ratings yet
Identifying Good Practice: A Survey of College Provision in Leisure, Travel and Tourism
25 pages
Details of Programmes
No ratings yet
Details of Programmes
98 pages
Nelson
No ratings yet
Nelson
1 page
Pit-Stop: Bernie Sander
No ratings yet
Pit-Stop: Bernie Sander
19 pages
Organizational Culture Insights
100% (5)
Organizational Culture Insights
49 pages
Two Hemispheres of The Brain
No ratings yet
Two Hemispheres of The Brain
11 pages
Architecture For The New Nation
No ratings yet
Architecture For The New Nation
1 page
Ethics History Theory and Contemporary Issues 7th Edition Unlocked Test Bank
No ratings yet
Ethics History Theory and Contemporary Issues 7th Edition Unlocked Test Bank
318 pages
Calculation Software A6V13696649 - en
No ratings yet
Calculation Software A6V13696649 - en
63 pages
Relationship Science & Love Styles
No ratings yet
Relationship Science & Love Styles
20 pages
Chương 5 - Đánh Giá R I Ro - Safety Risk Assessments - Training Material
No ratings yet
Chương 5 - Đánh Giá R I Ro - Safety Risk Assessments - Training Material
31 pages
Hyperplast PC339 - 0
No ratings yet
Hyperplast PC339 - 0
2 pages
Source Inspection Complete Setup Process in S4 Han...
No ratings yet
Source Inspection Complete Setup Process in S4 Han...
3 pages
Oil Industry Risk Assessment Techniques
No ratings yet
Oil Industry Risk Assessment Techniques
56 pages
Marketing VIII
No ratings yet
Marketing VIII
30 pages

Statistics and Data Analysis Guide

Uploaded by

Statistics and Data Analysis Guide

Uploaded by

 Population – Everyone or everything in a study

o Population mean is a parameter

 Median of a dataset is Q2,

 Relative frequency distribution

 Method of semi averages

You might also like