© U Dinesh Kumar, IIM Bangalore
© U Dinesh Kumar, IIM Bangalore
Business Analytics
The Science of Data-Driven Decision Making
Second Edition
U. Dinesh Kumar
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability Theory - Terminologies
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Random Experiment
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Sample Space
• It is the universal set that consist of all possible
outcomes of an experiment.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Event
• Event(E) is a subset of a sample space and probability is
usually calculated with respect to an event.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability Estimation using Relative
Frequency
• The classical approach to probability estimation
of an event is based on the relative frequency of
the occurrence of that event
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example 3.1
A website displays 10 advertisements and the revenue generated
by the website depends on the number of visitors to the site
clicking on any of the advertisements displayed in the website.
The data collected by the company has revealed that out of 2500
visitors, 30 people clicked on 1 advertisement, 15 clicked on 2
advertisements, and 5 clicked on 3 advertisements. Remaining
did not click on any of the advertisements. Calculate
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Solution
(a) Number of customers clicking an advertisement
is 50 and the total number of visitors is 2500.
Thus, the probability that a visitor to the website
will click on an advertisement
50
0.02
is
2500
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Algebra of Events
• Assume that X, Y and Z are three events of a sample space. Then the
following algebraic relationships are valid and are useful while deriving
probabilities of events:
• Distributive rule: X (Y Z) = (X Y) (X Z)
X (Y Z) = (X Y) (X Z)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Contd…
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Axioms of Probability
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
The elementary rules of probability are directly deduced from the original three
axioms of probability, using the set theory relationships
1. For any event A, the probability of the complementary event, written AC, is given
by
P(A) = 1 – P(AC)
P( ) 0
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
3. If occurrence of an event A implies that an event B also
occurs, so that the event class A is a subset of event class B,
then the probability of A is less than or equal to the probability
of B:
P ( A) P ( B )
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Joint Probability
Number of observations in A B
P( A B)
Total number of observations
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example 3.2
At an e-commerce customer service centre a total
of 112 complaints were received. 78 customers
complained about late delivery of the items and 40
complained about poor product quality.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Solution to Example 3.2
• Let A = Late delivery and B = Poor quality of the
product. Let n(A) and n(B) be the number of
events in favour of A and B. So n(A) = 78 and
n(B) = 40. Since the total number of complaints
is 112, hence
n(A B) = 118 – 112 = 6
• Probability of a complaint about both delivery and
poor product quality is n(A B) 6
P(A B) 0.0535
Total number of complaints 112
78
• Probability
1 that
0the complaint is only about poor
.3035
112
quality = 1P(A) =
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• Marginal probability is simply a probability of an event X, denoted by P(X),
without any conditions
P( A B)
P( B | A) , P( A) 0
P( A)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Application of Simple Probability Rules in Analytics
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Association Rule Mining
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Association rule learning Example -
Binary representation of point of sale
data
Strawber Green
Transaction ID Apple Orange Grapes Plums Banana
ry Apple
1 1 1 1 0 1 1 1
2 0 1 0 0 0 1 1
3 0 0 0 0 0 1 1
4 1 0 0 0 1 0 0
5 1 0 0 0 1 1 1
6 0 1 1 0 0 0 1
7 0 1 1 0 0 0 1
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• In Table , transaction ID is the transaction reference number and apple,
orange, etc. are the different SKUs sold by the store. Binary code is used to
represent whether the SKU was purchased (equal to 1) or not (equal to 0)
during a transaction. The strength of association between two mutually
exclusive subsets can be measured using ‘support’, ‘confidence’, and ‘lift’
• Support between two sets (of products purchased) is calculated using the
joint probability of those events:
n( X Y )
Support P( X Y )
N
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Association Rule Leaning Cont…
P( X Y )
Confidence = P (Y | X )
P( X )
• Lift: The third measure in association rule mining is lift, which is given by
P( X Y )
Lift =
P ( X ) P (Y )
Association rules can be generated based on threshold values of support,
confidence and lift. For example, assume that the cut-off for support is 0.25
and confidence is 0.5 (Lift should be more than 1)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Bayes Theorem
• Bayes theorem is one of the most important concepts in analytics since
several problems are solved using Bayesian statistics
P( A B) P( A B)
P( A | B) and P( B | A)
P( B) P( A)
P( A | B) P( B)
P( B | A)
P( A)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Terminologies used to describe various
components in Bayes Theorem
1. P(B) is called the prior probability (estimate of the
probability without any additional information).
P( A | B) P( B)
P ( B | A)
P ( A)
2. P(B|A) is called the posterior probability (that is, given that
the event A has occurred, what is the probability of
occurrence of event B). That is, post the additional
information (or additional evidence) that A has
occurred, what is estimated probability of occurrence of B.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Monty Hall Problem
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Monty Hall Problem Using Bayes
Theorem
• Let C1, C2, and C3 be the events that the car is behind
door 1, 2, and 3, respectively. Let D1, D2, and D3 be the
events that Monty opens door 1, 2, and 3, respectively.
Prior probabilities of C1, C2, and C3 are
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• Using, Bayes theorem
P ( D2 | C1 ) P (C1 ) (1 / 2) (1 / 3)
P (C1 | D2 ) 1 / 3
P ( D2 ) (1 / 2)
• P(D2|C1) = 1(if the car is behind door 1, then Monty can open
2
either door 2 or 3)
1 1
P(D2) = 2 3
2
Note that P(C2|D2) = 0. Thus P(C3|D2) = 1 – P(C1|D2) = 1 – = 3
P(D2|C3) = 1 (if the car is behind door 3 and the player has
chosen door 1, Monty has to open door 2 with probability 1)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Using, Bayes theorem
P( D2 | C1 ) P(C1 ) (1/ 2) (1/ 3)
P(C1 | D2 ) 1/ 3
P( D2 ) (1/ 2)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example 3.4
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Solution to Example 3.4
• Let P(A), P(B), P(C) be events corresponding to the
black box being manufactured by companies A, B,
and C, respectively, and P(D) be the probability of
defective black box. We are interested in
calculating the probability
P( D | AP(A|D).
) P( A)
P( A | D)
P( D)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Random
Variables
• Random variable is a
function that maps every
outcome in the sample
space to a real number.
• Random variable is a
robust and convenient
way of representing the
outcome of a random
experiment
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Discrete Random Variables
• If the random variable X can assume only a finite or countably infinite set of
values, then it is called a discrete random variable.
• Examples of discrete random variables are:
– Credit rating (usually classified into different categories such as low,
medium and high or using labels such as AAA, AA, A, BBB, etc.).
– Number of orders received at an e-commerce retailer which can be
countably infinite.
– Customer churn (the random variables take binary values, 1. Churn and 2.
Do not churn).
– Fraud (the random variables take binary values, 1. Fraudulent transaction
and 2. Genuine transaction).
– Any experiment that involves counting (for example, number of returns in
a day from customers of e-commerce portals such as Amazon, Flipkart;
number of customers not accepting job offers from an organization).
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability mass function
• For a discrete random variable,
the probability that a random
variable X taking a specific value
xi, P(X = xi), is called the
probability mass function P(xi).
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Expected Value
• Expected value (or mean) of a discrete
random variable is given by
n
E ( X ) xi P ( xi )
i 1
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Variance and Standard Deviation
i 1
VAR ( X )
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability Density Function (pdf)
• The probability density function, f(xi), is
defined as probability that the value of
random variable X lies between an
infinitesimally small interval defined by xi
and xi + x
P( xi X xi x)
f ( x) lim
x 0 x
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Cumulative Distribution Function (CDF)
• The cumulative distribution function
(CDF) of a continuous random variable is
defined by a
F (a) P( X a) f ( x)dx
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability density The probability between
function and two values a and b, P(a X
cumulative distribution b), is the area between
function of a the values a and b under
continuous random the probability density
variable satisfy the function
following properties
f(x)
0
F () f ( x ) dx 1
b
P(a X b) f ( x)dx F (b) F (a)
a
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• The expected value of a continuous
random variable, E(X), is given by
E ( X ) xf ( x) dx
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Binomial Distribution
• A random variable X is said to follow a
Binomial distribution when
– The random variable can have only two
outcomes success and failure (also known as
Bernoulli trials).
– The objective is to find the probability of
getting k successes out of n trials.
– The probability of success is p and thus the
probability of failure is (1 p).
– The probability p is constant and does not
change between trials.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability Mass Function (PMF) of Binomial
Distribution
• The PMF of the Binomial distribution (probability
that the number of success will be exactly x out
of n trials) is given by
n x n x
PMF ( x) P ( X x) p (1 p ) , 0 x n
x
n n!
x x!( n x )!
Where
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Mean and Variance of Binomial
Distribution
The Mean of a binomial distribution is given by:
n
n x n
Mean E ( X ) x PMF( x) x p (1 p ) n x np
x 0 x 0 x
n n
n x
Var( X ) ( x E ( X )) PMF( x) ( x E ( X )) p (1 p) n x np(1 p)
2 2
x 0 x 0 x
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example 3.5
5 20
(c) Probability that more than 5 customers
P ( X 5) 1 P ( X 5) 1 ) 20 k return
(0.1) k (0.9will 1 0.9887the product
0.0113 is
k 0 k
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• The mean and variance of a Poisson random variable are given by E ( X )
and Var( X )
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example
On average, about 20 customers per day cancel their order
placed at Fashion Trends Online. Calculate the probability that
the number of cancellations on a day is exactly 20 and the
probability that the maximum number of cancellations is 25
Solution
The probability that the number of cancellations is exactly 20
is given by e 20 2020
P ( X 20) 0.0888
20!
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Geometric Distribution
• Geometric distribution represents a random experiment in
which the random variable predicts the number of failures
before the success
) P ( X xfunction
F ( xdistribution
• The cumulative ) 1 (1is p ) x by:
given
1
E( X )
p
• Mean and variance of a geometric distribution are given by
(1 p )
Var( X )
p2
and
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability mass function of a geometric Cumulative distribution function of a
distribution (p = 0.3). geometric distribution (p = 0.3).
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Memoryless Property of Geometric
Distribution
• Memoryless property is a special property of a geometric
distribution in which the conditional
P( X i j | X probability,
i),
depends only on the value j, not on the value i. We know
that i i
P( X i) 1 P( X i) 1 [1 (1 p) ] (1 p)
P( X i j X i) P( X i j ) (1 p)i j j
P( X i j | X i) (1 p )
P( X i) P( X i) (1 p)i
P ( X j ) (1 p ) j P( X i j | X i) P( X j ).
• Note that, Thus,
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example
Local Dhaniawala (LD) is an online grocery store and has
an innovative feature which predicts whether the
customer has forgotten to buy an item which is very
common among customers of grocery items. The
probability that a customer buys milk in each shopping
visit is 0.2.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Solution
(a) Probability that the customer’s first purchase of
milk happens on 5th trip is given by
P( X 5) (1 0.2) 4 0.2 0.08192
(b)The average time between purchase of milk is
1 1
E( X ) 5
p 0 .2
(c) Given that a customer has not purchased milk
for the past 3 shopping visits, the probability that
the customer will not buy for another 2 visits is
given by
P( X 3 2 | X 3) P( X 2) (1 p) 2 (1 0.2) 2 0.64
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Parameters of Continuous Distributions
• Scale parameter: Scale parameter defines the
range of the continuous distribution. The larger
the scale parameter value, larger is the spread of
the distribution.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Uniform Distribution
Cumulative distribution
Probability density function functions
0, xa
1 x a
, x [ a, b]
f ( x) b a F ( x ) , a x b
b a
0, otherwise 1,
x b
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Exponential Distribution
• Exponential distribution is a single parameter continuous
distribution that is traditionally used for modelling time to
failure of electronic components
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability density function of an
exponential distribution
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Memoryless Property of Exponential
Distribution
• Exponential distribution is the only continuous
probability distribution that has the memoryless
property. That is ,
P ( X t s | X t ) P ( X s )
P ( X t s X t ) P ( X t s ) e (t s ) s
P( X t s | X t ) t e
P( X t ) P( X t ) e
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example
The time to failure of an avionic system follows an
exponential distribution with a mean time between
failures (MTBF) of 1000 hours.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Solution
(a) The probability that the system will fail by 1000
hours F
is(1000) 1 et 1
1000
1 / 1000 , t 1000 F (1000) 1 e 1000 1 e 1 0.6321
In this case so ,
1
(b) The probability that the system will not
P( X 2000) 1 P ( X 2000) 1 F (t ) e t e
fail up to
1000
0.1353
2000
e 2
2000 hours is
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Normal Distribution
• Normal distribution, also known as Gaussian
distribution, is one of the most popular
continuous distribution in the field of analytics
especially due to its use in multiple contexts
• The probability density function and the
cumulative distribution 1 x function are given by
2
1 2
f ( x) e , x
2
2
x 1 t
1
F ( x) e 2
dt , x
2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Properties of Normal Distribution
1. Theoretical normal density functions are defined
between and +.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
4. For any normal distribution, the areas between
specific values measured in terms of and are
given by:Value of Random Variable Area under the Normal Distribution (CDF)
variance
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Standard Normal Variable
x2
z 1
F ( z) e 2 dz
2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• By using the following transformation, any normal random
variable X can be converted into a standard normal variable
X
Z
• The random variable X can be written in the form of a
standard normal random variable using the relationship
X=+Z
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• A simple approximation of standard normal CDF is
given by Tocher (1963)
e2 kz
P ( Z z ) F ( z )
1 e2 kz
where k 2/
2 z2 / 2
z A z A
P( Z z ) F ( z ) 1 1 2 e
2 z 3 B z 2 B z 2 A
1 2 2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Example
According to a survey on use of smart phones in
India, the smart phone users spend 68 minutes in a
day on average in sending messages and the
corresponding standard deviation is 12 minutes.
Assume that the time spent in sending messages
follows a normal distribution.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
(b) Proportion of customers spending less than 20
minutes is
P(X 20) = F(20)
Using Excel function, we have Normdist(20, 68, 12,
true) = 3.1671 × 105
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Chi-Square Distribution
• Chi-square distribution with k degrees of freedom
[denoted as 2(k) distribution] is a non-parametric
distribution which is obtained by adding square of
k independent standard normal random variables.
• Consider a normal random variable X1 with mean 1 and
standard deviation 1. Then we can define Z1 (the standard
normal random variable) as
X 1
Z1 1
1
• Then,
2
X 1
Z12 1
1
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• Let X2 be a normal random variable with mean 2 and
standard deviation 2 and Z2 is the corresponding standard
2 2
1 Z2
normal Zvariable. Then the random variable
given by 2 2
X 1 X 2
Z12 Z 22 1 2
1 2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
The probability density function of 2(k) is given by
k x
1 1
f ( x) k 2 x2 e 2
2 ( k 2)
( k ) x k 1
e x
dx
0
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• The cumulative distribution function of a chi-
square distribution with k degrees of freedom is
given by
k x
,
2 2
F ( x)
k
2
k x
,
2 2
• Where is the lower incomplete Gamma
function. It is given by x
(k , x) t k 1e t dt
0
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Cumulative distribution of
Probability density function
chi-square distribution with
of chi-square distribution for
k degrees of freedom
different values of k
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Properties of chi-square distribution
• The mean and standard deviation of a chi-square
2k are k and
distribution where k is the degrees
of freedom
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Student’s t-Distribution
• Student’s t-distribution (or simply t-distribution)
arises while estimating the population mean of a
normal distribution using sample which is either
small and/or the population standard deviation is
unknown
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• Assume that X1, X2, …, Xn are n observations (that is,
sample of size n) from a normal distribution with mean
X
and standard deviation . Let
n
Xi
i1
2
1 n
S X i X
n 1 i 1
X
• where and S are mean and standard deviation estimated
from the sample X1, X2, …, Xn. Then the random variable t
defined by
X
t
S/ n
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
• The probability density function of t-distribution
with n degrees of freedom is given by
n 1
n 1
2 2
2 x
f ( x) 1
n
n
n
2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Cumulative distribution function of student’s t-
distribution
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Properties of t-distribution:
• The mean of a t distribution with 2 or more degrees of
freedom is 0.
• The standard deviation of t-distribution
n
n 2
is for n >
2, where n is the number of degrees of freedom.
• As the degrees of freedom n increases the probability
density function of a t-distribution approaches the
density function of standard normal distribution. For n
> 120, the difference between the area under
probability density function of a t-distribution is very
close to the area under a standard normal distribution.
• t-distribution is an important distribution for
hypothesis testing of means of a population and for
comparing means of two populations.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
F-Distribution
F-distribution (short form of Fisher’s distribution named after
statistician Ronald Fisher) is a ratio of two chi-square
distributions. Let Y1 and Y2 be two independent chi-square
distributions with k1 and k2 degrees of freedom, respectively.
Then the random variable X Y is1defined
/ k1 as
X
Y2 / k 2
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Probability density Cumulative density
function of F-distribution function of F-distribution
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Properties of F distribution:
k
• Mean of F-distribution is
2 ,
k2 2
for k2 > 2.
2k 22 is
• Standard deviation of F-distribution ( k1 k 2 2)
for k2 > 4. k1 ( k 2 2) 2 ( k 2 4)
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making
Summary
• The concept of probability, random variables and
probability distributions are foundations of data science.
Knowledge of these concepts is important for framing and
solving analytics problems.
©Analytics
Business U. Dinesh Kumar,
– The Science IIMDriven
of Data Bangalore
Decision Making