Approximate Inference via Sampling (1)
CS698X: Topics in Probabilistic Modeling and Inference
Piyush Rai
Plan
▪ Sampling to approximate distributions
▪ Basic sampling methods
▪ Markov Chain Monte Carlo (MCMC)
Sampling for Approximate Inference
▪ Some typical tasks that we have to solve in probabilistic/fully-Bayesian inference
  ▪ Posterior distribution
  ▪ Posterior predictive distribution
  ▪ Marginal likelihood (needed for model selection, and in computing the posterior too)
  ▪ Expected complete-data log-likelihood (needed in EM)
  ▪ Evidence lower bound (ELBO) (needed in VI)
▪ Sampling methods provide a general way to (approximately) solve these problems
Approximating a Prob. Distribution using Samples
▪ Can approximate any distribution using a set of randomly drawn samples from it
▪ The samples can be thought of as a histogram-based approximation of the distribution: the height of each bar denotes how many times that location was sampled and, given large enough samples, is proportional to the probability density 𝑝(𝑧) at that location
▪ The samples can also be used for computing expectations (Monte-Carlo averaging)
▪ Usually straightforward to generate samples if it is a simple/standard distribution
▪ The interesting bit: Even if the distribution is “difficult” (e.g., an intractable posterior), it
is often possible to generate random samples from such a distribution, as we will see.
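To make this concrete, here is a small Python sketch (my illustration, not from the slides); the Gamma(3, 1) target and the sample sizes are arbitrary choices:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
L = 100_000                                   # number of samples
z = rng.gamma(shape=3.0, scale=1.0, size=L)   # samples from p(z) = Gamma(3, 1)

# Histogram-based approximation: normalized bar heights track the density p(z)
heights, edges = np.histogram(z, bins=50, density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
print(np.max(np.abs(heights - stats.gamma.pdf(centers, a=3.0))))  # small

# The same samples give Monte Carlo averages, e.g. E[z] = 3 and E[z^2] = 12
print(z.mean(), (z ** 2).mean())
```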
The Empirical Distribution
▪ A sampling-based approximation can be formally represented using an empirical distribution
▪ Given 𝐿 points/samples 𝒛(1), 𝒛(2), …, 𝒛(𝐿), the empirical distribution defined by these is
  𝑝̂(𝐴) = Σℓ=1…𝐿 𝑤(ℓ) 𝛿𝒛(ℓ)(𝐴)
▪ Here 𝑤(ℓ) is the weight of point 𝒛(ℓ) (the weights sum to 1; typically 𝑤(ℓ) = 1/𝐿), and 𝛿𝒛(ℓ) is the Dirac distribution with finite support at 𝒛(ℓ), i.e., 𝛿𝒛′(𝐴) = 1 if 𝒛′ ∈ 𝐴 and 0 otherwise
▪ Can think of 𝐴 as the region over which we want to evaluate the distribution
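A small sketch (not from the slides): with uniform weights 𝑤(ℓ) = 1/𝐿, evaluating the empirical distribution on a set 𝐴 amounts to counting the fraction of samples that fall in 𝐴; the 𝒩(0, 1) target and the set 𝐴 = [0, 1] are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(1)
L = 50_000
z = rng.normal(size=L)           # samples z^(1), ..., z^(L) from p(z) = N(0, 1)
w = np.full(L, 1.0 / L)          # uniform weights, summing to 1

# Empirical probability of A = [0, 1]: sum of weights of samples inside A
in_A = (z >= 0.0) & (z <= 1.0)
p_hat_A = np.sum(w[in_A])
print(p_hat_A)                   # ≈ Phi(1) - Phi(0) ≈ 0.3413
```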
Sampling: Some Basic Methods
▪ Most of these basic methods are based on the idea of transformation
▪ Generate a random sample 𝑥 from a distribution 𝑞(𝑥) which is easy to sample from
▪ Apply a transformation on 𝑥 to make it a random sample 𝑧 from a complex distribution 𝑝(𝑧); by the change of variables formula, 𝑝(𝑧) = 𝑞(𝑥) |𝜕𝑥/𝜕𝑧| (determinant of the Jacobian)
▪ Some popular examples of transformation methods
  ▪ Inverse CDF method: draw 𝑥 ∼ Unif(0, 1) and set 𝑧 = 𝐹⁻¹(𝑥), where 𝐹(𝑧) is the CDF of 𝑝(𝑧)
  ▪ Reparametrization method
  ▪ Box-Muller method: Given 𝑥1, 𝑥2 from Unif(0, 1), generate (𝑧1, 𝑧2) from 𝒩(0, 𝐈2) as
    𝑧1 = √(−2 ln 𝑥1) cos(2π𝑥2),  𝑧2 = √(−2 ln 𝑥1) sin(2π𝑥2)
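A minimal runnable sketch of both transformations (my illustration; the Exp(1) target for the inverse-CDF part is an arbitrary choice):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 100_000

# Inverse CDF method: for p(z) = Exp(1), F(z) = 1 - e^{-z}, so F^{-1}(x) = -ln(1 - x)
x = rng.uniform(size=n)
z_exp = -np.log1p(-x)            # samples from Exp(1)
print(z_exp.mean())              # ≈ 1, the mean of Exp(1)

# Box-Muller method: two Unif(0, 1) draws -> two independent N(0, 1) draws
x1, x2 = rng.uniform(size=n), rng.uniform(size=n)
z1 = np.sqrt(-2.0 * np.log(x1)) * np.cos(2.0 * np.pi * x2)
z2 = np.sqrt(-2.0 * np.log(x1)) * np.sin(2.0 * np.pi * x2)
print(z1.mean(), z1.std(), np.corrcoef(z1, z2)[0, 1])  # ≈ 0, 1, 0
```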
▪ Transformation Methods are simple but have limitations
▪ Mostly limited to standard distributions and/or distributions with very few variables
Rejection Sampling
▪ Goal: Generate a random sample from a distribution of the form 𝑝(𝑧) = 𝑝̃(𝑧)/𝑍𝑝, assuming
  ▪ We can only evaluate the value of the numerator 𝑝̃(𝑧) for any 𝑧
  ▪ The denominator (normalization constant) 𝑍𝑝 is intractable and we don’t know its value
▪ Assume a proposal distribution 𝑞(𝑧) we can generate samples from (it should have the same support as 𝑝(𝑧)), and a constant 𝑀 such that 𝑀𝑞(𝑧) ≥ 𝑝̃(𝑧) for all 𝑧
▪ Rejection Sampling then works as follows
  ▪ Sample a random variable 𝑧∗ from 𝑞(𝑧)
  ▪ Sample a uniform r.v. 𝑢 ∼ Unif(0, 𝑀𝑞(𝑧∗))
  ▪ If 𝑢 ≤ 𝑝̃(𝑧∗) then accept 𝑧∗, otherwise reject it
▪ All accepted 𝑧∗’s will be random samples from 𝑝(𝑧). Proof on next slide
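A runnable sketch of the procedure (my example, not from the slides): the unnormalized bimodal target, the 𝒩(0, 3²) proposal, and the loose constant 𝑀 = 15 are all arbitrary choices.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)

def p_tilde(z):
    # Unnormalized target: bimodal density; normalizer Z_p assumed unknown
    return np.exp(-0.5 * (z - 2.0) ** 2) + 0.5 * np.exp(-0.5 * (z + 2.0) ** 2)

q = stats.norm(loc=0.0, scale=3.0)    # proposal we can sample from
M = 15.0                              # must satisfy M * q(z) >= p_tilde(z) for all z

samples = []
while len(samples) < 10_000:
    z_star = rng.normal(0.0, 3.0)               # z* ~ q(z)
    u = rng.uniform(0.0, M * q.pdf(z_star))     # u ~ Unif(0, M q(z*))
    if u <= p_tilde(z_star):                    # accept with prob p_tilde / (M q)
        samples.append(z_star)

samples = np.array(samples)   # approximately distributed as p(z) = p_tilde(z)/Z_p
```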
Rejection Sampling
▪ Why is 𝑧 ∼ 𝑞(𝑧) + accept/reject rule equivalent to 𝑧 ∼ 𝑝(𝑧)?
▪ Let’s look at the pdf of the 𝑧’s that were accepted, i.e., 𝑝(𝑧|accept)
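A brief version of the standard argument (filling in the steps): 𝑧∗ ∼ 𝑞(𝑧) is accepted with probability 𝑝(accept|𝑧) = 𝑝̃(𝑧)/(𝑀𝑞(𝑧)), so

  𝑝(𝑧|accept) = 𝑝(accept|𝑧) 𝑞(𝑧) / 𝑝(accept) = (𝑝̃(𝑧)/𝑀) / ∫ (𝑝̃(𝑧)/𝑀) 𝑑𝑧 = 𝑝̃(𝑧)/𝑍𝑝 = 𝑝(𝑧)

Note also that 𝑝(accept) = 𝑍𝑝/𝑀, so a tighter envelope (smaller 𝑀) means fewer rejections.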
Computing Expectations via Monte Carlo Sampling
▪ Often we are interested in computing expectations of the form
  𝔼[𝑓] = ∫ 𝑓(𝑧) 𝑝(𝑧) 𝑑𝑧
where 𝑓(𝑧) is some function of the random variable 𝑧 ∼ 𝑝(𝑧)
▪ A simple approx. scheme to compute the above expectation: Monte Carlo integration
▪ Generate 𝐿 independent samples from 𝑝(𝑧) (assuming we know how to sample from it): {𝑧(ℓ)}ℓ=1…𝐿 ∼ 𝑝(𝑧)
▪ Approximate the expectation by the following empirical average
  𝔼[𝑓] ≈ 𝑓̂ = (1/𝐿) Σℓ=1…𝐿 𝑓(𝑧(ℓ))
▪ Since the samples are independent of each other, we can show the following
  𝔼[𝑓̂] = 𝔼[𝑓]  (the estimate is unbiased)
  var[𝑓̂] = var[𝑓]/𝐿  (the variance in our estimate decreases as 𝐿 increases)
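A quick numerical check (my example): estimating 𝔼[𝑧²] under 𝒩(0, 1), whose true value is 1; repeating the estimator many times shows the mean is on target and the variance shrinks as 1/𝐿 (here var[𝑓] = 2).

```python
import numpy as np

rng = np.random.default_rng(4)
f = lambda z: z ** 2            # f(z); true E[f] = 1 under N(0, 1)

def mc_estimate(L):
    z = rng.normal(size=L)      # L independent samples from p(z) = N(0, 1)
    return f(z).mean()          # empirical average f_hat

for L in (10, 100, 1000):
    ests = np.array([mc_estimate(L) for _ in range(2000)])
    # Mean of estimates ≈ 1 (unbiased); variance ≈ var[f]/L = 2/L for this f
    print(L, ests.mean(), ests.var())
```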
Computing Expectations via Importance Sampling
▪ How to compute Monte Carlo expec. if we don’t know how to sample from 𝑝(𝑧)?
▪ One way is to use transformation methods or rejection sampling
▪ Another way is to use Importance Sampling (assuming 𝑝(𝑧) can be evaluated at least)
▪ Generate 𝐿 independent samples from a proposal 𝑞(𝑧) we know how to sample from: {𝑧(ℓ)}ℓ=1…𝐿 ∼ 𝑞(𝑧)
▪ Now approximate the expectation as follows
  𝔼[𝑓] = ∫ 𝑓(𝑧) 𝑝(𝑧) 𝑑𝑧 = ∫ 𝑓(𝑧) (𝑝(𝑧)/𝑞(𝑧)) 𝑞(𝑧) 𝑑𝑧 ≈ (1/𝐿) Σℓ=1…𝐿 (𝑝(𝑧(ℓ))/𝑞(𝑧(ℓ))) 𝑓(𝑧(ℓ))
▪ This is basically “weighted” Monte Carlo integration
▪ 𝑤(ℓ) = 𝑝(𝑧(ℓ))/𝑞(𝑧(ℓ)) denotes the importance weight of each sample 𝑧(ℓ) (see PRML 11.1.4)
▪ IS works even when we can only evaluate 𝑝(𝑧) = 𝑝̃(𝑧)/𝑍𝑝 up to a proportionality constant (using self-normalized weights 𝑤(ℓ)/Σ𝑘 𝑤(𝑘))
▪ Note: Monte Carlo integration and Importance Sampling are NOT sampling methods!
▪ They are only used for computing expectations (approximately)
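A small sketch (my example, with arbitrary target/proposal choices): plain IS when 𝑝(𝑧) can be evaluated, and the self-normalized variant when only 𝑝̃(𝑧) can.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
L = 100_000

p = stats.norm(0.0, 1.0)         # target p(z) (known here; illustration only)
q = stats.norm(1.0, 2.0)         # proposal with the same support, heavier tails
f = lambda z: z ** 2             # true E[f] = 1 under the target

z = q.rvs(size=L, random_state=rng)      # samples from q, not from p
w = p.pdf(z) / q.pdf(z)                  # importance weights w^(l)
print(np.mean(w * f(z)))                 # ≈ 1: weighted Monte Carlo integration

# If only p_tilde = Z_p * p(z) can be evaluated, normalize the weights instead
p_tilde = lambda z: 5.0 * p.pdf(z)       # pretend-unnormalized version (Z_p = 5)
w_t = p_tilde(z) / q.pdf(z)
print(np.sum(w_t * f(z)) / np.sum(w_t))  # self-normalized IS, still ≈ 1
```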
Limitations of the Basic Methods
▪ Transformation based methods: Usually limited to drawing from standard distributions
▪ Rejection Sampling and Importance Sampling: Require good proposal distributions
  ▪ For Rejection Sampling, 𝑞(𝑧) should be such that 𝑀𝑞(𝑧) envelopes 𝑝̃(𝑧) everywhere
  ▪ For Importance Sampling, we would ideally like 𝑞(𝑧) to give samples from where 𝑝(𝑧) is large, or where 𝑓(𝑧)𝑝(𝑧) is large
  ▪ Either property is difficult to guarantee if 𝑧 is high-dimensional
▪ In general, difficult to find good prop. distr. especially when 𝑧 is high-dim
▪ More sophisticated sampling methods like MCMC work well in such high-dim spaces
Markov Chain Monte Carlo (MCMC)
▪ Goal: Generate samples from some target distribution 𝑝(𝒛) = 𝑝̃(𝒛)/𝑍𝑝, where 𝒛 usually is high-dimensional (if the target is a posterior, it will be conditioned on data, i.e., 𝑝(𝒛|𝒙))
▪ Assume we can evaluate 𝑝(𝒛) at least up to a proportionality constant, i.e., we can at least evaluate 𝑝̃(𝒛)
▪ MCMC uses a Markov Chain which, when converged, starts giving samples from 𝑝(𝑧)
▪ Given current sample 𝒛(ℓ) from the chain, MCMC generates the next sample 𝒛(ℓ+1) as
▪ Use a proposal distribution 𝑞(𝒛|𝒛(ℓ)) to generate a candidate sample 𝒛∗ (the proposal should also have the same support as 𝑝(𝒛))
▪ Accept/reject 𝒛∗ as the next sample based on an acceptance criterion (will see later)
▪ If accepted, set 𝒛(ℓ+1) = 𝒛∗. If rejected, set 𝒛(ℓ+1) = 𝒛(ℓ)
▪ Important: The proposal distribution 𝑞(𝒛|𝒛(ℓ) ) depends on the previous sample 𝒛(ℓ)
MCMC: The Basic Scheme
▪ The chain run infinitely long (i.e., upon convergence) will give ONE sample from 𝑝(𝒛)
▪ But we usually require several samples to approximate 𝑝(𝒛)
  ▪ MCMC is exact in theory but approximate in practice, since we can’t run the chain for infinitely long; thus we say that the samples are approximately from the target distribution
▪ This is done as follows
  ▪ Start the chain at an initial 𝒛(0)
  ▪ Using the proposal 𝑞(𝒛|𝒛(ℓ)), run the chain long enough, say 𝑇1 steps
  ▪ Discard the first 𝑇1 − 1 samples (called “burn-in” samples) and take the last sample 𝒛(𝑇1), which we treat as our first sample from 𝑝(𝒛)
▪ Continue from 𝒛(𝑇1) up to 𝑇2 steps, discard intermediate samples, take last sample 𝒛(𝑇2)
▪ This discarding (called “thinning”) helps ensure that 𝒛(𝑇1) and 𝒛(𝑇2) are uncorrelated
▪ Repeat the same for a total of 𝑆 times
▪ In the end, we now have 𝑆 approximately independent samples from 𝑝(𝒛) (approximate independence is a requirement for the Monte Carlo approximation)
▪ Note: Good choices for 𝑇1 and 𝑇𝑖 − 𝑇𝑖−1 (thinning gap) are usually based on heuristics
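A minimal sketch of the overall scheme (my illustration): the accept/reject criterion below is the Metropolis acceptance rule for a symmetric proposal, which the slides cover later; the unnormalized target, burn-in 𝑇1, and thinning gap are arbitrary heuristic choices.

```python
import numpy as np

rng = np.random.default_rng(6)

def p_tilde(z):
    # Unnormalized target, evaluable only up to the constant Z_p
    return np.exp(-0.5 * z ** 2) * (1.0 + np.sin(3.0 * z) ** 2)

def mcmc(n_samples, burn_in=1000, thin=10, step=1.0):
    z = 0.0                                   # initial state z^(0)
    samples, t = [], 0
    while len(samples) < n_samples:
        z_star = z + step * rng.normal()      # proposal q(z|z^(l)): random walk
        # Metropolis acceptance rule (symmetric proposal; covered next lecture)
        if rng.uniform() <= min(1.0, p_tilde(z_star) / p_tilde(z)):
            z = z_star                        # accept: z^(l+1) = z*
        # else: reject, z^(l+1) = z^(l)
        t += 1
        if t > burn_in and t % thin == 0:     # discard burn-in, keep every thin-th
            samples.append(z)
    return np.array(samples)

samples = mcmc(5000)   # approximately independent samples, approximately from p(z)
```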
MCMC: Some Basic Theory
▪ A first order Markov Chain assumes 𝑝 𝒛(ℓ+1) |𝒛 1 , … , 𝒛(ℓ) = 𝑝(𝒛 ℓ+1 |𝒛(ℓ) )
▪ A 1st order Markov Chain 𝒛(0) , 𝒛(1) , … , 𝒛(𝐿) is a sequence of r.v.’s and is defined by
▪ An initial state distribution 𝑝(𝒛 0 )
▪ A Transition Function (TF): 𝑇ℓ 𝒛 ℓ → 𝒛 ℓ+1 = 𝑝(𝒛 ℓ+1 |𝒛(ℓ) )
▪ TF is a distribution over the values of next state given the value of the current state
▪ Assuming a 𝐾-dim discrete state-space, the TF will be a 𝐾 × 𝐾 probability table
▪ Homogeneous Markov Chain: The TF is the same for all ℓ , i.e., 𝑇ℓ = 𝑇
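For concreteness, a tiny simulation (my sketch) of a homogeneous chain over a 𝐾 = 3 discrete state-space; the transition table 𝑇 is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(7)

# Hypothetical K x K transition table: T[i, j] = p(next = j | current = i)
T = np.array([[0.1, 0.6, 0.3],
              [0.4, 0.2, 0.4],
              [0.3, 0.5, 0.2]])
p0 = np.array([0.5, 0.2, 0.3])        # initial state distribution p(z^(0))

z = rng.choice(3, p=p0)               # draw z^(0)
chain = [z]
for _ in range(10):
    z = rng.choice(3, p=T[z])         # draw z^(l+1) from T(z^(l) -> .)
    chain.append(z)
print(chain)
```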
MCMC: Some Basic Theory
▪ Consider the following Markov Chain with a 𝐾 = 3 discrete state-space
  ▪ 𝑝(𝒛(0)) = [𝑝(𝑧(0) = 1), 𝑝(𝑧(0) = 2), 𝑝(𝑧(0) = 3)] = [0.5, 0.2, 0.3]
  ▪ 𝑝(𝒛(1)) = 𝑝(𝒛(0)) × 𝑇 = [0.2, 0.6, 0.2]  (rounded to a single digit after the decimal)
  ▪ After doing this a few more (say some 𝑚) times: 𝑝(𝒛(0)) × 𝑇^𝑚 = [0.2, 0.4, 0.4]  (rounded to a single digit after the decimal)
  ▪ The multinoulli 𝑝(𝒛) with 𝝅 = [0.2, 0.4, 0.4] is the stationary/invariant distribution of this Markov Chain
▪ 𝑝(𝒛) being stationary means that no matter what 𝑝(𝒛(0)) is, we will eventually reach 𝑝(𝒛)
▪ A Markov Chain has a stationary distribution if 𝑇 has the following properties
▪ Irreducibility: T’s graph is connected (ensures reachability from anywhere to anywhere)
▪ Aperiodicity: T’s graph has no cycles (ensures that the chain isn’t trapped in cycles)
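A numeric check (my sketch; the transition table is the same hypothetical 𝑇 as above, not the one from the slide): repeatedly multiplying any initial distribution by 𝑇 converges to a stationary distribution satisfying 𝝅 = 𝝅𝑇.

```python
import numpy as np

# Hypothetical transition table (rows sum to 1); all entries positive,
# so the chain is irreducible and aperiodic
T = np.array([[0.1, 0.6, 0.3],
              [0.4, 0.2, 0.4],
              [0.3, 0.5, 0.2]])

p = np.array([0.5, 0.2, 0.3])        # any initial distribution p(z^(0))
for _ in range(50):
    p = p @ T                        # p(z^(m)) = p(z^(0)) T^m
print(p)                             # stationary distribution pi
print(np.allclose(p, p @ T))         # True: pi = pi T
```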
MCMC: Some Basic Theory
▪ A Markov Chain with transition function 𝑇 has stationary distribution 𝑝(𝒛) if 𝑇 satisfies the Detailed Balance condition
  𝑝(𝒛) 𝑇(𝒛′|𝒛) = 𝑝(𝒛′) 𝑇(𝒛|𝒛′)
  (here 𝑇(𝒂|𝒃) denotes the transition probability of going from state 𝒃 to state 𝒂)
▪ Integrating out (or summing over) the detailed balance condition on both sides w.r.t. 𝒛′, and using ∫ 𝑇(𝒛′|𝒛) 𝑑𝒛′ = 1, we get
  𝑝(𝒛) = ∫ 𝑝(𝒛′) 𝑇(𝒛|𝒛′) 𝑑𝒛′
  Thus 𝑝(𝒛) is the stationary distribution of this Markov Chain
▪ Thus a Markov Chain with detailed balance always converges to a stationary distribution
▪ Detailed Balance ensures reversibility
▪ Detailed balance is a sufficient but not a necessary condition for having a stationary distribution
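A small check (my sketch): build a transition matrix via the Metropolis rule (a construction that enforces detailed balance, covered next lecture) for an arbitrary 3-state target 𝝅, then verify both detailed balance and stationarity numerically.

```python
import numpy as np

pi = np.array([0.2, 0.3, 0.5])            # target distribution p(z)
K = len(pi)

# Metropolis construction: uniform symmetric proposal + accept prob min(1, pi_j/pi_i)
T = np.zeros((K, K))
for i in range(K):
    for j in range(K):
        if i != j:
            T[i, j] = (1.0 / K) * min(1.0, pi[j] / pi[i])
    T[i, i] = 1.0 - T[i].sum()            # remaining mass: stay at state i

# Detailed balance: pi_i T[i, j] == pi_j T[j, i] for all i, j
flow = pi[:, None] * T
print(np.allclose(flow, flow.T))          # True

# Hence pi is stationary: summing detailed balance over states gives pi = pi T
print(np.allclose(pi, pi @ T))            # True
```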
Coming Up Next
▪ MCMC algorithms
▪ Metropolis Hastings (MH)
▪ Gibbs sampling (special case of MH)