ML Lecture17

This document discusses Kalman filtering and related techniques for time series modeling and state estimation. It provides an overview of the problem of estimating the most probable state at different times (prediction, filtering, smoothing) using noisy sensor measurements and a system model. Applications mentioned include robotics, tracking, econometrics, and navigation. The document then discusses the fundamentals of modeling time series as stochastic processes, hidden Markov models, dynamic systems, and recursive Bayesian filtering. It notes that in certain restrictive cases, including linear Gaussian systems, the optimal solution exists in closed form via the Kalman filter equations.

Kalman filtering and friends:

Inference in time series models


Herke van Hoof
slides mostly by Michael Rubinstein
Problem overview
• Goal
– Estimate the most probable state at time k using measurements up to time k′
k′ < k: prediction
k′ = k: filtering
k′ > k: smoothing
• Input
– (Noisy) sensor measurements
– Known or learned system model (see last lecture)

• Many problems require estimating the state of systems that change over time using noisy measurements on the system
Applications
• Ballistics
• Robotics
– Robot localization
• Tracking hands/cars/…
• Econometrics
– Stock prediction
• Navigation

• Many more…
Example: noisy localization
Position at t=0
(image: https://img.clipartfest.com)
Measurements at t=1, t=2, …, t=6
Smoothing: where was I in the past (e.g. t=3)?
Filtering: where am I now (t=6)?
Prediction: where will I be in the future?


Today’s lecture
• Fundamentals
• Formalizing time series models
• Recursive filtering
• Two cases with optimal solutions
• Linear Gaussian models
• Discrete systems
• Suboptimal solutions
Stochastic Processes
• Stochastic process
– Collection of random variables indexed by some set
– i.e., a random variable x_i for every element i of the index set

• Time series modeling
– Sequence of random states/variables
– Measurements available at discrete times
– Modeled as a stochastic process indexed by ℕ (a minimal sampling sketch follows below)

(figure: densities p(x_1), p(x_2), p(x_3) over the location at t = 1, 2, 3)
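A minimal sketch (Python/NumPy, not from the original slides) of such a process: a Gaussian random walk, one realization of the random variables x_1, x_2, … indexed by ℕ. The unit step size is an arbitrary assumption.

import numpy as np

rng = np.random.default_rng(0)
T = 6
x = np.zeros(T + 1)                         # x[k] holds one realization of the R.V. x_k
for k in range(1, T + 1):
    x[k] = x[k - 1] + rng.normal(0.0, 1.0)  # each step adds i.i.d. Gaussian noise
print(x)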
(First-order) Markov process
• The Markov property: the distribution of a future state depends on the present state only,
p(x_k | x_1, …, x_{k-1}) = p(x_k | x_{k-1})

• Markov chain: a stochastic process with the Markov property (sampling sketch below)

(graphical model: states x_{k-1} → x_k → x_{k+1} along the time axis k-1, k, k+1)
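A hedged sketch of the Markov property in code: sampling a two-state Markov chain whose next state depends only on the present state. The transition matrix is made up for illustration.

import numpy as np

rng = np.random.default_rng(1)
P = np.array([[0.9, 0.1],   # P[i, j] = p(x_k = j | x_{k-1} = i)
              [0.3, 0.7]])
state = 0
path = [state]
for k in range(10):
    state = rng.choice(2, p=P[state])  # depends on the present state only
    path.append(state)
print(path)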
Hidden Markov Model (HMM)
• The state is not directly visible, but an output that depends on the state is visible

(graphical model: hidden states x_{k-1} → x_k → x_{k+1}, each emitting an observed measurement z_{k-1}, z_k, z_{k+1})
State space
• The state vector contains all available information to describe the investigated system
– usually multidimensional
• The measurement vector represents observations related to the state vector
– generally (but not necessarily) of lower dimension than the state vector
State space
• Tracking: e.g. object position and velocity
• Econometrics:
– Monetary flow
– Interest rates
– Inflation
– …
Dynamic System

(graphical model: states x_{k-1} → x_k → x_{k+1} with observations z_{k-1}, z_k, z_{k+1}; the state sequence is a stochastic diffusion)

State equation: x_k = f_k(x_{k-1}, v_{k-1})
– x_k: state vector at time instant k
– f_k: state transition function
– v_{k-1}: i.i.d. process noise

Observation equation: z_k = h_k(x_k, w_k)
– z_k: observations at time instant k
– h_k: observation function
– w_k: i.i.d. measurement noise

(a code sketch of this interface follows below)
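As a sketch, the two equations map directly onto code. The particular f_k, h_k, and noise scales below are hypothetical, chosen only to show the interface:

import numpy as np

rng = np.random.default_rng(2)

def step(x_prev):
    """One step of x_k = f_k(x_{k-1}, v_{k-1}) and z_k = h_k(x_k, w_k)."""
    v = rng.normal(0.0, 0.1)               # i.i.d. process noise
    x = 0.5 * x_prev + np.sin(x_prev) + v  # hypothetical state transition f_k
    w = rng.normal(0.0, 0.2)               # i.i.d. measurement noise
    z = x ** 2 + w                         # hypothetical observation h_k
    return x, z

x, z = step(1.0)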
A simple dynamic system
• 4-dimensional state space: position and velocity, x_k = (p_x, p_y, v_x, v_y)ᵀ
• Constant velocity motion: x_k = F x_{k-1} + v_{k-1}, where F advances each position by its velocity
• Only position is observed: z_k = H x_k + w_k, where H selects the position components
• Both noise terms follow a Gaussian distribution (sketch below)

Yacov Hel-Or
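A sketch of this system in NumPy, assuming a unit time step and made-up noise levels:

import numpy as np

dt = 1.0
F = np.array([[1, 0, dt, 0],   # position += velocity * dt
              [0, 1, 0, dt],
              [0, 0, 1, 0],    # velocity stays constant
              [0, 0, 0, 1]], dtype=float)
H = np.array([[1, 0, 0, 0],    # only the position components are observed
              [0, 1, 0, 0]], dtype=float)

rng = np.random.default_rng(3)
x = np.array([0.0, 0.0, 1.0, 0.5])           # initial state (p_x, p_y, v_x, v_y)
for k in range(5):
    x = F @ x + rng.normal(0, 0.05, size=4)  # Gaussian process noise
    z = H @ x + rng.normal(0, 0.5, size=2)   # Gaussian measurement noise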
Today’s lecture
• Fundamentals
• Formalizing time series models
• Recursive filtering
• Two cases with optimal solutions
• Linear Gaussian models
• Discrete systems
• Suboptimal solutions
Recursive filters
• For many problems, estimate is required each time a
new measurement arrives

• Batch processing
– Requires all available data
• Sequential processing
– New data is processed upon arrival
– Need not store the complete dataset
– Need not reprocess all data for each new measurement
– Assume no out-of-sequence measurements (solutions for
this exist as well…)
Bayesian filter
• Construct the posterior probability density function p(x_k | z_{1:k}) of the state based on all available information

(figure: posterior density over the sample space; portrait of Thomas Bayes)

• By knowing the posterior, many kinds of estimates for x_k can be derived
– mean (expectation), mode, median, …
– Can also give an estimate of the accuracy (e.g. covariance)
Recursive Bayes filters
• Given:
– System models in probabilistic form:
p(x_k | x_{k-1}): Markovian process
p(z_k | x_k): measurements conditionally independent given the state
(known statistics of v_k, w_k)
– Initial state p(x_0), also known as the prior
– Measurements z_1, …, z_k
Recursive Bayes filters
• Prediction step (a priori): compute p(x_k | z_{1:k-1})
– Uses the system model to predict forward
– Deforms/translates/spreads the state pdf due to random noise
• Update step (a posteriori): compute p(x_k | z_{1:k})
– Updates the prediction in light of new data
– Tightens the state pdf
Prior vs posterior?

• It can seem odd to regard p(x_k | z_{1:k-1}) as a prior

• Compare (posterior = likelihood × prior / evidence):

P(x_k | z_k, z_{1:k-1}) = p(z_k | x_k, z_{1:k-1}) · P(x_k | z_{1:k-1}) / p(z_k | z_{1:k-1})

• In the update with z_k, p(x_k | z_{1:k-1}) acts as a working prior
General prediction-update framework

• Assume p(x_{k-1} | z_{1:k-1}) is given at time k-1

• Prediction:

p(x_k | z_{1:k-1}) = ∫ p(x_k | x_{k-1}) p(x_{k-1} | z_{1:k-1}) dx_{k-1}   (1)
(system model × previous posterior)

• Uses the Chapman-Kolmogorov identity + the Markov property
General prediction-update framework

• Update step:

p(x_k | z_{1:k}) = p(z_k | x_k) p(x_k | z_{1:k-1}) / p(z_k | z_{1:k-1})   (2)
(measurement model × current prior / normalization constant)

where p(z_k | z_{1:k-1}) = ∫ p(z_k | x_k) p(x_k | z_{1:k-1}) dx_k
Generating estimates

• Knowledge of p(x_k | z_{1:k}) enables computing the optimal estimate with respect to any criterion, e.g. (see the sketch below)
– Minimum mean-square error (MMSE): x̂_k = E[x_k | z_{1:k}]
– Maximum a posteriori (MAP): x̂_k = arg max p(x_k | z_{1:k})
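A sketch of both estimates from a discretized (and entirely made-up) 1-D posterior:

import numpy as np

xs = np.linspace(-5, 5, 1001)          # discretized state values
post = np.exp(-0.5 * (xs - 1.2) ** 2)  # toy unnormalized posterior
post /= post.sum()

x_mmse = np.sum(xs * post)    # posterior mean: MMSE estimate
x_map = xs[np.argmax(post)]   # posterior mode: MAP estimate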
General prediction-update framework

➔ So (1) and (2) give the optimal solution to the recursive estimation problem!
• Unfortunately… this is only a conceptual solution
– The integrals are intractable in general…
– Cannot represent arbitrary pdfs!

• However, an optimal solution does exist for several restrictive cases
Today’s lecture
• Fundamentals
• Formalizing time series models
• Recursive filtering
• Two cases with optimal solutions
• Linear Gaussian models
• Discrete systems
• Suboptimal solutions
Restrictive case #1
• The posterior at each time step is Gaussian
– Completely described by its mean and covariance
• If p(x_{k-1} | z_{1:k-1}) is Gaussian, it can be shown that p(x_k | z_{1:k}) is also Gaussian, provided that:
– v_k, w_k are Gaussian
– f_k, h_k are linear
Restrictive case #1
• Why linear? A linear function of a Gaussian random variable is again Gaussian; a nonlinear function generally is not.

(figures by Yacov Hel-Or)
Restrictive case #1
• Linear system with additive noise:

x_k = F_k x_{k-1} + v_{k-1}
z_k = H_k x_k + w_k

• Simple example again: the constant-velocity model with observed position (F and H as above)
The Kalman filter
Rudolf E. Kalman

• Substituting the linear Gaussian model into (1) and (2) yields the predict and update equations
The Kalman filter
Predict:
x̂_{k|k-1} = F_k x̂_{k-1|k-1}
P_{k|k-1} = F_k P_{k-1|k-1} F_kᵀ + Q_k

Update:
K_k = P_{k|k-1} H_kᵀ (H_k P_{k|k-1} H_kᵀ + R_k)⁻¹
x̂_{k|k} = x̂_{k|k-1} + K_k (z_k − H_k x̂_{k|k-1})
P_{k|k} = (I − K_k H_k) P_{k|k-1}

(Q_k, R_k: process and measurement noise covariances; a NumPy sketch follows below)
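A minimal NumPy sketch of these equations (an illustration, not the lecture's code):

import numpy as np

def kf_predict(x, P, F, Q):
    """x_{k|k-1} = F x_{k-1|k-1};  P_{k|k-1} = F P F^T + Q."""
    return F @ x, F @ P @ F.T + Q

def kf_update(x, P, z, H, R):
    S = H @ P @ H.T + R              # innovation covariance
    K = P @ H.T @ np.linalg.inv(S)   # Kalman gain
    x_new = x + K @ (z - H @ x)      # correct the prediction with the residual
    P_new = (np.eye(len(x)) - K @ H) @ P
    return x_new, P_new

These compose with the constant-velocity F and H defined earlier: one filter step is kf_update(*kf_predict(x, P, F, Q), z, H, R).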
Intuition via 1D example
• Lost at sea
– Night
– No idea of location
– For simplicity – let’s
assume 1D
– Not moving

* Example and plots by Maybeck, “Stochastic models, estimation and control, volume 1”
Example – cont’d
• Time t1: Star Sighting
– Denote z(t1)=z1
• Uncertainty (inaccuracies, human error, etc)
– Denote σ1 (normal)
• Can establish the conditional probability of
x(t1) given measurement z1
Example – cont’d

• Probability for any location, based on measurement


• For Gaussian density – 68.3% within ±σ1
• Best estimate of position: Mean/Mode/Median
Example – cont’d
• Time t2: a friend (better trained) takes a sighting
– x(t2) = z2, σ(t2) = σ2
– Since she has higher skill: σ2 < σ1
Example – cont’d
• f(x(t2)|z1,z2) also Gaussian
Example – cont’d

• Combined estimate: mean μ = (σ2² z1 + σ1² z2) / (σ1² + σ2²), variance σ² with 1/σ² = 1/σ1² + 1/σ2²
• σ is less than both σ1 and σ2
• σ1 = σ2: average
• σ1 > σ2: more weight to z2
• Rewrite: μ = z1 + K (z2 − z1), with K = σ1² / (σ1² + σ2²) (numeric check below)
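A quick numeric check of this rule (z1, z2, σ1, σ2 are made-up values):

z1, s1 = 10.0, 2.0   # first sighting and its standard deviation
z2, s2 = 11.0, 1.0   # the friend's more accurate sighting
K = s1**2 / (s1**2 + s2**2)              # weight on the second measurement
mu = z1 + K * (z2 - z1)                  # fused mean: 10.8, closer to z2
var = 1.0 / (1.0 / s1**2 + 1.0 / s2**2)  # fused variance: 0.8 < min(4, 1)
print(mu, var)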
Example – cont’d
• The Kalman update rule:

x̂(t2) = x̂(t2⁻) + K(t2) [z2 − x̂(t2⁻)]

best estimate given z2 (a posteriori) = best prediction prior to z2 (a priori) + optimal weighting (Kalman gain) × residual
The Kalman filter
Predict: generally increases the variance
Update: generally decreases the variance
Kalman gain

• Small measurement error (R_k → 0), H invertible: K_k → H_k⁻¹, so x̂_{k|k} → H_k⁻¹ z_k (trust the measurement)

• Small prediction error (P_{k|k-1} → 0): K_k → 0, so x̂_{k|k} → x̂_{k|k-1} (trust the prediction); a numeric check follows below
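A small numeric check of the two limits on a 1-D system (H = 2 and all numbers made up):

P_pred, H = 1.0, 2.0
for R in [1.0, 1e-6]:
    K = P_pred * H / (H * P_pred * H + R)  # scalar Kalman gain
    print(R, K)                            # R -> 0 drives K toward 1/H = 0.5
# Conversely, P_pred -> 0 drives K toward 0: the measurement is ignored.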


The Kalman filter
• Pros (compared to e.g. particle filter)
– Optimal closed-form solution to the tracking problem
(under the assumptions)
• No algorithm can do better in a linear-Gaussian environment!
– All ‘logical’ estimations collapse to a unique solution
– Simple to implement
– Fast to execute
• Cons
– If either the system or measurement model is nonlinear → the posterior will be non-Gaussian

Smoothing is possible with a backward message (cf. HMMs, lecture 10)
Restrictive case #2
• The state space (domain) is discrete and finite
• Assume the state space at time k-1 consists of states x_{k-1}^i, i = 1, …, N_s
• Let w_{k-1|k-1}^i = P(x_{k-1} = x_{k-1}^i | z_{1:k-1}) be the conditional probability of the state at time k-1, given measurements up to k-1
The Grid-based filter
• The posterior pdf at k-1 can be expressed as a sum of delta functions:

p(x_{k-1} | z_{1:k-1}) = Σ_i w_{k-1|k-1}^i δ(x_{k-1} − x_{k-1}^i)

• Again, substitution into (1) and (2) yields the predict and update equations

Equivalent to belief monitoring in HMMs (Lecture 10)
The Grid-based filter
• Prediction (1):

w_{k|k-1}^i = Σ_j w_{k-1|k-1}^j p(x_k^i | x_{k-1}^j)

• The new prior is also a weighted sum of delta functions
• The new prior weights are a reweighting of the old posterior weights using the state transition probabilities
The Grid-based filter
• Update (2):

w_{k|k}^i = w_{k|k-1}^i p(z_k | x_k^i) / Σ_j w_{k|k-1}^j p(z_k | x_k^j)

• Posterior weights are a reweighting of the prior weights using the likelihoods (+ normalization); a sketch of one full cycle follows below
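A sketch of one predict/update cycle over a three-state space (transition matrix, likelihoods, and weights all made up):

import numpy as np

trans = np.array([[0.8, 0.15, 0.1],
                  [0.15, 0.7, 0.2],
                  [0.05, 0.15, 0.7]])  # trans[i, j] = p(x_k = i | x_{k-1} = j); columns sum to 1
w = np.array([0.5, 0.3, 0.2])          # posterior weights at k-1
lik = np.array([0.1, 0.7, 0.2])        # p(z_k | x_k = i) for the new measurement

w_pred = trans @ w                     # prediction: reweight with transition probabilities
w_post = lik * w_pred                  # update: multiply by the likelihoods...
w_post /= w_post.sum()                 # ...and normalize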
The Grid-based filter
• Pros:
– p(x_k | x_{k-1}) and p(z_k | x_k) are assumed known, but there is no constraint on their (discrete) shapes
– Easy extension to a varying number of states
– Optimal solution for the discrete-finite environment!
• Cons:
– Curse of dimensionality
• Inefficient if the state space is large
– Statically considers all possible hypotheses

Smoothing is possible with a backward message (cf. HMMs, lecture 10)
Today’s lecture
• Fundamentals
• Formalizing time series models
• Recursive filtering
• Two cases with optimal solutions
• Linear Gaussian models
• Discrete systems
• Suboptimal solutions
Suboptimal solutions
• In many cases these assumptions do not hold
– Practical environments are nonlinear, non-Gaussian, continuous
➔ Approximations are necessary…
– Extended Kalman filter (EKF): analytic approximation
– Approximate grid-based methods: numerical methods
– Multiple-model estimators
– Unscented Kalman filter (UKF)
– Gaussian-sum filters
– Particle filters (PF): sampling approaches
– …
The extended Kalman filter
• The idea: a local linearization of the dynamic system might be a sufficient description of the nonlinearity
• The model: nonlinear system with additive noise:

x_k = f_k(x_{k-1}) + v_{k-1}
z_k = h_k(x_k) + w_k
The extended Kalman filter
• f, h are approximated using a first-order Taylor series expansion (evaluated at the current state estimates), giving Jacobians F̂_k = ∂f_k/∂x at x̂_{k-1|k-1} and Ĥ_k = ∂h_k/∂x at x̂_{k|k-1}

Predict:
x̂_{k|k-1} = f_k(x̂_{k-1|k-1})
P_{k|k-1} = F̂_k P_{k-1|k-1} F̂_kᵀ + Q_k

Update:
K_k = P_{k|k-1} Ĥ_kᵀ (Ĥ_k P_{k|k-1} Ĥ_kᵀ + R_k)⁻¹
x̂_{k|k} = x̂_{k|k-1} + K_k (z_k − h_k(x̂_{k|k-1}))
P_{k|k} = (I − K_k Ĥ_k) P_{k|k-1}

(a scalar sketch follows below)
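A sketch of one EKF cycle for a scalar nonlinear system (f, h, and all numbers are hypothetical; the Jacobians reduce to derivatives):

import numpy as np

def f(x):  return x + 0.1 * np.sin(x)    # hypothetical nonlinear dynamics
def df(x): return 1.0 + 0.1 * np.cos(x)  # its derivative (the "Jacobian")
def h(x):  return x ** 2                 # hypothetical nonlinear measurement
def dh(x): return 2.0 * x

Q, R = 0.01, 0.1       # process / measurement noise variances
x_est, P = 1.0, 1.0    # previous estimate and its variance
z = 1.3                # new (made-up) measurement

# Predict: propagate the estimate through f, the variance through the linearization
x_pred = f(x_est)
F = df(x_est)
P_pred = F * P * F + Q

# Update: linearize h around the prediction
Hj = dh(x_pred)
K = P_pred * Hj / (Hj * P_pred * Hj + R)  # scalar Kalman gain
x_est = x_pred + K * (z - h(x_pred))
P = (1.0 - K * Hj) * P_pred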
The extended Kalman filter
• Pros
– Good approximation when the models are near-linear
– Efficient to calculate
(de facto method for navigation systems and GPS)
• Cons
– Only an approximation (optimality not proven)
– Still a single-Gaussian approximation
• Nonlinearity → non-Gaussianity (e.g. bimodal)
– If the hypothesis is multimodal and we choose incorrectly, it can be difficult to recover
– Inapplicable when f, h are discontinuous
The unscented Kalman filter

(figure by Yacov Hel-Or)

• Instead of linearizing, propagates a small set of deterministically chosen sigma points through the nonlinearity (sketch below)
• Can approximate the posterior more accurately
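A sketch of the unscented transform at the core of the UKF: push 2n+1 sigma points through a nonlinearity g and recover mean and covariance from weighted sums. The scaling parameters are common choices assumed here, not taken from the slides.

import numpy as np

def unscented_transform(mean, cov, g, alpha=1.0, beta=2.0, kappa=0.0):
    n = len(mean)
    lam = alpha ** 2 * (n + kappa) - n
    S = np.linalg.cholesky((n + lam) * cov)          # matrix square root
    sigma = [mean] + [mean + S[:, i] for i in range(n)] \
                   + [mean - S[:, i] for i in range(n)]
    wm = np.full(2 * n + 1, 1.0 / (2 * (n + lam)))   # mean weights
    wc = wm.copy()                                   # covariance weights
    wm[0] = lam / (n + lam)
    wc[0] = lam / (n + lam) + (1 - alpha ** 2 + beta)
    Y = np.array([np.atleast_1d(g(s)) for s in sigma])  # push points through g
    y_mean = wm @ Y
    diff = Y - y_mean
    y_cov = (wc[:, None] * diff).T @ diff
    return y_mean, y_cov

# e.g. a Gaussian pushed through a made-up nonlinearity:
m, C = np.array([1.0, 0.0]), 0.1 * np.eye(2)
ym, yc = unscented_transform(m, C, lambda x: np.array([np.sin(x[0]), x[0] * x[1]]))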


Challenges
• Detection specific
– Full/partial occlusions
– False positives/false negatives
– Entering/leaving the scene
• Efficiency
• Multiple models and switching dynamics
• Multiple targets
• …
Conclusion
• Inference in time series models:
• Past: smoothing
• Present: filtering
• Future: prediction
• The recursive Bayes filter is optimal
• Exactly computable in two cases
• Linear Gaussian systems: Kalman filter
• Discrete systems: Grid filter
• Approximate solutions for other systems
