ENPM808F:
Robot Learning
Summer 2017
Lecture 2:
Memory-Based Learning
Course Outline
• Motor Learning and the Evolution of Intelligence
• Memory-Based Learning
• Behavior-Based Robotics
• Reinforcement Learning
• Value versus Policy Iteration
• Q-Learning and Actor-Critic Models
• Robot Shaping and Evolving Behaviors
• Crossing the Reality Gap
• Imitation and Learning from Demonstration
• Deep Reinforcement Learning with CNNs
• On-line and Lifelong Learning
Global vs. Local Learning
Global Neural Network Learning (e.g., a multilayer perceptron network) utilizes
a completely distributed weight representation, thus requiring an update of all
network weights per pattern per iteration.
Local Neural Network Learning (e.g., CMAC, RBF, LWR) utilizes a locally
distributed weight representation, thus requiring an update of only a small
subset of all network weights per pattern per iteration.
Global Learning
Advantages:
• Compact Representation
• Automatic Resource Allocation
• Generally Continuous and Differentiable Mappings
• Very High Accuracy
Disadvantages:
• Very Slow Convergence
• Unpredictable Local Minima (may not converge to the global minimum)
• Computationally Expensive
• Generalization Not Easily Controllable

Local Learning
Advantages:
• Rapid Convergence
• Computationally Inexpensive
• No Local Minima
• Convergence Guaranteed
Disadvantages:
• Memory Intensive
• Resource Allocation Not Automatic
• Continuity and Differentiability of Mapping More Difficult to Guarantee
• Comparatively Poor Accuracy (on some problems)
Secant Approximation to Tangent
Continuous CMAC
Curse of Dimensionality
vs.
Blessing of Non-Uniformity*
* --- Pedro Domingos
The Curse of Dimensionality in Machine Learning refers to the effect that
many algorithms that work well in low dimensions become intractable in
higher dimensions.
The Blessing of Non-Uniformity refers to the fact that many problems, and
therefore input spaces, which are high-dimensional can be represented
using a lower-dimensional manifold or representation.
Lazy Learning
versus
Eager Learning
Lazy versus Eager Learning
Lazy Learning methods store all of the training data and use it only
when called with a new input vector (query) to perform a mapping.
They make no assumptions about the overall shape of the global
mapping before the query is presented.
Also referred to as Instance-Based Learning methods.
Examples include: k-Nearest Neighbors, Locally Weighted Regression,
and Case-Based Reasoning.
Lazy versus Eager Learning
Eager Learning methods construct an approximate representation of the
global function before receiving a query. They are therefore limited to
generating a single global approximation to the target mapping.
Examples include: Multilayer Perceptrons (e.g., Backprop Networks),
RBFs, Decision Trees, CMAC, …
k-Nearest Neighbor
Training examples \langle x_i, f(x_i) \rangle are stored as points in the input space.
Given a query x_q, the k nearest stored points are located (in the illustrated
example, k = 6) and the prediction is the mean of their stored values:
\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}
k-Nearest Neighbor
Requires:
• Training exemplars and queries map to points in \Re^n
• Small input vectors
• Sufficient density of training data to cover areas of interest
Advantages:
• No information is lost
• Fast training
• Can model highly complex surfaces
Disadvantages:
• Additional computational complexity to answer queries
• Weights all input attributes equally
• Suffers from the Curse of Dimensionality
k-Nearest Neighbor Algorithm
(Real-valued)
• Store all training examples \langle x_i, f(x_i) \rangle
• Given a query x_q, calculate the mean of the values of the k nearest neighbors:
  \hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} f(x_i)}{k}
• Note that there are no weights to update
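As an illustration only (not part of the original slides), a minimal NumPy sketch of this real-valued k-NN rule; the function name and the brute-force distance search are arbitrary choices:
```python
import numpy as np

def knn_predict(x_q, X, y, k=3):
    """Real-valued k-NN: return the mean of f over the k nearest stored exemplars."""
    dists = np.linalg.norm(X - x_q, axis=1)   # Euclidean distance to every stored x_i
    nearest = np.argsort(dists)[:k]           # indices of the k nearest neighbors
    return y[nearest].mean()                  # f_hat(x_q) = (1/k) * sum_i f(x_i)

# Stored training examples <x_i, f(x_i)>
X_train = np.array([[0.0], [1.0], [2.0], [3.0], [4.0]])
y_train = np.array([0.0, 0.8, 0.9, 0.1, -0.7])
print(knn_predict(np.array([1.6]), X_train, y_train, k=3))
```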
Distance Weighted
k-Nearest Neighbor Algorithm
(Real-valued)
Neighbors are weighted based upon their distance from the query point x_q.
Define the distance d(x_q, x_i) between x_q and x_i.
Define weights w_i \equiv \frac{1}{d(x_q, x_i)^2}
Then, given a query x_q,
\hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{k} w_i f(x_i)}{\sum_{i=1}^{k} w_i}
Shepard’s Method
• Store all training examples \langle x_i, f(x_i) \rangle
• Given a query x_q, calculate the weighted sum of the values of all n points:
  \hat{f}(x_q) \leftarrow \frac{\sum_{i=1}^{n} w_i f(x_i)}{\sum_{i=1}^{n} w_i}
• Note that since all points are used, Shepard’s Method is a global learning algorithm
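A sketch of the distance-weighted rule above, with the w_i = 1/d² weighting; passing k=None uses every stored point, which corresponds to Shepard’s Method. The zero-distance guard is an added assumption, not something stated on the slide:
```python
import numpy as np

def weighted_knn_predict(x_q, X, y, k=None):
    """Distance-weighted prediction with w_i = 1/d(x_q, x_i)^2.
    k=None uses all stored points (Shepard's Method)."""
    dists = np.linalg.norm(X - x_q, axis=1)
    if np.any(dists == 0.0):                  # exact match: return the stored value(s) directly
        return y[dists == 0.0].mean()
    order = np.argsort(dists)
    idx = order if k is None else order[:k]
    w = 1.0 / dists[idx] ** 2
    return np.dot(w, y[idx]) / w.sum()        # sum(w_i f(x_i)) / sum(w_i)
```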
Nearest Neighbor vs. Locally Weighted Regression
• k-Nearest Neighbor (Discrete)
• Distance-Weighted k-Nearest Neighbor (Continuous)
• Locally Weighted Regression
Locally Weighted Regression
LWR is a Lazy Learning method in which an approximation \hat{f}(x) is
formed around each query point x_q.
• It is Local since only the points near x_q are used.
• It is Weighted since the influence of the points is determined by their
  distance from the query point.
• It is Regression since \hat{f}(x) approximates a real-valued target function.
We can approximate the target function as a linear combination of the
m weighted attributes of the training points:
\hat{f}(x) = w_0 + w_1 a_1(x) + \dots + w_m a_m(x)
Locally Weighted Linear Regression
Figure: Unweighted Averaging Using Springs vs. Locally Weighted Averaging Using Springs
Locally Weighted Linear Regression
To calculate the squared error over the k nearest neighbors, define the error function
E_1(x_q) \equiv \frac{1}{2} \sum_{x \in kNN(x_q)} \left( f(x) - \hat{f}(x) \right)^2
To weight the influence of points based upon distance, define a kernel function K(d)
that decreases with increasing distance, for example
K(d) \equiv \frac{1}{d}, \qquad K(d) \equiv \frac{1}{d^2}, \qquad K(d) \equiv e^{-d^2}
To minimize the weighted error across the entire training set D,
E_2(x_q) \equiv \frac{1}{2} \sum_{x \in D} \left( f(x) - \hat{f}(x) \right)^2 K(d(x_q, x))
Locally Weighted Linear Regression
Figure: some typical kernel functions (from Atkeson et al., 1997a)
Locally Weighted Linear Regression
To minimize the weighted error across the entire training set D:
E_2(x_q) \equiv \frac{1}{2} \sum_{x \in D} \left( f(x) - \hat{f}(x) \right)^2 K(d(x_q, x))
To minimize the weighted error across the k nearest neighbors:
E_3(x_q) \equiv \frac{1}{2} \sum_{x \in kNN(x_q)} \left( f(x) - \hat{f}(x) \right)^2 K(d(x_q, x))
The gradient-descent weight update rule can then be expressed as
\Delta w_j = \eta \sum_{x \in kNN(x_q)} K(d(x_q, x)) \left( f(x) - \hat{f}(x) \right) a_j(x)
with learning rate \eta, for each attribute a_j of the input vector x.
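The weighted error can be minimized by gradient descent as above, or solved directly at each query. The sketch below (an illustration, not the authors’ code) fits the local linear model in closed form with a Gaussian kernel, where the bandwidth tau is an assumed free parameter:
```python
import numpy as np

def lwr_predict(x_q, X, y, tau=0.5):
    """Locally weighted linear regression at a single query point x_q.
    Minimizes sum_i K(d(x_q, x_i)) * (f(x_i) - w . a(x_i))^2 in closed form."""
    A = np.hstack([np.ones((X.shape[0], 1)), X])   # attributes a(x) = (1, x_1, ..., x_m)
    d2 = np.sum((X - x_q) ** 2, axis=1)
    K = np.exp(-d2 / (2.0 * tau ** 2))             # Gaussian kernel weight per training point
    AW = A * K[:, None]                            # weight each row by its kernel value
    w = np.linalg.solve(A.T @ AW, AW.T @ y)        # weighted normal equations
    a_q = np.concatenate(([1.0], np.atleast_1d(x_q)))
    return a_q @ w                                 # local model evaluated at the query, then discarded
```
The local model is built from the stored data at query time and thrown away afterwards, which is exactly the lazy-learning pattern described earlier.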
Locally Weighted Nonlinear Regression
Locally weighted regression techniques may be extended to utilize
nonlinear local support functions (e.g., quadratic or cubic polynomials)
through the use of widely supported curve-fitting techniques from
statistical analysis.
LOESS (Locally Estimated Scatterplot Smoothing) and LOWESS (Locally
Weighted Scatterplot Smoothing) are efficient nonparametric methods for
fitting models to subsets of data.
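For instance, a LOWESS fit can be obtained from the statsmodels package (assuming it is installed); frac controls the fraction of the data used in each local fit:
```python
import numpy as np
from statsmodels.nonparametric.smoothers_lowess import lowess

x = np.linspace(0.0, 10.0, 200)
y = np.sin(x) + 0.3 * np.random.randn(200)

# Returns (x, smoothed y) pairs sorted by x; each fitted value comes from a
# weighted local regression over the nearest frac * N points.
smoothed = lowess(y, x, frac=0.2)
```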
Radial Basis Function Networks
Radial Basis Function Networks (RBFs) are a type of neural network
closely related to distance-weighted regression.
They form global approximations of the target function using a linear
combination of basis functions weighted by distance from the points of
interest.
Radial Basis Function Network
Figure: network structure with input attributes a(x) = (a_1(x), a_2(x), \dots, a_n(x))
Radial Basis Function Networks
\hat{f}(x) = w_0 + \sum_{u=1}^{k} w_u K_u(d(x_u, x))
The kernel function K_u is defined so that it decreases with increasing
distance d(x_u, x).
A common choice for K_u is the Gaussian function
K_u(d(x_u, x)) = e^{-\frac{d^2(x_u, x)}{2\sigma_u^2}}
where \sigma_u^2 denotes the variance of the Gaussian centered at x_u.
RBF Network Training
RBF networks are trained in a two-stage process.
First, the number of hidden units is determined (often the same as
the number of points in the training set). Their centers are
initialized (generally to the training points), and their variances are
initialized to correspond with the chosen kernel function.
Second, the weights are trained using the global error function
E \equiv \frac{1}{2} \sum_{x \in D} \left( f(x) - \hat{f}(x) \right)^2
or a localized error function, e.g.
E_u \equiv \frac{1}{2} \sum_{x \in D} \left( f(x) - \hat{f}(x) \right)^2 K_u(d(x_u, x))
RBF Network Training
The hidden-layer nodes may alternatively be initialized using the
EM (Expectation-Maximization) Algorithm, an unsupervised
clustering technique for fitting data to a mixture of Gaussians.
Only the input vectors are used to determine the cluster centers. As
EM is an unsupervised clustering technique, the target vectors
(outputs) are not used or needed.
RBF Network Training
The outputs, however, are used to determine the weights connecting
the hidden-layer nodes to the output. Since the output(s) are a linear
combination of the hidden-unit activations, there are numerous linear
regression-based techniques for efficiently optimizing these weights.
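A compact sketch of this two-stage procedure (illustrative only): the centers are simply fixed at the training points with a shared, assumed variance sigma², and the output weights are then found by ordinary linear least squares:
```python
import numpy as np

def rbf_features(X, centers, sigma):
    """Gaussian kernel activations K_u(d(x_u, x)) plus a bias column for w_0."""
    d2 = np.sum((X[:, None, :] - centers[None, :, :]) ** 2, axis=2)
    K = np.exp(-d2 / (2.0 * sigma ** 2))
    return np.hstack([np.ones((X.shape[0], 1)), K])

def train_rbf(X, y, sigma=1.0):
    """Stage 1: fix centers (here, the training points) and variances.
    Stage 2: solve for the output weights by linear least squares."""
    centers = X.copy()
    Phi = rbf_features(X, centers, sigma)
    w, *_ = np.linalg.lstsq(Phi, y, rcond=None)
    return centers, w

def rbf_predict(X, centers, w, sigma=1.0):
    return rbf_features(X, centers, sigma) @ w   # f_hat(x) = w_0 + sum_u w_u K_u
```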
Locally Weighted Learning
for Robotic Control
Locally weighted learning for control utilizes Memory-Based (Lazy)
Learning methods for constructing local models from data.
A key concept is that rather than building a global model up front,
simply store all of the data.
When a query is presented, use the data near the query point to
construct a local model to answer the query. This model is then
discarded.
Locally Weighted Learning methods specify how models may be
built using Lazy Learning methods (such as LWR), but not how
they are used to construct learning controllers.
Locally Weighted Learning
for Robotic Control
First we will consider temporally independent control tasks, such
as setpoint control and vehicle trajectory following, such that
y = f(x, u) + noise
for output vector y, state vector x, and control vector u.
The control task is to choose the control u so that the outcome is the
desired output y.
Use Lazy Learning to infer a model \hat{f} which approximates f.
Locally Weighted Learning
for Inverse Models
Inverse-model-based control techniques use states and desired
outcomes to predict the control inputs necessary to achieve
the desired outcomes:
u = \hat{f}^{-1}(x, y)
Figure: learned database implementing the inverse model
Locally Weighted Learning
for Inverse Models
Pros:
• The database is “trained” by adding new points (x, u, y)
• If there is a monotonic relationship between u and y, then there
  are efficient methods for rapidly converging on the correct mapping
Cons:
• May not work if
  Ø The vector spaces of actions and outcomes are not the same
  Ø The mapping is not one-to-one
  Ø The data include misleading noisy observations
Locally Weighted Learning
for Inverse Models
The inverse model fails to accurately predict the control action for a
desired outcome on a non-monotonic function.
Locally Weighted Learning
for Forward Models
Forward-model-based control techniques use states and control
inputs to predict outcomes:
y = \hat{f}(x, u)
Figure: learned database implementing the forward model
Locally Weighted Learning
for Forward Models
Pros:
• The database is “trained” by adding new points (x, u, y)
• Allows “mental simulation,” or prediction of the effects of different
  actions
Cons:
• Requires a search of the database to find the action that corresponds
  to the desired outcome for the current state.
Combining Inverse and
Forward Models
An Inverse Model may be used to generate a good starting point
for a search of a Forward Model:
u_0 = \hat{f}^{-1}(x, y_d)
u_0 may then be used with a Lazy Forward Model:
y_0 = \hat{f}(x, u_0)
If y_0 is close to y_d, then Newton’s Method may be used to
further refine u.
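A rough sketch of this combination under simplifying assumptions (plain nearest-neighbor lookups stand in for whatever local models are actually fit, and all names are illustrative): the inverse model proposes u_0, and the lazy forward model checks how close its predicted outcome is to y_d before any further refinement:
```python
import numpy as np

# X_mem, U_mem, Y_mem: 2-D arrays, one experienced (x_i, u_i, y_i) triple per row.

def inverse_query(x, y_d, X_mem, U_mem, Y_mem):
    """Lazy inverse model: the stored (x_i, y_i) pair closest to (x, y_d) supplies u_0."""
    d = np.linalg.norm(X_mem - x, axis=1) + np.linalg.norm(Y_mem - y_d, axis=1)
    return U_mem[np.argmin(d)]

def forward_query(x, u, X_mem, U_mem, Y_mem, k=5):
    """Lazy forward model: distance-weighted average outcome of the k most similar (x_i, u_i)."""
    d = np.linalg.norm(X_mem - x, axis=1) + np.linalg.norm(U_mem - u, axis=1)
    idx = np.argsort(d)[:k]
    w = 1.0 / (d[idx] ** 2 + 1e-9)
    return (w[:, None] * Y_mem[idx]).sum(axis=0) / w.sum()

# u0 = inverse_query(x, y_d, X_mem, U_mem, Y_mem)
# y0 = forward_query(x, u0, X_mem, U_mem, Y_mem)
# If y0 is close to y_d, refine u0 further (e.g., a Newton-style correction on the forward model).
```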
Locally Weighted Learning
for Robotic Control
Next we will consider temporally dependent control tasks such that
x(t+1) = f(x(t), u(t))
The task is then to regulate (control) the state to achieve a desired
setpoint x_d, or a sequence of values to create a trajectory
x_d(1), x_d(2), x_d(3), \dots
The database is “trained” by adding triples of the form (x_t, u_t, x_{t+1}),
where x_t is the current state, u_t is the current control input, and
x_{t+1} is the next state.
Deadbeat Control
for Devil Sticking
One-step deadbeat control chooses actions to cause the immediate
next state to be the desired next state. If the next state is attainable
in one step, the action may be chosen without regard to future
states, decisions, or performance.
Atkeson et al. (1997) applied deadbeat control to learn the devil
sticking task.
First, an Inverse Model
u_t = \hat{f}^{-1}(x_t, x_{t+1})
was learned (the database was populated from sufficient exploration).
Next, given a desired state x_d, the database was queried to determine
u_t = \hat{f}^{-1}(x_t, x_d)
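A toy sketch of the deadbeat query (illustrative names and a plain nearest-neighbor lookup; the actual system used locally weighted models). Each executed step also adds a new (x_t, u_t, x_{t+1}) triple to the memory:
```python
import numpy as np

# X_mem, U_mem, Xnext_mem: 2-D arrays, one stored transition (x_t, u_t, x_{t+1}) per row.

def deadbeat_action(x_t, x_d, X_mem, U_mem, Xnext_mem):
    """One-step deadbeat control via a lazy inverse model: pick the action of the
    stored transition whose (current state, next state) best matches (x_t, x_d)."""
    d = np.linalg.norm(X_mem - x_t, axis=1) + np.linalg.norm(Xnext_mem - x_d, axis=1)
    return U_mem[np.argmin(d)]
```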
Locally Weighted Learning
for Robotic Control
Devil sticking robot, by Professor Chris Atkeson:
YouTube Link: Devil Sticking Robot Video
Locally Weighted Learning
for Robotic Control
Deadbeat Control will fail if the dynamics of the plant being
controlled require more than a single step of lookahead.
Lazy Learning may be applied to more complex control
methods (e.g., LQR, Nonlinear Optimal Control).
The use of linear regression models often (with sufficient data)
guarantees the existence of derivatives, which may be easily
calculated from the models through numerical differentiation.
Learning methods for Nonlinear Optimal Control techniques
(e.g., value and policy iteration) often fall under Reinforcement
Learning techniques and will be visited later in the course.
References
• Albus, James S., “A New Approach to Manipulator Control: The Cerebellar Model
  Articulation Controller,” Journal of Dynamic Systems, Measurement, and Control,
  p. 225, September 1975.
• Atkeson, C. G., Moore, A. W., and Schaal, S., “Locally Weighted Learning,”
  Artificial Intelligence Review, 11.1-5 (1997): 11-73.
• Atkeson, C. G., Moore, A. W., and Schaal, S., “Locally Weighted Learning for Control,”
  Artificial Intelligence Review, 11.1-5 (1997): 75-113.
• Domingos, P., “A few useful things to know about machine learning,”
  Communications of the ACM, 55.10: 78-87, 2012.
• Mitchell, T., “The Discipline of Machine Learning,” CMU-ML-06-108
  (CMU Technical Report), July 2006.
• Mitchell, T., Machine Learning, Chapter 8, McGraw-Hill Education, 1997.
Reading Assignments
• Atkeson, C. G., Moore, A. W., and Schaal, S., “Locally Weighted Learning,”
  Artificial Intelligence Review, 11.1-5 (1997): 11-73.
• Atkeson, C. G., Moore, A. W., and Schaal, S., “Locally Weighted Learning for
  Control,” Artificial Intelligence Review, 11.1-5 (1997): 75-113.
• Brooks, Rodney A., “Intelligence without representation,” Artificial Intelligence,
  47.1 (1991): 139-159.
• Arkin, Ronald, “Motor schema based navigation for a mobile robot: An approach
  to programming by behavior,” Proceedings of the 1987 IEEE International
  Conference on Robotics and Automation, Vol. 4, IEEE, 1987.
• Wahde, Mattias, and Wolff, Krister, “Behavior-Based Robotics,” 1997.
  On-line (http://www.am.chalmers.se/~wolff/AA/Chapter3.pdf)
Homework Assignment #2
Due by 4pm, 06/20
1) Program a Discrete CMAC and train it on a 1-D function (ref: Albus 1975, Fig. 5).
   Explore the effect of overlap area on generalization and time to convergence.
2) Program a Continuous CMAC by allowing partial cell overlap, and modifying
   the weight update rule accordingly. Compare the output of the Discrete CMAC
   with that of the Continuous CMAC.
3) Discuss how you might use recurrent connections to train a CMAC to output
   a desired trajectory without using time as an input (e.g., state only).