Polynomial Curve Fitting
                        Polynomial Curve Fitting
                            A Simple Regression Problem
• We observe a real-valued input variable x and we wish to use this observation to
  predict the value of a real-valued target variable t.
• We use synthetically generated data from the function sin(2πx) with random noise
  included in the target values.
    – A small level of random noise having a Gaussian distribution
• We have a training set comprising N observations of x, written x ≡ (x1, . . . , xN)T,
  together with corresponding observations of the values of t, denoted t ≡ (t1, . . . , tN)T.
• Our goal is to predict the value of t for some new value of x.
                        Polynomial Curve Fitting
• A training data set of N = 10 points (blue circles).
• The green curve shows the actual
  function sin(2πx) used to generate the
  data.
• Our goal is to predict the value of t for
  some new value of x, without
  knowledge of the green curve.
                       Polynomial Curve Fitting
• We try to fit the data using a polynomial function of the form
    y(x, w) = w0 + w1 x + w2 x^2 + … + wM x^M = Σj wj x^j   (j = 0, …, M)
  where M is the order of the polynomial and the coefficients w0, …, wM are collectively denoted by the vector w.
                      Polynomial Curve Fitting
• The values of the coefficients will be
  determined by fitting the polynomial to
  the training data.
• This can be done by minimizing an
  error function that measures the misfit
  between the function y(x,w), for any
  given value of w, and the training set
  data points.
• Error Function: the sum of the squares of the errors between the predictions y(xn, w) for
  each data point xn and the corresponding target values tn,
    E(w) = (1/2) Σn ( y(xn, w) − tn )^2   (sum over the N training points)
                        Polynomial Curve Fitting
• We can solve the curve fitting problem by choosing the value of w for which E(w) is
  as small as possible.
• Since the error function is a quadratic function of the coefficients w, its derivatives
  with respect to the coefficients will be linear in the elements of w, and so the
  minimization of the error function has a unique solution, denoted by w*.
• The resulting polynomial is given by the function y(x,w*).
• Choosing the order M of the polynomial → model selection.
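• A minimal NumPy sketch of this least-squares fit, for illustration (the toy data, the noise level, and the helper names fit_polynomial and predict are assumptions for the example, not part of the slides):

    import numpy as np

    # Toy data: N = 10 noisy samples of sin(2*pi*x), as in the slides.
    rng = np.random.default_rng(0)
    x = np.linspace(0.0, 1.0, 10)
    t = np.sin(2 * np.pi * x) + rng.normal(scale=0.3, size=x.shape)

    def fit_polynomial(x, t, M):
        """Return w* minimizing E(w) = 1/2 * sum_n (y(x_n, w) - t_n)^2."""
        # Design matrix with columns x^0, x^1, ..., x^M.
        Phi = np.vander(x, M + 1, increasing=True)
        w, *_ = np.linalg.lstsq(Phi, t, rcond=None)
        return w

    def predict(x, w):
        """Evaluate y(x, w) = sum_j w_j * x^j."""
        return np.vander(np.atleast_1d(x), len(w), increasing=True) @ w

    w_star = fit_polynomial(x, t, M=3)   # cubic fit, as in the M = 3 slide
    print(predict(0.5, w_star))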
0th Order Polynomial
1st Order Polynomial
3rd Order Polynomial
9th Order Polynomial
                        Polynomial Curve Fitting
• The 0th order (M=0) and first order (M=1) polynomials give rather poor fits to the data
  and consequently rather poor representations of the function sin(2πx).
• The third order (M=3) polynomial seems to give the best fit to the function sin(2πx)
  among the examples shown.
• When we go to a much higher order polynomial (M=9), we obtain an excellent fit to
  the training data.
    – In fact, the polynomial passes exactly through each data point and E(w*) = 0.
                        Polynomial Curve Fitting
• We obtain an excellent fit to the training data with the 9th order (M=9) polynomial.
• However, the fitted curve oscillates wildly and gives a very poor representation of the
  function sin(2πx).
• This behaviour is known as over-fitting.
                         Polynomial Curve Fitting
                                         Over-fitting
• We can then evaluate the residual value of E(w*) for the training data, and we can also
  evaluate E(w*) for the test data set.
• Root-Mean-Square (RMS) Error:
    E_RMS = sqrt( 2 E(w*) / N )
    – in which the division by N allows us to compare different sizes of data sets, and the square
      root ensures that E_RMS is measured on the same scale (and in the same units) as the target variable t.
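• A small sketch of this quantity (the name rms_error is illustrative; the commented usage reuses predict and w_star from the earlier sketch):

    import numpy as np

    def rms_error(y_pred, t):
        """E_RMS = sqrt(2 * E(w*) / N) with E(w) = 1/2 * sum_n (y_n - t_n)^2."""
        e = 0.5 * np.sum((np.asarray(y_pred) - np.asarray(t)) ** 2)
        return np.sqrt(2.0 * e / len(t))

    # Example: compare training and test error of a fitted polynomial.
    # rms_error(predict(x_train, w_star), t_train)
    # rms_error(predict(x_test, w_star), t_test)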
                       Polynomial Curve Fitting
                              Polynomial Coefficients
• Magnitude of coefficients increases dramatically as order of polynomial increases.
• Large positive and negative values so that the corresponding polynomial function
  matches each of the data points exactly, but between data points the function exhibits
  large oscillations → over-fitting
                       Polynomial Curve Fitting
• Increasing the size of the data set reduces the over-fitting problem.
• 9th Order Polynomial.
                       Polynomial Curve Fitting
                                    Regularization
• We may wish to use relatively complex and flexible models with data sets of limited
  size.
• The over-fitting phenomenon can be controlled with regularization, which involves
  adding a penalty term to the error function.
• Regularization: penalize large coefficient values by minimizing
    Ẽ(w) = (1/2) Σn ( y(xn, w) − tn )^2 + (λ/2) ‖w‖^2,   where ‖w‖^2 = w0^2 + w1^2 + … + wM^2
• The coefficient λ governs the relative importance of the regularization term compared
  with the sum-of-squares error term.
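• A sketch of the corresponding closed-form fit (ridge-style regularized least squares; the name fit_polynomial_regularized is illustrative):

    import numpy as np

    def fit_polynomial_regularized(x, t, M, lam):
        """Minimize 1/2 * sum_n (y(x_n, w) - t_n)^2 + lam/2 * ||w||^2."""
        Phi = np.vander(x, M + 1, increasing=True)   # columns x^0 ... x^M
        A = lam * np.eye(M + 1) + Phi.T @ Phi        # regularized normal equations
        return np.linalg.solve(A, Phi.T @ t)

    # lam = 0 recovers the unregularized fit; large lam shrinks the coefficients.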
                       Polynomial Curve Fitting
                                     Regularization
• Plots of M = 9 polynomials fitted to the data set using the regularized error function:
  (left) no regularization (λ = 0);   (right) too much regularization
                     Polynomial Curve Fitting
                             Regularization
• Polynomial Coefficients
• Graph of the root-mean-square error versus ln λ for the M = 9 polynomial.
Linear Basis Function Models
                  Linear Basis Function Models
• Generally,
    y(x, w) = Σj wj 𝜙j(x)   (j = 0, …, M−1),   with 𝜙0(x) = 1 so that w0 acts as a bias,
  where the 𝜙j(x) are known as basis functions.
                   Linear Basis Function Models
                                 Linear Regression
• The simplest linear model for regression is one that involves a linear combination of
  the input variables:
    y(x, w) = w0 + w1 x1 + … + wD xD
• It is often simply known as linear regression.
• In the basis-function notation this corresponds to
    𝜙0(x) = 1   (a constant dummy input, so that w0 is the bias)
    𝜙j(x) = xj   for j > 0
                Linear Basis Function Models
           Polynomial Curve Fitting: Polynomial basis functions
• 𝜙j(x) = x^j
                   Linear Basis Function Models
                              Gaussian basis functions
    𝜙j(x) = exp( −(x − μj)^2 / (2 s^2) )
where the μj govern the locations of the basis functions in input space, and the parameter s
governs their spatial scale.
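• A sketch of how such a design matrix can be built (the centres and the scale below are arbitrary choices for illustration):

    import numpy as np

    def gaussian_basis(x, mus, s):
        """phi_j(x) = exp(-(x - mu_j)^2 / (2 s^2)), one column per centre mu_j."""
        x = np.atleast_1d(x)[:, None]      # shape (N, 1)
        mus = np.asarray(mus)[None, :]     # shape (1, M)
        return np.exp(-(x - mus) ** 2 / (2.0 * s ** 2))

    # Nine basis functions with centres spread over [0, 1] and scale s = 0.1.
    Phi = gaussian_basis(np.linspace(0, 1, 50), np.linspace(0, 1, 9), s=0.1)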
                    Linear Basis Function Models
                              Sigmoidal basis functions
    𝜙j(x) = σ( (x − μj) / s )
where σ(a) is the logistic sigmoid function defined by
    σ(a) = 1 / (1 + exp(−a))
• Equivalently, we can use the ‘tanh’ function, because it is related to the logistic sigmoid by
    tanh(a) = 2σ(2a) − 1
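• A quick numerical check of this identity (illustrative only):

    import numpy as np

    def sigmoid(a):
        return 1.0 / (1.0 + np.exp(-a))

    a = np.linspace(-5, 5, 101)
    # tanh(a) = 2*sigma(2a) - 1, so tanh basis functions differ from logistic
    # ones only by a linear rescaling of the weights and the bias.
    print(np.allclose(np.tanh(a), 2.0 * sigmoid(2.0 * a) - 1.0))   # True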
INSTANCE-BASED LEARNING
                  INSTANCE-BASED LEARNING
• Instance-based learning methods simply store the training examples instead of learning
  an explicit description of the target function.
   – Generalizing the examples is postponed until a new instance must be classified.
   – When a new instance is encountered, its relationship to the stored examples is
      examined in order to assign a target function value for the new instance.
• Instance-based learning includes nearest neighbor, locally weighted regression and
  case-based reasoning methods.
• Instance-based methods are sometimes referred to as lazy learning methods because
  they delay processing until a new instance must be classified.
• A key advantage of lazy learning is that instead of estimating the target function once
  for the entire instance space, these methods can estimate it locally and differently for
  each new instance to be classified.
                        k-Nearest Neighbor Learning
• The k-Nearest Neighbor Learning algorithm assumes all instances correspond to points in
  the n-dimensional space R^n.
• The nearest neighbors of an instance are defined in terms of Euclidean distance.
• The Euclidean distance between the instances xi = <xi1, …, xin> and xj = <xj1, …, xjn> is:
    d(xi, xj) = sqrt( Σr (xir − xjr)^2 )   (sum over r = 1, …, n)
• For a given query instance xq, f(xq) is estimated from the function values of the k nearest
  neighbors of xq.
                    k-Nearest Neighbor Learning
• Store all training examples <xi,f(xi)>
• Calculate f(xq) for a given query instance xq using k-nearest neighbor
• Nearest neighbor: (k=1)
   – Locate the nearest training example xn, and estimate f(xq) as f(xq) ← f(xn)
• k-Nearest neighbor:
   – Locate the k nearest training examples, and estimate f(xq) as follows:
   – If the target function is real-valued, take the mean of the f-values of the k nearest neighbors:
        f(xq) = (1/k) Σi f(xi)   (sum over the k nearest neighbors xi)
   – If the target function is discrete-valued, take a vote among the f-values of the k nearest
     neighbors.
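• A compact sketch of this procedure (the name knn_predict and the array-based interface are assumptions for the example):

    import numpy as np
    from collections import Counter

    def knn_predict(X_train, y_train, x_query, k, discrete=True):
        """k-nearest-neighbor estimate of f(x_query) under Euclidean distance."""
        d = np.sqrt(np.sum((np.asarray(X_train) - np.asarray(x_query)) ** 2, axis=1))
        nearest = np.argsort(d)[:k]
        if discrete:
            # Discrete-valued target: majority vote among the k nearest neighbors.
            return Counter(np.asarray(y_train)[nearest]).most_common(1)[0][0]
        # Real-valued target: mean of the k nearest f-values.
        return float(np.mean(np.asarray(y_train)[nearest]))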
             When To Consider Nearest Neighbor
• Instances map to points in R^n
• Less than 20 attributes per instance
• Lots of training data
• Advantages
   – Training is very fast
   – Learn complex target functions
   – Can handle noisy data
   – Does not lose any information
• Disadvantages
   – Slow at query time
   – Easily fooled by irrelevant attributes
Distance-Weighted kNN
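• A common refinement of kNN is to weight the contribution of each of the k nearest neighbors by
  the inverse of its distance to the query instance xq (for example wi = 1/d(xq, xi)), so that closer
  neighbors have a larger influence on the vote or on the average; the distance-weighted example
  below uses exactly this kind of weighting.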
       k-Nearest Neighbor Classification - Example
  3-Nearest Neighbor Classification of instance <3,3>

    A   B   Class   Squared distance to <3,3>
    1   1   no             8
    2   1   no             5
    3   2   yes            1
    7   7   yes           32
    8   8   yes           50
• The first three examples are the 3 nearest neighbors of instance <3,3>.
• Two of them are no and one of them is yes.
• Since the majority class among its neighbors is no, the classification of instance <3,3> is no.
  Distance Weighted kNN Classification- Example
  Distance-Weighted 3-Nearest Neighbor Classification of instance <3,3>

    A   B   Class   Squared distance to <3,3>
    1   1   no             8
    2   1   no             5
    3   2   yes            1
    7   7   yes           32
    8   8   yes           50
• The first three examples are the 3 nearest neighbors of instance <3,3>.
• Weight of no = 1/8 + 1/5 = 13/40          Weight of yes = 1/1 = 1
• Since 1 > 13/40, the classification of instance <3,3> is yes.
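• A small sketch reproducing the arithmetic of this example (squared Euclidean distances, as in the table, and 1/distance weights):

    from collections import defaultdict

    # Training examples (A, B) -> class, and the query <3,3>, from the table above.
    train = [((1, 1), "no"), ((2, 1), "no"), ((3, 2), "yes"),
             ((7, 7), "yes"), ((8, 8), "yes")]
    query = (3, 3)

    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))

    # Three nearest neighbors, each weighted by the inverse of its distance.
    neighbours = sorted(train, key=lambda ex: sq_dist(ex[0], query))[:3]
    votes = defaultdict(float)
    for x, label in neighbours:
        votes[label] += 1.0 / sq_dist(x, query)

    print(dict(votes))                 # {'yes': 1.0, 'no': 0.325}
    print(max(votes, key=votes.get))   # yes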
         k-Nearest Neighbor Classification - Issues
• Choosing the value of k:
   – If k is too small, sensitive to noise points
   – If k is too large, neighborhood may
     include points from other classes.
• Scaling issues:
   – Attributes may have to be scaled to
     prevent distance measures from being
     dominated by one of the attributes
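• One common choice is min-max scaling of each attribute to [0, 1]; a small sketch (the helper name min_max_scale is illustrative):

    import numpy as np

    def min_max_scale(X):
        """Rescale each attribute to [0, 1] so that no single attribute
        dominates the Euclidean distance."""
        X = np.asarray(X, dtype=float)
        lo, hi = X.min(axis=0), X.max(axis=0)
        return (X - lo) / np.where(hi > lo, hi - lo, 1.0)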
Curse of Dimensionality
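• With many attributes, Euclidean distances are dominated by the large number of (possibly
  irrelevant) attributes, so the "nearest" neighbors in the full space may not be near at all with
  respect to the few attributes that actually determine the target; this is the curse of
  dimensionality that makes kNN easily fooled by irrelevant attributes.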
                     Locally Weighted Regression
• kNN forms a local approximation to f for each query point xq.
• Why not form an explicit approximation f(x) for the region surrounding xq?
   → Locally Weighted Regression
• Locally weighted regression uses nearby or distance-weighted training examples to
  form this local approximation to f.
• We might approximate the target function in the neighborhood surrounding x using a
  linear function, a quadratic function, or a multilayer neural network.
• The phrase "locally weighted regression" is so called because it is
    – local because the function is approximated based only on data near the query point,
    – weighted because the contribution of each training example is weighted by its distance
      from the query point, and
    – regression because this is the term used widely in the statistical learning community for the
      problem of approximating real-valued functions.
                    Locally Weighted Regression
• Given a new query instance xq, the general approach in locally weighted regression
  is to construct an approximation f that fits the training examples in the
  neighborhood surrounding xq.
• This approximation is then used to calculate the value f(xq), which is output as the
  estimated target value for the query instance.
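• A sketch of locally weighted linear regression with a Gaussian distance kernel (the kernel choice, the bandwidth tau, and the function name are illustrative assumptions):

    import numpy as np

    def locally_weighted_regression(X_train, y_train, x_query, tau=0.5):
        """X_train: (N, D) array; y_train: (N,) array; x_query: (D,) array.
        Fit a linear approximation around x_query, weighting each training
        example by a Gaussian kernel of its distance to the query."""
        X_train = np.asarray(X_train, dtype=float)
        y_train = np.asarray(y_train, dtype=float)
        x_query = np.asarray(x_query, dtype=float)
        X = np.column_stack([np.ones(len(X_train)), X_train])   # add bias column
        xq = np.concatenate(([1.0], x_query))
        # Kernel weights: nearby examples count more than distant ones.
        k = np.exp(-np.sum((X_train - x_query) ** 2, axis=1) / (2.0 * tau ** 2))
        # Weighted least squares: minimize sum_n k_n * (w @ x_n - y_n)^2.
        w = np.linalg.solve(X.T @ (k[:, None] * X), X.T @ (k * y_train))
        return float(xq @ w)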
                           Case-based reasoning
• Instance-based methods
   – lazy
   – classification based on classifications of near (similar) instances
   – data: points in n-dim. space
• Case-based reasoning
   – as above, but data represented in symbolic form
    – new distance metrics required
                        Lazy & eager learning
• Lazy: generalize at query time
   – kNN, CBR
• Eager: generalize before seeing query
   – regression, ANN, ID3, …
• Difference
   – eager learning must create a single global approximation
   – lazy learning can create many local approximations
   – lazy learning can represent more complex functions using the same hypothesis space H
     (e.g., H = linear functions)