ADALINE NETWORK
Proposed by Widrow & Hoff in 1960.
Stands for ADAptive LInear NEuron.
Architecturally it is similar to the perceptron network except for the
transfer function.
Adaline uses a purely linear transfer function while the perceptron uses a
hard-limiting transfer function.
Has a large number of applications in signal processing.
[Figure: Adaline network along with arrangement for training — inputs
weighted by w11 ... w1r are summed with a bias to produce the output; the
weights are set either by direct weight adjustment or by an iterative
training algorithm that compares the output with the target.]
DECISION BOUNDARY OF ADALINE N/W
Consider a 2-input, 1-output adaline network.
[Figure: 2-input adaline — inputs p1, p2 weighted by w11, w12, summed with
the bias b.]
n = w11 p1 + w12 p2 + b
a = purelin(n) = n
a = w11 p1 + w12 p2 + b
Limiting case: n = 0
w12 p2 = -w11 p1 - b
p2 = -(w11/w12) p1 - b/w12
This line is called the decision boundary.
a = 0 along the decision boundary.
How do we decide on which side the output is greater than zero?
The direction of the weight vector is the direction in which the output is
positive.
Thus the adaline has the same limitation as the perceptron: it can classify
only linearly separable patterns. However, due to its linear transfer
function it can be put to other uses.
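As a quick numerical sketch of the boundary equations above — the weights and bias here are made up purely for illustration, not taken from the text:

```python
# Hypothetical 2-input adaline; w11, w12, b are invented for illustration.
w11, w12, b = 1.0, 2.0, -2.0

def adaline_output(p1, p2):
    # a = purelin(n) = n = w11*p1 + w12*p2 + b
    return w11 * p1 + w12 * p2 + b

def boundary_p2(p1):
    # decision boundary: p2 = -(w11/w12)*p1 - b/w12
    return -(w11 / w12) * p1 - b / w12

# On the boundary the output is zero.
print(adaline_output(1.0, boundary_p2(1.0)))              # 0.0
# Stepping off the boundary in the direction of the weight
# vector (w11, w12) makes the output positive.
print(adaline_output(1.0 + w11, boundary_p2(1.0) + w12))  # 5.0
```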
TRAINING ADALINE USING LMS ALGORITHM.
A network can be considered trained if it produces outputs with acceptable
error for the given inputs.
Let X = [w1; w2; ... ; b] (the weight vector with the bias appended)
and Z = [p1; p2; ... ; 1] (the input vector with a 1 appended).
n = wT p + b
a = purelin(n) = n
In matrix notation,
a = XT Z
e = t - a = t - XT Z
Since the error may be positive or negative, we take the square of the
error:
e2 = (t - XT Z)2
Mean of the square of the errors:
E[e2] = E[(t - a)2] = E[(t - XT Z)2]
where E[·] is the statistical expectation operator.
-----------------------
Expectation of a discrete variable X: E[X] = ∑ xi p(xi)
where xi is the ith discrete value of the variable and p(xi) is the
probability of occurrence of xi.
Hence, E[e2] = e1^2 p(e1^2) + e2^2 p(e2^2) + ...
Assuming all values of e2 occur with equal probability p(e^2) = 1/n,
E[e2] = e1^2/n + e2^2/n + ... + en^2/n
Thus E[e2] is the mean of the squared errors, i.e. the mean squared error.
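As a tiny numerical check of the aside above (the error values are made up), with equal probabilities the expectation of e^2 is simply the mean of the squared errors:

```python
# Made-up errors from n = 4 equally likely patterns.
errors = [0.5, -1.0, 0.25, -0.75]

# E[e^2] with p(ei^2) = 1/n reduces to the mean of the squares.
mse = sum(e * e for e in errors) / len(errors)
print(mse)  # 0.46875
```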
------------------------
Let E[e2] = F(X), the performance function (it reflects how well the
network is performing).
F(X) = E[(t - XT Z)2]
     = E[t2 - 2t XT Z + XT Z ZT X]
     = E(t2) - 2XT E(tZ) + XT E(Z ZT) X
     = C - 2XT h + XT R X
where C = E(t2), h = E(tZ), R = E(Z ZT).
R is the input correlation matrix (a measure of the similarity of a signal
with a delayed version of the same signal).
h is the cross-correlation vector (a measure of the similarity between a
signal and a delayed version of another signal).
To bring F(X) to the standard quadratic form:
F(X) = 1/2 XT (2R) X + (-2h)T X + C
     = 1/2 XT A X + dT X + C
where A = 2R and d = -2h.
The stationary point (the point at which the gradient is zero) is found by
setting the gradient of F(X) to zero:
∇F(X) = 0
The gradient of a quadratic function is given by AX + d, so
2RX - 2h = 0
2RX = 2h
2R^-1 R X = 2R^-1 h
X = R^-1 h
where R = E(Z ZT) and h = E(tZ).
Thus if we could calculate the statistical properties R and h, the vector X
(i.e. the weights and biases) could be computed directly, without any
iterations. In general it is not convenient to calculate h and R, and we can
avoid calculating the inverse of R by using the steepest descent algorithm.
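A minimal sketch of this direct solution for a 2-dimensional X. The values of R and h below are invented just to show the algebra X = R^-1 h; they are not from the text:

```python
# Made-up statistics for a 2-dimensional X (illustration only).
R = [[2.0, 1.0],   # R = E[Z Z^T], input correlation matrix
     [1.0, 2.0]]
h = [4.0, 5.0]     # h = E[t Z], cross-correlation vector

# X = R^-1 h, inverting the 2x2 matrix directly.
det = R[0][0] * R[1][1] - R[0][1] * R[1][0]
R_inv = [[ R[1][1] / det, -R[0][1] / det],
         [-R[1][0] / det,  R[0][0] / det]]
X = [R_inv[0][0] * h[0] + R_inv[0][1] * h[1],
     R_inv[1][0] * h[0] + R_inv[1][1] * h[1]]
print(X)  # stationary point, approx [1.0, 2.0]

# Check: the gradient 2RX - 2h vanishes at the stationary point.
grad = [2 * (R[0][0] * X[0] + R[0][1] * X[1]) - 2 * h[0],
        2 * (R[1][0] * X[0] + R[1][1] * X[1]) - 2 * h[1]]
```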
WIDROW HOFF ALGORITHM FOR TRAINING ADALINE.
It is an approximate steepest descent algorithm in which the performance
index is the mean square error. The performance function to be minimized is
taken as e2(k) rather than E[e2]; the error is minimized after each
individual pattern is applied.
The (k+1)th value of the weight vector X is found from the kth value such
that F(x(k+1)) < F(x(k)), i.e. we move downhill on the surface formed by
the performance function.
x(k+1) = x(k) - α g(k) (method of steepest descent), where g(k) is the
gradient at the kth iteration.
For a 2-input network,
g(k) = ∇e2(k) = [∂e2(k)/∂w11 ; ∂e2(k)/∂w12 ; ∂e2(k)/∂b]
with ∂e2(k)/∂w1j = 2 e(k) ∂e(k)/∂w1j.
e(k) = t(k) - a(k)
e(k) = t(k) - (wT p(k) + b)
e(k) = t(k) - (∑ w1i pi(k) + b)
∂e(k)/∂w11 = 0 - p1(k) - 0 - ... - 0 = -p1(k)
In general, ∂e(k)/∂w1j = -pj(k) and ∂e(k)/∂b = -1.
g(k) = -2 e(k) [p(k) ; 1]
x(k+1) = x(k) - α g(k)
[w(k+1) ; b(k+1)] = [w(k) ; b(k)] + 2 α e(k) [p(k) ; 1]
OR
w(k+1) = w(k) + 2 α e(k) p(k)
b(k+1) = b(k) + 2 α e(k)
These two equations constitute the LMS algorithm, also called the Widrow
Hoff learning algorithm or the delta rule.
Widrow Hoff Algorithm
The performance function to be minimized is e2(k).
The minimization method used is the method of steepest descent:
x(k+1) = x(k) - α g(k)
which gives the Widrow Hoff update
w(k+1) = w(k) + 2 α e(k) p(k)
b(k+1) = b(k) + 2 α e(k)
Steps:
1. Start with small random weights & biases.
2. Apply the 1st input vector & propagate it forward to find the output.
3. Compute the error.
4. Modify weights & biases using the formulae
   w(k+1) = w(k) + 2 α e(k) p(k)
   b(k+1) = b(k) + 2 α e(k)
5. Repeat with the next input vector; stop when e(k) drops to an acceptably
   low value.
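The steps above can be sketched as follows. The training patterns and learning rate are invented for illustration; the targets follow the linear rule t = 2p1 - p2 + 1, so an adaline can fit them exactly:

```python
import random

# Made-up patterns whose targets obey t = 2*p1 - p2 + 1.
patterns = [([1.0, 0.0], 3.0), ([0.0, 1.0], 0.0),
            ([1.0, 1.0], 2.0), ([-1.0, 0.5], -1.5)]

random.seed(0)
w = [random.uniform(-0.1, 0.1), random.uniform(-0.1, 0.1)]  # step 1
b = random.uniform(-0.1, 0.1)
alpha = 0.1

for epoch in range(200):
    for p, t in patterns:
        a = sum(wi * pi for wi, pi in zip(w, p)) + b   # step 2: output
        e = t - a                                      # step 3: error
        w = [wi + 2 * alpha * e * pi for wi, pi in zip(w, p)]  # step 4
        b = b + 2 * alpha * e                          # step 4 (bias)
print(w, b)  # approaches w = [2, -1], b = 1
```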
In the classical LMS method, we first apply all the available input patterns
& find the individual errors; we then try to minimize the mean of the
squared errors. In the Widrow Hoff method we proceed in iterative fashion as
each input pattern is applied, thus avoiding the matrix inverse that
requires the statistical properties of the input vectors to be known. This
saves a large amount of labor in practical-sized problems.
PROBLEM:
I/O pairs are
p1 = [ ], t1 = 1 ; p2 = [ ], t2 = -1
Train the network using the LMS algorithm with the initial guess set to
zero & learning rate α = 0.25. Neglect the bias.
a(k) = purelin(wT(k) p(k))
w(k+1) = w(k) + 2 α e(k) p(k)
p1 is applied:
a(0) = purelin(wT(0) p1) = 0 ; t(0) = 1
e(0) = t(0) - a(0) = 1
w(1) = w(0) + 2(0.25)(1) p1
p2 is applied:
a(1) = purelin(wT(1) p2) = 0
e(1) = t(1) - a(1) = -1 - 0 = -1
w(2) = w(1) + 2(0.25)(-1) p2
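The iterations above can be reproduced in code. The input vectors did not survive in these notes, so the p1 and p2 below are hypothetical stand-ins chosen only to show the mechanics of the two updates:

```python
# Hypothetical stand-ins for the missing input vectors.
patterns = [([1.0, 1.0], 1.0), ([1.0, -1.0], -1.0)]
alpha = 0.25
w = [0.0, 0.0]  # initial guess set to zero; bias neglected

for p, t in patterns:
    a = sum(wi * pi for wi, pi in zip(w, p))  # a(k) = purelin(wT(k) p(k))
    e = t - a                                 # e(k) = t(k) - a(k)
    w = [wi + 2 * alpha * e * pi for wi, pi in zip(w, p)]
    print(w)
```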
The adaline is more widely used than the perceptron.
The major area of application of the adaline is adaptive filtering.
An adaptive filter is able to separate undesirable components from a signal
even if the undesirable components fall in the same frequency band as the
useful signal.
Adaptive filtering has the following applications:
Noise cancellation
System identification
Inverse system modeling
Prediction
Noise Cancellation
[Figure: Noise cancellation — the useful signal s(t) is corrupted by noise
f1(n(t)) that has passed through a noise path; an adaline filter driven by
the noise source n(t) produces an estimate f2(n(t)), which is subtracted
from s(t) + f1(n(t)); the error, which is the restored signal s(t), drives
the training algorithm.]
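A minimal sketch of this arrangement; the signal, the noise path f1, and the filter length are all invented for illustration. The adaline sees only the noise source n(t), learns the noise path, and the training error itself is the restored signal:

```python
import math, random

random.seed(1)
N, alpha = 2000, 0.01
s = [math.sin(0.1 * k) for k in range(N)]          # useful signal (made up)
noise = [random.uniform(-1, 1) for _ in range(N)]  # noise source n(t)
# Corrupted measurement s(t) + f1(n(t)); f1 is a made-up 2-tap path.
corrupted = [s[k] + 0.8 * noise[k] + 0.3 * noise[k - 1] for k in range(1, N)]

w = [0.0, 0.0]
restored = []
for k in range(1, N - 1):
    z = [noise[k], noise[k - 1]]               # adaline input: raw noise
    f2 = sum(wi * zi for wi, zi in zip(w, z))  # estimate of f1(n(t))
    e = corrupted[k - 1] - f2                  # error = restored signal
    w = [wi + 2 * alpha * e * zi for wi, zi in zip(w, z)]
    restored.append(e)
print(w)  # approaches the noise-path taps [0.8, 0.3]
```

Note that s(t) is uncorrelated with the noise, so minimizing the error power removes only the noise component and leaves the useful signal in the error.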
System Identification
[Figure: System identification — the same input is fed to the system to be
identified and to the adaptive filter (adaline); the system's output is the
desired signal (target), and the error between it and the output of the
adaptive filter drives the LMS algorithm.]
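A sketch of system identification with LMS; the "unknown" system below is a made-up 2-tap linear system, chosen so the adaline's weights can converge to its coefficients:

```python
import random

random.seed(2)
alpha = 0.01

def unknown_system(x_now, x_prev):
    # Made-up system to be identified: y = 1.5*x(k) - 0.5*x(k-1)
    return 1.5 * x_now - 0.5 * x_prev

x = [random.uniform(-1, 1) for _ in range(3000)]  # common input
w = [0.0, 0.0]  # adaline taps over [x(k), x(k-1)]
for k in range(1, len(x)):
    z = [x[k], x[k - 1]]
    target = unknown_system(x[k], x[k - 1])     # desired signal
    a = sum(wi * zi for wi, zi in zip(w, z))    # adaptive filter output
    e = target - a
    w = [wi + 2 * alpha * e * zi for wi, zi in zip(w, z)]
print(w)  # converges toward the system's coefficients [1.5, -0.5]
```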
Inverse System Modeling
[Figure: Inverse system modeling — the input passes through the system
whose inverse model is to be found and then into the adaptive filter
(adaline); a delayed version of the original input serves as the target,
and the error drives the training algorithm.]
Prediction
Prediction is required in many situations.
[Figure: Prediction — the past D samples of the signal (obtained through
delay elements) feed the adaptive filter (adaline), which outputs the
predicted value of the current sample; the actual value of the current
sample is the target, and the error drives the LMS algorithm.]
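A sketch of the prediction setup; the sinusoidal signal, the learning rate, and D = 2 are all invented for illustration. The adaline predicts the current sample from the past D samples, and the prediction error shrinks as it adapts:

```python
import math

alpha, D = 0.05, 2  # made-up learning rate and number of past samples
signal = [math.sin(0.3 * k) for k in range(1000)]  # made-up signal

w = [0.0] * D
errs = []
for k in range(D, len(signal)):
    past = [signal[k - 1], signal[k - 2]]               # past D samples
    predicted = sum(wi * zi for wi, zi in zip(w, past)) # filter output
    e = signal[k] - predicted      # actual current sample is the target
    w = [wi + 2 * alpha * e * zi for wi, zi in zip(w, past)]
    errs.append(abs(e))
# Prediction error shrinks as the adaline adapts.
```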