Backpropagation
Fatemeh Seyyedsalehi
                                  Sharif University of Technology
                                         Spring 2025
Constructing the MLP
      MLPs are capable of representing any function
      But how do we construct it?
          ▶   I.e., how do we determine the weights (and biases) of the network to
              best represent a target function?
          ▶   Assuming that the architecture of the network is given
      By minimizing expected error
                    W = \arg\min_W \int_X e(f(X; W), t(X))\, p(X)\, dX
                      = \arg\min_W \mathbb{E}\big[e(f(X; W), t(X))\big]
Estimating the True Function
      The true function t(x) is unknown, so sample it
          ▶   Basically, get input-output pairs for a number of samples of input
           ▶   i.e., prepare the training dataset
      Estimate the function from the samples
      The empirical estimate of the expected error is the average error over
      the samples:
                          \mathbb{E}\big[e(f(X; W), t(X))\big] \approx \frac{1}{T} \sum_{i=1}^{T} e(f(X_i; W), y_i)
      We can hope that minimizing the empirical loss will minimize the true
      loss
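
A minimal sketch of computing this empirical estimate (my own illustration, not from the slides): the toy linear "network" f, the squared-error divergence e, and the synthetic dataset are all placeholder assumptions.

```python
import numpy as np

# Hypothetical stand-ins: a tiny "network" f(X; W) and a squared-error divergence e.
def f(X, W):
    return X @ W                       # placeholder linear model

def e(pred, target):
    return (pred - target) ** 2        # placeholder per-sample error

# Toy training set (X_i, y_i), i = 1..T
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
W_true = np.array([1.0, -2.0, 0.5])
y = X @ W_true + 0.1 * rng.normal(size=100)

W = rng.normal(size=3)                     # candidate parameters
empirical_loss = np.mean(e(f(X, W), y))    # (1/T) * sum_i e(f(X_i; W), y_i)
print(empirical_loss)
```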
Training Models and Loss Functions
      We seek parameters that produce the best possible mapping from
      input to output for the task at hand.
      A loss function or cost function returns a single number describing
      the mismatch between:
          ▶   Model predictions f (X ; W )
          ▶   Ground-truth outputs yi
      We shifted perspective to think of neural networks as computing
      probability distributions p(y | x, W) over the output space.
          ▶   This led to a principled approach for building loss functions.
          ▶   Maximizing the likelihood of the observed data under these
              distributions.
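
To make the connection concrete, here is a hedged sketch (my own illustration, with made-up probabilities) showing that maximizing the likelihood of i.i.d. samples is the same as minimizing the sum of negative log-probabilities, which is the loss form used in the examples that follow.

```python
import numpy as np

# Assumed per-sample predicted probabilities p(y_i | x_i, W) for the observed labels.
probs = np.array([0.9, 0.8, 0.95, 0.7])

likelihood = np.prod(probs)                  # product over i.i.d. samples
neg_log_likelihood = -np.sum(np.log(probs))  # the corresponding loss L[W]

# Maximizing the likelihood is equivalent to minimizing the negative log-likelihood.
assert np.isclose(neg_log_likelihood, -np.log(likelihood))
```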
Example 1: Univariate Regression
      The loss function is given by:
                                     L[W] = -\sum_{i=1}^{N} \log p(y_i \mid x_i, W)
      Taking the conditional probability to be a normal distribution, we
      have:
             \arg\min_W \left[ -\sum_{i=1}^{N} \log \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left(-\frac{(y_i - f(x_i; W))^2}{2\sigma^2}\right) \right]
                  = \arg\min_W \left[ \sum_{i=1}^{N} (y_i - f(x_i; W))^2 \right]
      Least squares!
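
A small numerical check of this derivation (my own sketch; the targets, predictions, and σ are made up): with σ fixed, the Gaussian negative log-likelihood equals the sum of squared errors up to a constant offset and a positive scale, so both objectives share the same minimizer.

```python
import numpy as np

y = np.array([1.2, -0.3, 0.8])         # targets y_i
pred = np.array([1.0, 0.0, 1.0])       # network outputs f(x_i; W)
sigma = 1.0                            # fixed noise standard deviation

# Gaussian negative log-likelihood, term by term
nll = np.sum(0.5 * np.log(2 * np.pi * sigma**2)
             + (y - pred) ** 2 / (2 * sigma**2))

sse = np.sum((y - pred) ** 2)          # least-squares objective

# nll = const + sse / (2 sigma^2): same argmin over the predictions
const = len(y) * 0.5 * np.log(2 * np.pi * sigma**2)
assert np.isclose(nll, const + sse / (2 * sigma**2))
```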
Example 2: Binary Classification
      The Bernoulli distribution is a suitable probability distribution over
      the domain of such predictions, y ∈ {0, 1}:
                                          p(y \mid \lambda) = (1 - \lambda)^{1-y} \cdot \lambda^{y}
      The neural network can be trained to predict the parameter λ.
                 L[W] = -\sum_{i=1}^{N} \Big((1 - y_i) \log[1 - f(x_i; W)] + y_i \log[f(x_i; W)]\Big)
      Binary cross-entropy loss!
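
A short sketch of the resulting loss (illustrative, with made-up labels and predictions): the Bernoulli negative log-likelihood of labels y_i ∈ {0, 1} under predicted parameters λ_i = f(x_i; W) is exactly the binary cross-entropy.

```python
import numpy as np

y = np.array([1, 0, 1, 1])               # ground-truth labels y_i
lam = np.array([0.9, 0.2, 0.7, 0.6])     # predicted Bernoulli parameters f(x_i; W)

# Negative log-likelihood of the Bernoulli model == binary cross-entropy
bce = -np.sum((1 - y) * np.log(1 - lam) + y * np.log(lam))
print(bce)
```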
Example 3: Multiclass Classification
      The categorical distribution is a suitable one for this domain:
      y ∈ {1, 2, ..., k}
      The neural network should predict k parameters λ_1, ..., λ_k ∈ [0, 1]
      that sum to 1.
      Usually we use the Softmax function in this situation:
                                    \text{Softmax}(z_i) = \frac{e^{z_i}}{\sum_{j=1}^{n} e^{z_j}}
      where the z_j are the outputs of the network.
      Multiclass cross-entropy loss!
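
A minimal sketch (illustrative logits and label, not from the slides) of the softmax and the resulting multiclass cross-entropy loss for a single sample with true class y_i.

```python
import numpy as np

z = np.array([2.0, 0.5, -1.0])     # network outputs z_j (logits), k = 3 classes
y = 0                              # true class index for this sample

# Softmax: exponentiate and normalize so the outputs lie in [0, 1] and sum to 1
lam = np.exp(z - z.max()) / np.sum(np.exp(z - z.max()))   # shift for numerical stability
assert np.isclose(lam.sum(), 1.0)

# Multiclass cross-entropy: negative log-probability of the true class
loss = -np.log(lam[y])
print(lam, loss)
```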
Our Main Problem
The Learning Algorithm
      Searching in the hypothesis space
      Next: a course on optimization and how to do it in neural networks.
      The following slides are selected from the CMU 11-785 Deep Learning
      course.