0% found this document useful (0 votes)

44 views37 pages

CNN PPT

The document discusses Convolutional Neural Networks (CNNs), explaining how they utilize convolutional layers with filters to detect patterns in images while reducing the number of parameters through shared weights and pooling layers. It highlights the importance of max pooling for subsampling and retaining essential features, leading to more efficient networks. The document also touches on the structure of CNNs and their implementation in frameworks like Keras.

Uploaded by

qwertyluzzluli

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views37 pages

CNN PPT

Uploaded by

qwertyluzzluli

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

Convolutional Neural Network

Dr. Gaurav Trivedi

Indian Institute of Technology, Guwahati
Consider learning an image:
• Some patterns are much smaller than the whole image

Can represent a small region with fewer parameters

“beak” detector
Same pattern appears in different places:
They can be compressed!
What about training a lot of such “small” detectors
and each detector must “move around”.

“upper-left
beak” detector

They can be compressed

to the same parameters.

“middle beak”
detector
A convolutional layer
A CNN is a neural network with some convolutional layers
(and some other layers). A convolutional layer has a
number of filters that does convolutional operation.

Beak detector

A filter
Convolution These are the network
parameters to be learned.

1 -1 -1
1 0 0 0 0 1 -1 1 -1 Filter 1
0 1 0 0 1 0 -1 -1 1
0 0 1 1 0 0
1 0 0 0 1 0 -1 1 -1
0 1 0 0 1 0 -1 1 -1 Filter 2
0 0 1 0 1 0 -1 1 -1

…
…
6 x 6 image
Each filter detects a
small pattern (3 x 3).
1 -1 -1
-1 1 -1
Convolution
Filter 1
-1 -1 1
stride=1

1 0 0 0 0 1 Dot
product
0 1 0 0 1 0 3 -1
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0

6 x 6 image
1 -1 -1
-1 1 -1
Convolution
Filter 1
-1 -1 1
If stride=2

1 0 0 0 0 1
0 1 0 0 1 0 3 -3
0 0 1 1 0 0
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0

6 x 6 image
1 -1 -1
-1 1 -1
Convolution
Filter 1
-1 -1 1
stride=1

1 0 0 0 0 1
0 1 0 0 1 0 3 -1 -3 -1
0 0 1 1 0 0
1 0 0 0 1 0 -3 1 0 -3
0 1 0 0 1 0
0 0 1 0 1 0 -3 -3 0 1

6 x 6 image 3 -2 -2 -1
-1 1 -1
-1 1 -1 Filter 2
Convolution -1 1 -1
stride=1
Repeat this for each filter
1 0 0 0 0 1
0 1 0 0 1 0 3 -1 -3 -1
-1 -1 -1 -1
0 0 1 1 0 0
1 0 0 0 1 0 -3 1 0 -3
-1 -1 -2 1
0 1 0 0 1 0 Feature
0 0 1 0 1 0 -3 -3 Map
0 1
-1 -1 -2 1
6 x 6 image 3 -2 -2 -1
-1 0 -4 3
Two 4 x 4 images
Forming 2 x 4 x 4 matrix
Color image: RGB 3 channels
Filter 2
11 -1-1 -1-1 -1-1 11 -1-1
1 -1 -1 -1 1 -1
-1 1 -1 -1-1 11 -1-1
-1-1 11 -1-1 Filter 1 -1 1 -1
-1-1 -1-1 11 -1-1-1 111 -1-1-1
-1 -1 1
Color image
1 0 0 0 0 1
1 0 0 0 0 1
0 11 00 00 01 00 1
0 1 0 0 1 0
0 00 11 01 00 10 0
0 0 1 1 0 0
1 00 00 10 11 00 0
1 0 0 0 1 0
0 11 00 00 01 10 0
0 1 0 0 1 0
0 00 11 00 01 10 0
0 0 1 0 1 0
0 0 1 0 1 0
Convolution v.s. Fully Connected

1 0 0 0 0 1 1 -1 -1 -1 1 -1
0 1 0 0 1 0 -1 1 -1 -1 1 -1
0 0 1 1 0 0 -1 -1 1 -1 1 -1
1 0 0 0 1 0
0 1 0 0 1 0
0 0 1 0 1 0
convolution
image

x1
1 0 0 0 0 1
0 1 0 0 1 0 x2
Fully- 0 0 1 1 0 0
1 0 0 0 1 0
connected
…
…

…
…
0 1 0 0 1 0
0 0 1 0 1 0
x36
1 -1 -1 Filter 1 1 1
-1 1 -1 2 0
-1 -1 1 3 0
4: 0 3

…
1 0 0 0 0 1
0 1 0 0 1 0 0
0 0 1 1 0 0 8 1
1 0 0 0 1 0 9 0
0 1 0 0 1 0 10: 0

…
0 0 1 0 1 0
13 0
6 x 6 image
14 0
fewer parameters! 15 1 Only connect
to 9 inputs, not
16 1 fully connected
…
1 -1 -1 1: 1
-1 1 -1 Filter 1 2: 0
-1 -1 1 3: 0
4: 0 3

…
1 0 0 0 0 1
0 1 0 0 1 0 7: 0
0 0 1 1 0 0 8: 1
1 0 0 0 1 0 9: 0 -1
0 1 0 0 1 0 10: 0

…
0 0 1 0 1 0
1 0
6 x 6 image
3: 0
14:
Fewer parameters 15: 1
Even fewer parameters 16: 1
Shared weights
…
The whole CNN
cat dog ……
Convolution

Max Pooling
Can
Fully Connected repeat
Feedforward network
Convolution many
times

Max Pooling

Flattened
Max Pooling
1 -1 -1 -1 1 -1
-1 1 -1 Filter 1 -1 1 -1 Filter 2
-1 -1 1 -1 1 -1

3 -1 -3 -1 -1 -1 -1 -1

-3 1 0 -3 -1 -1 -2 1

-3 -3 0 1 -1 -1 -2 1

3 -2 -2 -1 -1 0 -4 3
Why Pooling
• Subsampling pixels will not change the
object
bird
bird

Subsampling

We can subsample the pixels to make image smaller

fewer parameters to characterize the image
A CNN compresses a fully
connected network in two ways:
• Reducing number of connections

• Shared weights on the edges

• Max pooling further reduces the complexity

Max Pooling
New image
1 0 0 0 0 1 but smaller
0 1 0 0 1 0 Conv
3 0
0 0 1 1 0 0 -1 1
1 0 0 0 1 0
0 1 0 0 1 0 Max 3 1
0 3
0 0 1 0 1 0 Pooling
2 x 2 image
6 x 6 image
Each filter
is a channel
The whole CNN
3 0
-1 1 Convolution

3 1
0 3
Max Pooling
Can
A new image
repeat
Convolution many
Smaller than the original
times
image
The number of channels Max Pooling

is the number of filters

The whole CNN
cat dog ……
Convolution

Max Pooling

Fully Connected A new image

Feedforward network
Convolution

Max Pooling

A new image
Flattened
Flattening 3

1
3 0
-1 1 3

3 1 -1
0 3 Flattened

1 Fully Connected
Feedforward network
0

3
Only modified the network structure and
CNN in Keras input format (vector -> 3-D tensor)

input

Convolution
1 -1 -1
-1 1 -1
-1 1 -1
-1 1 -1 … There are
-1 -1 1 25 3x3
-1 1 -1 … Max Pooling
filters.
Input_shape = ( 28 , 28 , 1)

28 x 28 pixels 1: black/white, 3: RGB Convolution

3 -1 3 Max Pooling

-3 1
Only modified the network structure and
CNN in Keras input format (vector -> 3-D array)

Input
1 x 28 x 28

Convolution
How many parameters for
each filter? 9 25 x 26 x 26

Max Pooling
25 x 13 x 13

Convolution
How many parameters 225=
for each filter? 50 x 11 x 11
25x9
Max Pooling
50 x 5 x 5
Only modified the network structure and
CNN in Keras input format (vector -> 3-D array)

Input
1 x 28 x 28

Output Convolution

25 x 26 x 26
Fully connected Max Pooling
feedforward network
25 x 13 x 13

Convolution
50 x 11 x 11

Max Pooling
1250 50 x 5 x 5
Flattened
Pooling Layers
Reduce the spatial dimensions (height
and width) of the feature maps while
retaining the most important
information.

Translation invariance, meaning the CNN

becomes less sensitive to small changes in the
input image.

Make the network more efficient by reducing

the number of parameters and computations.
Types of Pooling Layers
Max Pooling:

Takes the maximum value from each region

-1 -1.33 1.3

2.1 -1.11 -3.2

1 0.55 1.56
Types of Pooling Layers
Max Pooling:

Takes the maximum value from each region

-1 -1.33 1.3

2.1 -1.11 -3.2

1 0.55 1.56
Types of Pooling Layers
Max Pooling:

Takes the maximum value from each region

-1 -1.33 1.3

2.1 -1.11 -3.2

1 0.55 1.56

Significance:

Retains the most important features (strongest activations)

Helps detect important patterns like edges and textures

Types of Pooling Layers
Average Pooling:

Computes the average of all values in the region

-1 -1.33 1.3

2.1 -1.11 -3.2

1 0.55 1.56
Types of Pooling Layers
Average Pooling:

Computes the average of all values in the region

-1 -1.33 1.3

2.1 -1.11 -3.2 = -0.014

1 0.55 1.56

Significance:

Provides a smoother representation of the feature map

Used when preserving overall intensity is important

CNN Architectures

Lenet-5 1989- Earliest One

2012-
AlexNet
Revolutionized DNN

VGGNet 2014- (Powerful)

Resnet 2015

EfficientNet 2019
CNN Architectures
Lenet-5
CNN Architectures
Lenet-5
CNN Architectures
AlexNet
CNN Architectures
AlexNet
Challenges on Hardware
Implementation
Convolutional Operator

Max Pooling

Activation Function

Data Flow Management

Memory Requirement

FPGA Constraints
THANK YOU

Deep Learning LectureCNN
No ratings yet
Deep Learning LectureCNN
28 pages
Deep Learning CNN
No ratings yet
Deep Learning CNN
26 pages
Experiment 3
No ratings yet
Experiment 3
48 pages
DL Unit-Ii
No ratings yet
DL Unit-Ii
34 pages
Lec14-15 CNN
No ratings yet
Lec14-15 CNN
40 pages
CNN Short
No ratings yet
CNN Short
61 pages
CNN
No ratings yet
CNN
37 pages
Smaller Network: CNN
No ratings yet
Smaller Network: CNN
28 pages
CNN - Convolutional Neural Network
No ratings yet
CNN - Convolutional Neural Network
33 pages
Wa0002.
No ratings yet
Wa0002.
28 pages
Lecture 3 Updated
No ratings yet
Lecture 3 Updated
56 pages
Deep Learning 2017 Lecture5CNN
No ratings yet
Deep Learning 2017 Lecture5CNN
30 pages
Unit 3 CNN
No ratings yet
Unit 3 CNN
47 pages
DL Mod3
No ratings yet
DL Mod3
102 pages
Sarma CNN Vce Oct 2022
No ratings yet
Sarma CNN Vce Oct 2022
63 pages
Lesson 6 Convolutional Neural Network
No ratings yet
Lesson 6 Convolutional Neural Network
43 pages
Deep Learning CNN
100% (1)
Deep Learning CNN
28 pages
Unit - 5
No ratings yet
Unit - 5
47 pages
Ee046746 Tut 03 04 Convolutional Neural Networks
No ratings yet
Ee046746 Tut 03 04 Convolutional Neural Networks
26 pages
Unit III
No ratings yet
Unit III
8 pages
Unit3 2023 NNDL
No ratings yet
Unit3 2023 NNDL
69 pages
Unit 4a - Convolutional Neural Networks
No ratings yet
Unit 4a - Convolutional Neural Networks
107 pages
Syllabus
No ratings yet
Syllabus
29 pages
DL Mod 3
No ratings yet
DL Mod 3
65 pages
Unec 1700728516
No ratings yet
Unec 1700728516
105 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
Convolutional Neural Networks (CNN) : Convolutions
No ratings yet
Convolutional Neural Networks (CNN) : Convolutions
17 pages
CNN CV PT2
No ratings yet
CNN CV PT2
34 pages
CS601 Machine Learning Unit 3
No ratings yet
CS601 Machine Learning Unit 3
47 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
61 pages
ML Lec 13 CNN
No ratings yet
ML Lec 13 CNN
44 pages
Artificial Intelligence
No ratings yet
Artificial Intelligence
11 pages
Module-4 DL
No ratings yet
Module-4 DL
22 pages
Unit Iii Deep Learning
No ratings yet
Unit Iii Deep Learning
31 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
GCET DL Unit-3 CNN
No ratings yet
GCET DL Unit-3 CNN
114 pages
Convolutional Neural Networks - Part 2
No ratings yet
Convolutional Neural Networks - Part 2
49 pages
Unit 2 Part 02
No ratings yet
Unit 2 Part 02
37 pages
CPCS432 Lecture 5 Deep Learning and Artificial Neural Networks Techniques in Computer Vision
No ratings yet
CPCS432 Lecture 5 Deep Learning and Artificial Neural Networks Techniques in Computer Vision
57 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
35 pages
DL Unit-3
No ratings yet
DL Unit-3
70 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
98 pages
3.convolutional Networks and Sequence Modeling
No ratings yet
3.convolutional Networks and Sequence Modeling
19 pages
Module2 1
No ratings yet
Module2 1
27 pages
A Convolutional Neural Network
No ratings yet
A Convolutional Neural Network
6 pages
Chap8 CNN
No ratings yet
Chap8 CNN
48 pages
What Should You Consider or Pay Attention To When Preparing A Data Set
No ratings yet
What Should You Consider or Pay Attention To When Preparing A Data Set
7 pages
Lecture 6
No ratings yet
Lecture 6
17 pages
CNN Explanation
No ratings yet
CNN Explanation
21 pages
Revision Questions - Lecture 3
No ratings yet
Revision Questions - Lecture 3
5 pages
Session02 - CNN
No ratings yet
Session02 - CNN
52 pages
3 Ann
No ratings yet
3 Ann
61 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
DeepLearning Unit-II
No ratings yet
DeepLearning Unit-II
48 pages
CNN Module2
No ratings yet
CNN Module2
11 pages
CNN Architecture
No ratings yet
CNN Architecture
24 pages
Module 3
No ratings yet
Module 3
46 pages
Program For Circular Queue Implementation Through Array
No ratings yet
Program For Circular Queue Implementation Through Array
13 pages
21cs502 - Artificial Intelligence Unit 2
No ratings yet
21cs502 - Artificial Intelligence Unit 2
37 pages
A. P Receiver Function Analysis: Pwaveqn Program by Ammon
No ratings yet
A. P Receiver Function Analysis: Pwaveqn Program by Ammon
2 pages
SparseGPT - Massive Language Models Can Be Accurately Pruned in One-Shot
No ratings yet
SparseGPT - Massive Language Models Can Be Accurately Pruned in One-Shot
15 pages
Cryptography CS 555: Department of Computer Sciences Purdue University
No ratings yet
Cryptography CS 555: Department of Computer Sciences Purdue University
45 pages
Image Processing Basics
No ratings yet
Image Processing Basics
17 pages
Design of The Deadbeat Controller With Limited Output: L. Balasevicius, G. Dervinis
No ratings yet
Design of The Deadbeat Controller With Limited Output: L. Balasevicius, G. Dervinis
4 pages
Dynammic Programming, ALS Problem
No ratings yet
Dynammic Programming, ALS Problem
22 pages
EE370 Digital Electronics: L12: Logic Synthesis - Part-2
No ratings yet
EE370 Digital Electronics: L12: Logic Synthesis - Part-2
24 pages
Data Structures
No ratings yet
Data Structures
10 pages
Biomedical Image Processing Exam
No ratings yet
Biomedical Image Processing Exam
3 pages
Math 7 Summative Test
No ratings yet
Math 7 Summative Test
2 pages
Lecture 5
No ratings yet
Lecture 5
16 pages
2d Sampling
No ratings yet
2d Sampling
5 pages
5 Bankers Algorithm Program
No ratings yet
5 Bankers Algorithm Program
4 pages
Newton Raphson & Successive Approximation Methods
No ratings yet
Newton Raphson & Successive Approximation Methods
2 pages
Matlab Script File
No ratings yet
Matlab Script File
8 pages
BT4395 RR Final
No ratings yet
BT4395 RR Final
32 pages
Lesson 4 Solving Quadratic Equations by Completing The Square
No ratings yet
Lesson 4 Solving Quadratic Equations by Completing The Square
15 pages
Operations On Array
No ratings yet
Operations On Array
9 pages
Research Survey On Support Vector Machine
No ratings yet
Research Survey On Support Vector Machine
9 pages
Daa 1mark Questions and Answers
No ratings yet
Daa 1mark Questions and Answers
12 pages
CMP 202 (Recursion)
No ratings yet
CMP 202 (Recursion)
13 pages
TCS NQT Coding Sheet - TCS Coding Questions - Updated 2022
No ratings yet
TCS NQT Coding Sheet - TCS Coding Questions - Updated 2022
8 pages
DSP Cen352 Filterdesign
No ratings yet
DSP Cen352 Filterdesign
43 pages
Eqs Explained
No ratings yet
Eqs Explained
4 pages
Architecture: Simple Neural Nets For Pattern Classification
No ratings yet
Architecture: Simple Neural Nets For Pattern Classification
15 pages
DNN Hongyo2019
No ratings yet
DNN Hongyo2019
3 pages
03 Fourier Representation of Signals and LTI Systems Part03 Annotated G1
No ratings yet
03 Fourier Representation of Signals and LTI Systems Part03 Annotated G1
145 pages
Multirate Filters An Overview Sampling Rate Conversion, Decimation, Interpolation
No ratings yet
Multirate Filters An Overview Sampling Rate Conversion, Decimation, Interpolation
4 pages

CNN PPT

Uploaded by

CNN PPT

Uploaded by

Convolutional Neural Network

Dr. Gaurav Trivedi

Can represent a small region with fewer parameters

They can be compressed

We can subsample the pixels to make image smaller

• Shared weights on the edges

• Max pooling further reduces the complexity

is the number of filters

Fully Connected A new image

28 x 28 pixels 1: black/white, 3: RGB Convolution

Translation invariance, meaning the CNN

Make the network more efficient by reducing

Takes the maximum value from each region

2.1 -1.11 -3.2

Takes the maximum value from each region

2.1 -1.11 -3.2

Takes the maximum value from each region

2.1 -1.11 -3.2

Retains the most important features (strongest activations)

Helps detect important patterns like edges and textures

Computes the average of all values in the region

2.1 -1.11 -3.2

Computes the average of all values in the region

2.1 -1.11 -3.2 = -0.014

Provides a smoother representation of the feature map

Used when preserving overall intensity is important

Lenet-5 1989- Earliest One

VGGNet 2014- (Powerful)

Data Flow Management

You might also like