0% found this document useful (0 votes)

116 views97 pages

Convolution Neural Networks (CNN) : Ms. Anisha Mahato Assistant Professor (CSE Specialization)

Cnn ppt

Uploaded by

Shubham Goel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

116 views97 pages

Convolution Neural Networks (CNN) : Ms. Anisha Mahato Assistant Professor (CSE Specialization)

Cnn ppt

Uploaded by

Shubham Goel

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 97

Convolution Neural Networks

(CNN)

Ms. Anisha Mahato

Assistant Professor (CSE Specialization)
Computer Vision Problems
Image Classification Neural Style Transfer

Cat? (0/1)

64x64

Object detection
Deep Learning on large images
Problems
1. Too many parameters to train
2. Positional information is lost
Cat? (0/1) 3. Chance of overfitting

64 x 64 x 3

1000 x 1000 x 3
= 3 million

1000 x 1000 x 3
3 million x 1000 = 3 billion trainable weights
Edge Detection

vertical edges

horizontal edges
Vertical edge detection
3x1 + 0x0 + 1x-1 + 1x1 + 5x0 + 8x-1 + 2x1 + 7x0 + 2x-1 = -5
1 0 -1
3 0 1 2 7 4 Filter / Kernel
1 0 -1
1 5 8 9 3 1 1 0 -1
1 0 -1
2 7 2 5 1 3
* 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6 Convolution
Vertical edge detection
3x1 + 0x0 + 1x-1 + 1x1 + 5x0 + 8x-1 + 2x1 + 7x0 + 2x-1 = -5
1 0 -1
3 0 1 2 7 4 Filter / Kernel
1
1
5
0
8
-1
9 3 1 -5
1 0 -1
1 0 -1
2 7 2 5 1 3
* 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6 Convolution
Vertical edge detection

1 0 -1
3 0 1 2 7 4
1 5
1
8
0
9
-1
3 1 -5 -4
1 0 -1
1 0 -1
2 7 2 5 1 3
* 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

1 0 -1
3 0 1 2 7 4
1 5 8
1
9
0
3
-1
1 -5 -4 0
1 0 -1
1 0 -1
2 7 2 5 1 3
* 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

1 0 -1
3 0 1 2 7 4
1 5 8 9
1
3
0
1
-1 -5 -4 0 8
1 0 -1
1 0 -1
2 7 2 5 1 3
* 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1
1
5
0
8
-1
9 3 1 -5 -4 0 8
1 0 -1
2
1
7
0
2
-1
5 1 3 -10
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5
1
8
0
9
-1
3 1 -5 -4 0 8
1 0 -1
2 7
1
2
0
5
-1
1 3 -10 -2
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8
1
9
0
3
-1
1 -5 -4 0 8
1 0 -1
2 7 2
1
5
0
1
-1
3 -10 -2 2
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9
1
3
0
1
-1 -5 -4 0 8
1 0 -1
2 7 2 5
1
1
0
3
-1
-10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8
4 2 1 6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2
1
7
0
2
-1
5 1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0
4
1
2
0
1
-1
6 2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7
1
2
0
5
-1
1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2
4 2
1
1
0
6
-1
2 8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2
1
5
0
1
-1
3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4
4 2 1
1
6
0
2
-1
8 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2 5
1
1
0
3
-1
-10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4 -7
4 2 1 6
1
2
0
8
-1 1 0 -1
2 4 5 2 3 9 3x3
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2 5 1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4 -7
4
1
2
0
1
-1
6 2 8 1 0 -1
-3
3x3
1 0 -1
2 4 5 2 3 9
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2 5 1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4 -7
4 2
1
1
0
6
-1
2 8 1 0 -1
-3 -2
3x3
1 0 -1
2 4 5 2 3 9
4x4
6x6
Vertical edge detection

3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2 5 1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4 -7
4 2 1
1
6
0
2
-1
8 1 0 -1
-3 -2 -3
3x3
1 0 -1
2 4 5 2 3 9
4x4
6x6
Vertical edge detection

Feature Map
3 0 1 2 7 4
1 5 8 9 3 1 -5 -4 0 8
1 0 -1
2 7 2 5 1 3 -10 -2 2 3
1 0 -1 * 1 0 -1 =
0 1 3 1 7 8 0 -2 -4 -7
4 2 1 6
1
2
0
8
-1 1 0 -1
-3 -2 -3 -16
3x3
1 0 -1
2 4 5 2 3 9
4x4
6x6
Vertical edge detection

10 10 10 0 0 0 0 30 30 0
10 10 10 0 0 0 1 0 -1
1 0 -1 0 30 30 0
10 10 10 0 0 0 =
* 0 30 30 0
10 10 10 0 0 0 1 0 -1
10 10 10 0 0 0 0 30 30 0
10 10 10 0 0 0
Vertical edge detection
10 10 10 0 0 0
10 10 10 0 0 0 0 30 30 0
1 0 -1
10 10 10 0 0 0 0 30 30 0
10 10 10 0 0 0 * 1
1
0 -1
0 -1
= 0 30 30 0
10 10 10 0 0 0 0 30 30 0
10 10 10 0 0 0

0 0 0 10 10 10
0 0 0 10 10 10 0 -30 -30 0
1 0 -1
0 0 0 10 10 10 0 -30 -30 0
0 0 0 10 10 10 * 1
1
0 -1
0 -1
= 0 -30 -30 0
0 0 0 10 10 10 0 -30 -30 0
0 0 0 10 10 10
Horizontal edge detection
1 0 -1 1 1 1
1 0 -1 0 0 0
1 0 -1 -1 -1 -1
Vertical Horizontal
10 10 10 0 0 0
0 0 0 0
10 10 10 0 0 0 1 1 1
10 10 10
0 0 0
0 0
10 10 10
0
* 0 0 0 =
30
30
10 -10 -30
10 -10 -30
-1 -1 -1
0 0 0 10 10 10 0 0 0 0
0 0 0 10 10 10
Learning to detect edges
1 0 -1 1 0 -1 3 0 -3
1 0 -1 2 0 -2 10 0 -10
1 0 -1 1 0 -1 3 0 -3
Sobel filter Scharr filter

3 0 1 2 7 4
1 5 8 9 3 1
w1 w2 w3
2 7 2 5 1 3
0 1 3 1 7 8 * w4 w5 w6 =
w7 w8 w9
4 2 1 6 2 8
2 4 5 2 3 9
Why convolutions

• Parameter sharing: A feature detector (such as a vertical

edge detector) that’s useful in one part of the image is
probably useful in another part of the image.

• Sparsity of connections: In each layer, each output value

depends only on a small number of inputs.

• Translation invariance: Shared weights across different

spatial locations enable the network to recognize the same
pattern in various positions, reducing sensitivity to feature
location.
1 -1 -1 Filter 1 1 1
-1 1 -1 2 0
-1 -1 1 3 0
4: 0 3
1 0 0 0 0 1

…
0 1 0 0 1 0 0
0 0 1 1 0 0 8 1
1 0 0 0 1 0 9 0
0 1 0 0 1 0 10: 0

…
0 0 1 0 1 0
13 0
6 x 6 image
14 0
fewer parameters! 15 1 Only connect to 9
16 1 inputs, not fully
connected

…
1 -1 -1 1: 1
-1 1 -1 Filter 1 2: 0
-1 -1 1 3: 0
4: 0 3
1 0 0 0 0 1

…
0 1 0 0 1 0 7: 0
0 0 1 1 0 0 8: 1
1 0 0 0 1 0 9: 0 -1
0 1 0 0 1 0 10: 0

…
0 0 1 0 1 0
13: 0
6 x 6 image
14: 0
Fewer parameters 15: 1
16: 1 Shared weights
Even fewer parameters

…
Padding