0% found this document useful (0 votes)

43 views89 pages

9 CNN-1

Uploaded by

8varlock

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views89 pages

9 CNN-1

Uploaded by

8varlock

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 89

Lecture 9 -

Convolutional
Neural Networks
I2DL: Prof. Dai 1
Fully Connected Neural Network
Width

Depth
I2DL: Prof. Dai 2
Problems using FC Layers on Images
• How to process a tiny image with FC layers

5 weights

5
3 3 neuron layer

I2DL: Prof. Dai 3

Problems using FC Layers on Images
• How to process a tiny image with FC layers

25 weights
For the whole 5 × 5
image on 1
5 channel

5
3 3 neuron layer

I2DL: Prof. Dai 4

Problems using FC Layers on Images
• How to process a tiny image with FC layers

75 weights
For the whole 5 × 5
image on the 3
5 channel

5
3 3 neuron layer

I2DL: Prof. Dai 5

Problems using FC Layers on Images
• How to process a tiny image with FC layers

75 weights
For the whole
5 × 5 image on
75 weights the three
5 channels per
neuron
75 weights
5
3 3 neuron layer

I2DL: Prof. Dai 6

Problems using FC Layers on Images
• How to process a normal image with FC layers

1000

1000
3 3 neuron layer

I2DL: Prof. Dai 7

Problems using FC Layers on Images
• How to process a normal image with FC layers

1000 3 𝑏𝑖𝑙𝑙𝑖𝑜𝑛 weights

1000
3 1000 neuron layer

I2DL: Prof. Dai 8

Why not simply more FC Layers?
We cannot make networks arbitrarily complex

• Why not just go deeper and get better?

– No structure!!
– It is just brute force!
– Optimization becomes hard
– Performance plateaus / drops!

I2DL: Prof. Dai 9

Better Way than FC ?
• We want to restrict the degrees of freedom
– We want a layer with structure
– Weight sharing → using the same weights for different
parts of the image

I2DL: Prof. Dai 10

Using CNNs in Computer Vision

[Li et al., CS231n Course Slides] Lecture 12: Detection and Segmentation
I2DL: Prof. Dai 11
Convolutions

I2DL: Prof. Dai 12

What are Convolutions?
∞

𝑓 ∗ 𝑔 = න 𝑓 𝜏 𝑔 𝑡 − 𝜏 𝑑𝜏
−∞

𝑓 = red
𝑔 = blue
𝑓 ∗ 𝑔 = green

Convolution of two box functions Convolution of two Gaussians

Application of a filter to a function
— The ‘smaller’ one is typically called the filter kernel
I2DL: Prof. Dai 13
What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

‘Slide’ filter kernel from left to right; at each position,

compute a single value in the output data

I2DL: Prof. Dai 14

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3

1 1 1
4⋅ +3⋅ +2⋅ = 3
3 3 3

I2DL: Prof. Dai 15

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0

1 1 1
3⋅ + 2 ⋅ + (−5) ⋅ = 0
3 3 3

I2DL: Prof. Dai 16

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0

1 1 1
2⋅ + (−5) ⋅ + 3 ⋅ = 0
3 3 3

I2DL: Prof. Dai 17

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0 1

1 1 1
−5 ⋅ +3⋅ +5⋅ =1
3 3 3

I2DL: Prof. Dai 18

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0 1 10/3

1 1 1 10
3⋅ +5⋅ +2⋅ =
3 3 3 3

I2DL: Prof. Dai 19

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0 1 10/3 4

1 1 1
5⋅ +2⋅ +5⋅ = 4
3 3 3

I2DL: Prof. Dai 20

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0 1 10/3 4 4

1 1 1
2⋅ +5⋅ +5⋅ = 4
3 3 3

I2DL: Prof. Dai 21

What are Convolutions?
Discrete case: box filter
𝑓 4 3 2 -5 3 5 2 5 5 6

𝑔 1/3 1/3 1/3

𝑓∗𝑔 3 0 0 1 10/3 4 4 16/3

1 1 1 16
5⋅ +5⋅ +6⋅ =
3 3 3 3

I2DL: Prof. Dai 22

What are Convolutions?
Discrete case: box filter
4 3 2 -5 3 5 2 5 5 6

1/3 1/3 1/3

?? 3 0 0 1 10/3 4 4 16/3 ??

What to do at boundaries?

I2DL: Prof. Dai 23

What are Convolutions?
Discrete case: box filter
4 3 2 -5 3 5 2 5 5 6

1/3 1/3 1/3

?? 3 0 0 1 10/3 4 4 16/3 ??

What to do at boundaries?
Option 1: Shrink

3 0 0 1 10/3 4 4 16/3
I2DL: Prof. Dai 24
What are Convolutions?
Discrete case: box filter
0 4 3 2 -5 3 5 2 5 5 6 0

1/3 1/3 1/3

?? 3 0 0 1 10/3 4 4 16/3 ??

1 1 1 7 What to do at boundaries?
0⋅ +4⋅ +3⋅ =
3 3 3 3 Option 2: Pad (often 0’s)

7/3 3 0 0 1 10/3 4 4 16/3 11/3

I2DL: Prof. Dai 25
Convolutions on Images
-5 3 2 -5 3
Image 5 × 5

4 3 2 1 -3
1 0 3 3 5
-2 0 1 4 4

Output 3 × 3
6
5 6 7 9 -1
Kernel 3 × 3

0 -1 0
-1 5 -1 5 ⋅ 3 + −1 ⋅ 3 + −1 ⋅ 2 + −1 ⋅ 0 + −1 ⋅ 4
0 -1 0 = 15 − 9 = 6