Lec 06

Uploaded by

KhánhLinh


Deep Learning Basics

簡韶逸 Shao-Yi Chien

Department of Electrical Engineering
National Taiwan University

References and Slide Credits
• Slides from Deep Learning for Computer Vision, Prof. Yu-Chiang Frank Wang, National Taiwan University
• Slides from Machine Learning, Prof. Hung-Yi Lee, EE, National Taiwan University
• Slides from CE 5554 / ECE 4554: Computer Vision, Prof. J.-B. Huang, Virginia Tech
• Stanford CS231n, http://cs231n.stanford.edu/syllabus.html
• Marc'Aurelio Ranzato, Tutorial in CVPR2014
• Ian Goodfellow, Yoshua Bengio, and Aaron Courville, Deep Learning, https://www.deeplearningbook.org/
• Bishop, Pattern Recognition and Machine Learning
• Reference papers
Outline
• Introduction to neural networks
• Going deeper
• Introduction to convolutional neural networks (CNNs)
• Modern CNN models

History of Neural Networks and Deep Learning [Prof. Hung-Yi Lee]

• 1958: Perceptron (linear model)
• 1969: The perceptron is shown to have limitations
• 1980s: Multi-layer perceptron
  • Not significantly different from today's DNNs
• 1986: Backpropagation
  • Using more than 3 hidden layers usually did not help
• 1989: 1 hidden layer is “good enough”, so why go deep?
• 2006: RBM initialization (breakthrough)
• 2009: GPU
• 2011: Deep learning starts to become popular in speech recognition
• 2012: Deep learning wins the ILSVRC image competition (Geoffrey Hinton)

LeCun, Yann; Bengio, Yoshua; Hinton, Geoffrey, “Deep learning,” Nature, 2015.
How Powerful?
Object Recognition

Not deep-learning

Deep-learning based

Source:
https://devblogs.nvidia.com/parallelforall/mocha-jl-deep-learning-julia/
https://blogs.nvidia.com/blog/2016/07/29/whats-difference-artificial-intelligence-machine-learning-deep-learning-ai/
Biological neuron and Perceptrons

[Figure: a biological neuron (left) and an artificial neuron, the perceptron (right)]

- The perceptron is a linear classifier
Simple, Complex and Hypercomplex cells

David H. Hubel and Torsten Wiesel suggested a hierarchy of feature detectors in the visual cortex, with higher-level features responding to patterns of activation in lower-level cells, and propagating activation upwards to still higher-level cells.

Source: David Hubel's Eye, Brain, and Vision
Hubel/Wiesel Architecture and Multi-layer Neural Network

[Figure: Hubel and Wiesel’s architecture (left) and a multi-layer neural network (right)]

- The multi-layer neural network is a non-linear classifier
Hierarchical Representation Learning
• Successive model layers learn deeper intermediate representations.

Recap: Linear Classification
• Linear Classifier
• Let’s take the input image (flattened into a vector) as x, and the linear classifier as W.
We need y = Wx + b as a 10-dimensional output vector (for 10 classes), indicating the score for each class.
• For example, for an image with 2 x 2 pixels and 3 classes of interest,
we need to learn a linear classifier W (plus a bias b),
so that the desired outputs y = Wx + b are produced.

Image credit: Stanford CS231n
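
The 2 x 2 pixel, 3-class case can be sketched in a few lines of plain Python. This is only an illustration: the pixel values, the entries of W and b, and the helper name `linear_scores` are assumptions chosen for the example, not values taken from the slides. The point is the shapes: x has 4 entries, W is 3 x 4, and b and y each have 3 entries.

```python
# Illustrative values only: a flattened 2 x 2 image (4 pixels) and 3 classes.
x = [56.0, 231.0, 24.0, 2.0]          # input image, flattened to length 4

# W has one row per class and one column per pixel: shape 3 x 4.
W = [
    [0.2, -0.5, 0.1, 2.0],
    [1.5, 1.3, 2.1, 0.0],
    [0.0, 0.25, 0.2, -0.3],
]
b = [1.1, 3.2, -1.2]                  # one bias per class


def linear_scores(W, x, b):
    """Return y = Wx + b as a list with one score per class."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(W, b)]


y = linear_scores(W, x, b)

# The predicted class is the index of the highest score.
predicted_class = max(range(len(y)), key=lambda i: y[i])
print(y, predicted_class)
```

With a learned W and b, the highest score should belong to the correct class; here the weights are hand-picked, so the prediction carries no meaning beyond showing the computation.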

Multi-Layer Perceptron: A Nonlinear Classifier

Multi-Layer Perceptron: A Nonlinear Classifier (cont’d)

Layer 1 in MLP

Layer 2 in MLP

Multi-Layer Perceptron: A Nonlinear Classifier (cont’d)

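
To make "nonlinear classifier" concrete, here is a minimal sketch of a two-layer perceptron that computes XOR, a function no single linear classifier can represent. The weights are hand-set for illustration rather than learned, and the step activation, the names `step`, `layer`, `mlp_xor`, and the choice of XOR itself are assumptions for this example, not content from the slides.

```python
def step(z):
    """Threshold nonlinearity: 1 if z > 0, else 0."""
    return 1 if z > 0 else 0


def layer(W, b, x):
    """One fully connected layer: step(Wx + b), applied per neuron."""
    return [step(sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]


# Layer 1: two hidden neurons (hand-set, hypothetical weights).
W1 = [[1.0, 1.0],    # h1 fires when x1 OR x2 is on
      [1.0, 1.0]]    # h2 fires only when x1 AND x2 are on
b1 = [-0.5, -1.5]

# Layer 2: one output neuron computing "h1 AND NOT h2".
W2 = [[1.0, -1.0]]
b2 = [-0.5]


def mlp_xor(x):
    """Forward pass: input -> layer 1 -> layer 2 -> single output."""
    h = layer(W1, b1, x)
    return layer(W2, b2, h)[0]


for x in ([0, 0], [0, 1], [1, 0], [1, 1]):
    print(x, mlp_xor(x))
```

Each layer on its own is a set of linear classifiers followed by a nonlinearity; it is the stacking that lets the network separate inputs that no single line can.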
Let’s Get a Closer Look…

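• A single neuron

[Figure: a single neuron. The inputs to the neuron determine the activity of the neuron, which is mapped to the output of the neuron; the output is plotted on a 0-to-1 scale against activity from −5 to 5.]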
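
The chain from inputs to activity to output can be sketched as follows. This assumes the output is a logistic sigmoid of the activity, which matches a 0-to-1 output range but is not stated in the text; the function name `neuron` and the bias parameter `b` are likewise illustrative.

```python
import math


def neuron(x, w, b=0.0):
    """Single neuron: activity a = w.x + b, output = sigmoid(a).

    The sigmoid is an assumed choice of activation function.
    """
    activity = sum(wi * xi for wi, xi in zip(w, x)) + b
    output = 1.0 / (1.0 + math.exp(-activity))
    return activity, output


# Sweep the activity across the range -5..5 with a single input and w = [1].
for a in (-5, 0, 5):
    _, out = neuron([a], [1.0])
    print(a, round(out, 3))
```

Under this assumption the output is exactly 0.5 at zero activity and saturates towards 0 and 1 at the two ends of the range.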
Input-Output Function of a Single Neuron

[Figure: input-output function of a single neuron for w = [0,1].]
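
A small sketch of what w = [0,1] implies for a two-input neuron: the first input has zero weight, so the output changes only with the second input. The sigmoid activation, the absence of a bias, and the names `x1`, `x2`, and `neuron_output` are assumptions made for this illustration.

```python
import math


def neuron_output(x1, x2, w=(0.0, 1.0)):
    """Sigmoid output of a bias-free neuron with weights w (assumed form)."""
    activity = w[0] * x1 + w[1] * x2
    return 1.0 / (1.0 + math.exp(-activity))


# Evaluate on a coarse grid over the input plane.
grid = [-5, 0, 5]
for x2 in grid:
    row = [round(neuron_output(x1, x2), 3) for x1 in grid]
    print(f"x2={x2:>2}: {row}")
```

Every printed row is constant across x1, so the surface is a sigmoid ramp along the x2 direction only. Changing w rotates and steepens that ramp.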