Lecture 11

The lecture covers sequence models, focusing on recurrent neural networks, the vanishing and exploding gradients problem, and long-short term memory (LSTM) networks. It discusses LSTM applications including language models, translation, caption generation, and program execution. Additionally, it touches on advanced topics like Neural Turing Machines and various prediction and recognition tasks.

Uploaded by

Tachbir Dewan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views22 pages

Lecture 11

Uploaded by

Tachbir Dewan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Outline of the lecture

This lecture introduces you sequence models. The goal is for you to
learn about:

 Recurrent neural networks

 The vanishing and exploding gradients problem
 Long-short term memory (LSTM) networks
 Applications of LSTM networks
 Language models
 Translation
 Caption generation
 Program execution
A simple recurrent neural network

[Alex Graves]
Vanishing gradient problem

[Yoshua Bengio et al]

Vanishing gradient problem
Simple solution
LSTM

[Alex Graves]
LSTM
Entry-wise multiplication layer
LSTM cell in Torch
LSTM column in Torch
LSTMs for sequence to sequence prediction

[Ilya Sutskever et al]

LSTMs for sequence to sequence prediction
Learning to parse

[Oriol Vinyals et al]

Learning to execute

[Wojciech Zaremba and Ilya Sutskever]

Video prediction
Hand-writing recognition and synthesis

[Alex Graves]
Neural Turing Machine (NTM)

[Alex Graves, Greg Wayne, Ivo Danihelka]

Neural Turing Machine (NTM)
Neural Turing Machine (NTM)
Translation with alignment (Bahdanau et al)
Show, attend and tell

[Kelvin Xu et al, 2015]

Show, attend and tell

RNN & LSTM: Nguyen Van Vinh Computer Science Department, UET, Vnu Ha Noi
No ratings yet
RNN & LSTM: Nguyen Van Vinh Computer Science Department, UET, Vnu Ha Noi
35 pages
Sequence Models RNNS, LSTMs
No ratings yet
Sequence Models RNNS, LSTMs
3 pages
RNN LSTM
No ratings yet
RNN LSTM
42 pages
cs224n spr2024 Lecture06 Fancy RNN
No ratings yet
cs224n spr2024 Lecture06 Fancy RNN
56 pages
Deep Learning: Sequence Models
No ratings yet
Deep Learning: Sequence Models
85 pages
RNN LSTM
No ratings yet
RNN LSTM
72 pages
Deep Learning: Sequence Models Course
No ratings yet
Deep Learning: Sequence Models Course
1 page
Chapter 2
No ratings yet
Chapter 2
68 pages
Lecture 5
No ratings yet
Lecture 5
102 pages
Recurrent Neural Networks (RNN) : Course 5: Sequence Models
No ratings yet
Recurrent Neural Networks (RNN) : Course 5: Sequence Models
76 pages
ch10 Sequence Modelling - Recurrent and Recursive Nets
No ratings yet
ch10 Sequence Modelling - Recurrent and Recursive Nets
45 pages
Sequential Machine Learning Guide
No ratings yet
Sequential Machine Learning Guide
10 pages
Slides RNN
No ratings yet
Slides RNN
75 pages
Sequence Models - Merged
No ratings yet
Sequence Models - Merged
67 pages
Lecture 4
No ratings yet
Lecture 4
34 pages
Cs224n 2025 Lecture06 Fancy RNN
No ratings yet
Cs224n 2025 Lecture06 Fancy RNN
57 pages
Long Short-Term Memory (LSTM) : A Deep Dive Into Sequential Learning
No ratings yet
Long Short-Term Memory (LSTM) : A Deep Dive Into Sequential Learning
17 pages
RNN StannfordBased
No ratings yet
RNN StannfordBased
102 pages
Recurrent Neural Networks (RNN) : Subtitle
No ratings yet
Recurrent Neural Networks (RNN) : Subtitle
53 pages
LSTM Seq2Seq Models for Text Data
No ratings yet
LSTM Seq2Seq Models for Text Data
44 pages
Unit 4
No ratings yet
Unit 4
27 pages
9 RNN LSTM Gru
No ratings yet
9 RNN LSTM Gru
91 pages
AML - Lecture - 09 - 08nov24
No ratings yet
AML - Lecture - 09 - 08nov24
126 pages
CH4 - AA1.1-Sequence Models
No ratings yet
CH4 - AA1.1-Sequence Models
26 pages
Deep Learning for Sequence Data
No ratings yet
Deep Learning for Sequence Data
22 pages
Sequence Modeling
100% (1)
Sequence Modeling
131 pages
Sequence Models Notes
No ratings yet
Sequence Models Notes
4 pages
Sequence Model
No ratings yet
Sequence Model
13 pages
Sequence Models
No ratings yet
Sequence Models
73 pages
Intro to Recurrent Neural Networks
No ratings yet
Intro to Recurrent Neural Networks
79 pages
04 - RNNs
No ratings yet
04 - RNNs
37 pages
LSTM
No ratings yet
LSTM
10 pages
Deep Learning RNN
100% (2)
Deep Learning RNN
53 pages
NLP Lecture 6
No ratings yet
NLP Lecture 6
57 pages
RNN LSTM GRU Transformers
0% (1)
RNN LSTM GRU Transformers
123 pages
RNN Stanford
No ratings yet
RNN Stanford
44 pages
07 - Recurrent Neural Networks
No ratings yet
07 - Recurrent Neural Networks
44 pages
Sequence Models231205
No ratings yet
Sequence Models231205
72 pages
11 RNN
No ratings yet
11 RNN
32 pages
RNNs: Temporal Sequence Processing
No ratings yet
RNNs: Temporal Sequence Processing
45 pages
Unit 3 - Part 02
No ratings yet
Unit 3 - Part 02
40 pages
09-RNN (V.Andicsova)
No ratings yet
09-RNN (V.Andicsova)
30 pages
Unit III - Recurrent Neural Networks
No ratings yet
Unit III - Recurrent Neural Networks
44 pages
Sequence Modeling with RNNs
No ratings yet
Sequence Modeling with RNNs
92 pages
Understanding LSTM Networks - Colah's Blog
No ratings yet
Understanding LSTM Networks - Colah's Blog
7 pages
Recurrent Neural Networks LSTMS, Transformers, Graph Neural Networks
No ratings yet
Recurrent Neural Networks LSTMS, Transformers, Graph Neural Networks
97 pages
RNNs: LSTM vs GRU Lecture
No ratings yet
RNNs: LSTM vs GRU Lecture
22 pages
RNNs & LSTMs for Sequential Data
No ratings yet
RNNs & LSTMs for Sequential Data
32 pages
Module 4-1
No ratings yet
Module 4-1
44 pages
RNN
No ratings yet
RNN
28 pages
LSTM
No ratings yet
LSTM
123 pages
RNNs and Their Types - 15 Slides (Easy Copy-Paste Format)
No ratings yet
RNNs and Their Types - 15 Slides (Easy Copy-Paste Format)
6 pages
Dis6 Sol
No ratings yet
Dis6 Sol
6 pages
07 RNN Recurrent Neural Networks
No ratings yet
07 RNN Recurrent Neural Networks
115 pages
Part 5
No ratings yet
Part 5
37 pages
LSTM
No ratings yet
LSTM
19 pages
Computer Network
No ratings yet
Computer Network
27 pages
Project Name:: Tetris Game
No ratings yet
Project Name:: Tetris Game
14 pages
1.2 Report
No ratings yet
1.2 Report
13 pages
CMS 1
No ratings yet
CMS 1
1 page
Cover
No ratings yet
Cover
1 page
Lec04 Compression Part2
No ratings yet
Lec04 Compression Part2
23 pages
Chapter 1
No ratings yet
Chapter 1
17 pages
Chapter 5
No ratings yet
Chapter 5
26 pages
Lecture 6
No ratings yet
Lecture 6
16 pages
Chapter 3
No ratings yet
Chapter 3
16 pages
Chapter 18
No ratings yet
Chapter 18
15 pages
Student Project: HR System
No ratings yet
Student Project: HR System
22 pages

Lecture 11

Uploaded by

Lecture 11

Uploaded by

Outline of the lecture

 Recurrent neural networks

[Yoshua Bengio et al]

[Ilya Sutskever et al]

[Oriol Vinyals et al]

[Wojciech Zaremba and Ilya Sutskever]

[Alex Graves, Greg Wayne, Ivo Danihelka]

[Kelvin Xu et al, 2015]

You might also like