MACHINE LEARNING TECHNIQUES
AGENDA:
1. Bidirectional RNN
2. Encoder–Decoder Sequence-to-Sequence Architecture
3. Deep Recurrent Networks
4. Recursive Neural Networks
Two Issues of Standard RNNs
• Vanishing Gradient Problem
– Recurrent Neural Networks let you model time-dependent and sequential data problems, such as stock market prediction, machine translation, and text generation. However, RNNs are hard to train because of the gradient problem.
– RNNs suffer from vanishing gradients. Gradients carry the information used to update the RNN's parameters, and when the gradient becomes too small, the parameter updates become insignificant. This makes learning from long data sequences difficult.
• Exploding Gradient Problem
– While training a neural network, if the gradient tends to grow exponentially instead of decaying, it is called an exploding gradient. This problem arises when large error gradients accumulate, resulting in very large updates to the model weights during training. A common remedy is gradient clipping, sketched below.
– Long training time, poor performance, and low accuracy are the major symptoms of gradient problems.
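
A minimal sketch of gradient clipping, the standard remedy for exploding gradients. PyTorch is assumed here (the slides name no framework); the layer sizes, the random data, and the max_norm value are all illustrative.

    import torch
    import torch.nn as nn

    rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
    head = nn.Linear(16, 1)
    params = list(rnn.parameters()) + list(head.parameters())
    opt = torch.optim.SGD(params, lr=0.01)

    x = torch.randn(4, 100, 8)        # 4 sequences, 100 time steps each
    y = torch.randn(4, 1)

    out, _ = rnn(x)                   # out: (4, 100, 16)
    loss = nn.functional.mse_loss(head(out[:, -1]), y)
    loss.backward()

    # Rescale gradients so their global norm is at most 1.0; this bounds
    # the weight update even if gradients blow up over the 100 time steps.
    torch.nn.utils.clip_grad_norm_(params, max_norm=1.0)
    opt.step()

Note that clipping does not address vanishing gradients; those are usually mitigated with gated units such as LSTM or GRU.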
BIDIRECTIONAL RNN
• Connect two hidden layers of opposite
directions to the same output.
• The output layer receives information from both past (via the forward direction) and future (via the backward direction) states simultaneously, as sketched below.
Applications:
Handwriting Recognition
Language Translation
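
A minimal bidirectional RNN sketch, again assuming PyTorch with illustrative sizes: passing bidirectional=True runs one RNN forward and one backward over the same input and concatenates their hidden states.

    import torch
    import torch.nn as nn

    birnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True,
                   bidirectional=True)

    x = torch.randn(4, 10, 8)     # (batch, time, features)
    out, h_n = birnn(x)

    # Forward and backward hidden states are concatenated at every time
    # step, so each output position sees both past and future context.
    print(out.shape)              # torch.Size([4, 10, 32]) = 2 * hidden_size
    print(h_n.shape)              # torch.Size([2, 4, 16]), one final state per direction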
Encoder–Decoder Sequence-to-Sequence Architecture
• A special case of RNNs that maps an input sequence to an output sequence, where the two lengths may differ.
• The model has three parts:
• Encoder
• Intermediate (encoder) vector
• Decoder
[Diagram: a chain of encoder RNN units feeds the encoder vector, which feeds a chain of decoder RNN units]
Encoder: A stack of recurrent units, each accepting one element of the input sequence and passing its hidden state forward, so the input is compressed step by step.
Encoder Vector: The final hidden state of the encoder. It aims to summarize the entire input sequence and serves as the initial hidden state of the decoder.
Decoder: A stack of recurrent units that starts from the encoder vector and emits one element of the output sequence at each time step.
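
A minimal encoder–decoder sketch under stated assumptions: PyTorch, GRU units, greedy feed-back of each prediction as the next decoder input, and a caller-chosen output length. All sizes and names (Seq2Seq, tgt_len) are illustrative, not a full implementation.

    import torch
    import torch.nn as nn

    class Seq2Seq(nn.Module):
        def __init__(self, in_dim=8, hid=16, out_dim=8):
            super().__init__()
            self.encoder = nn.GRU(in_dim, hid, batch_first=True)
            self.decoder = nn.GRU(out_dim, hid, batch_first=True)
            self.proj = nn.Linear(hid, out_dim)

        def forward(self, src, tgt_len=5):
            # Encoder: read the whole input; the final hidden state is
            # the intermediate encoder vector summarizing the sequence.
            _, context = self.encoder(src)           # (1, batch, hid)
            # Decoder: start from the encoder vector and emit one output
            # per step, feeding each prediction back in as the next input.
            step = torch.zeros(src.size(0), 1, self.proj.out_features)
            hidden, outputs = context, []
            for _ in range(tgt_len):
                dec_out, hidden = self.decoder(step, hidden)
                step = self.proj(dec_out)            # (batch, 1, out_dim)
                outputs.append(step)
            return torch.cat(outputs, dim=1)         # (batch, tgt_len, out_dim)

    src = torch.randn(4, 10, 8)                # input length 10
    print(Seq2Seq()(src, tgt_len=5).shape)     # output length 5 differs from input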
Applications
Sentiment Analysis
Machine Translation
Speech Recognition
Video and Image Captioning
Text Summarization
Chatbot creation
DEEP RECURRENT NETWORKS
• A class of neural networks in which several recurrent hidden layers are stacked on top of one another, so the connections between nodes form a deep computation graph.
• The computation process:
1. Input to the Hidden state
2. Between two hidden states
3. Hidden state to the output.
DEEP RECURRENT NETWORKS
• Advantages:
– Process inputs of any length
– Possess internal memory
• Disadvantages:
– Initialization of the model requires care to obtain convergence.
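
A minimal sketch of a deep (stacked) recurrent network, with PyTorch assumed and all sizes illustrative: num_layers=3 stacks three recurrent layers, so each layer's hidden states become the inputs of the layer above.

    import torch
    import torch.nn as nn

    deep_rnn = nn.RNN(input_size=8, hidden_size=16, num_layers=3,
                      batch_first=True)

    x = torch.randn(4, 10, 8)     # sequences of any length are accepted
    out, h_n = deep_rnn(x)

    print(out.shape)              # torch.Size([4, 10, 16]), top layer's states
    print(h_n.shape)              # torch.Size([3, 4, 16]), one final state per layer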
RECURSIVE NEURAL NETWORKS
• Apply the same set of weights recursively over a structured input to produce a structured prediction over variable-size input structures.
• Types:
• Inner Approach
• Outer Approach
• Application: Sentiment Analysis
Inner Approach
• Conduct recursion inside the underlying graph; the objective is usually achieved by moving forward (bottom-up) through the structure.
Outer Approach
• Conduct recursion from outside the underlying graph.
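
A minimal recursive-network sketch following the inner (bottom-up) approach above: one shared weight matrix is applied recursively at every node of a binary tree. PyTorch is assumed, and the nested-tuple tree encoding and all sizes are illustrative.

    import torch
    import torch.nn as nn

    dim = 16
    compose = nn.Linear(2 * dim, dim)   # the same weights reused at every node

    def encode(node):
        # A leaf is a vector; an internal node is a pair (left, right).
        if isinstance(node, torch.Tensor):
            return node
        left, right = node
        # Merge the two child representations into one parent vector.
        return torch.tanh(compose(torch.cat([encode(left), encode(right)])))

    # Tree for a three-word phrase: ((w1 w2) w3)
    w1, w2, w3 = (torch.randn(dim) for _ in range(3))
    root = encode(((w1, w2), w3))
    print(root.shape)   # torch.Size([16]), a fixed-size vector for the whole tree

For sentiment analysis, a classifier head would read the root vector; the same encode function handles trees of any shape because the weights are shared.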
DIFFERENCE BETWEEN RECURSIVE AND RECURRENT NEURAL NETWORKS
• Structure: Recursive networks have a tree-like structure; recurrent networks have a chain-like structure.
• Weights: Recursive networks apply the same weights repeatedly over the nodes of a tree; recurrent networks reuse the same weights at each time step of a chain.
• Cost: Recursive networks are complex and expensive at the learning phase; recurrent networks are computationally less complex.