BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, PILANI (RAJ.)
                            First Semester 2023-24           CS F425             Deep Learning
                                    COMPREHENSIVE EXAMINATION [Closed Book]
             Date: 20th December 2023        Weightage: 35 %      Max. Marks: 70        Duration: 180 minutes
                                                   Part A [CLOSED BOOK] [28 Marks]
Important Instructions:
1. The exam has THREE parts – A, B and C. Parts A and B are closed book and are provided to you at the beginning of the exam. The recommended time for them is two hours in total. You can collect Part C (which is Open Book) whenever you submit Parts A and B.
2. This is Part A, containing short-answer type questions.
3. Any overwritten answers will not be considered for a recheck request.
4. For Q1 & Q2: write your answers in the grid provided below.

        Q1 answers (MCQ):          i)    ii)    iii)    iv)    v)    vi)    vii)    viii)    ix)    x)
        Q2 answers (True/False):   i)    ii)    iii)    iv)    v)    vi)    vii)    viii)    ix)    x)

        MARKS:   Marks of Q1:          Marks of Q2:          Marks of Q3:
                 TOTAL MARKS:          Recheck request:
 1. Multiple Choice Questions (+1 for right answer, -0.5 for wrong). Only one correct answer. [1*10 = 10]
    i). Which of the following activation functions can lead to the vanishing gradient problem?
       A). ReLU,             B). tanh,                C). Leaky ReLU,          D). None of these.
    ii). Which of the following techniques can NOT help prevent a model from overfitting?
       A). Data augmentation,        B). Dropout,           C). Early stopping,     D). None of these
    iii). After training a neural network, you observe a large gap between the training accuracy (95%) and the test accuracy
    (35%). Which of the following methods can be used to reduce this gap?
       A). Generative adversarial network,  B). Sigmoid activation,     C). RMSprop optimizer,           D). Dropout.
    iv). Which of the following regularization methods leads to weight sparsity?
      A). L1 regularization, B). L2 regularization, C). Early stopping,      D). None of these.
    v) Which of the following layers is generally NOT a part of a CNN?
    A) Convolutional Layer        B) Pooling Layer             C) Code Layer              D) Fully connected Layer
     vi). Which of the following can you use to solve the exploding gradient problem?
      A) Use SGD optimization, B) Oversample minority classes, C) Increase the batch size, D) Impose gradient clipping.
     vii). If a 24x24 input to a CNN is convolved with a 7x7 kernel using "same" padding and a stride of 1, what will be the
     size of the output matrix? (A short dimension-arithmetic sketch follows this question set.)
     A) 18x18                 B) 24x24               C) 17x17            D) Cannot be determined with the information provided
    viii). The convolution operation doesn't fully use the pixels at the corners of an image. This is resolved by the use of:
    A) Padding           B) Striding                 C) Kernels           D) Pooling
   ix). Which of the following is true about dropout?
    A) Dropout leads to sparsity in the trained weights      B) At test time, dropout is applied with inverted keep probability
    C) The larger the keep probability of a layer, the stronger the regularization of the weights in that layer     D) None of these
   x). Which of the following is TRUE about Momentum?
   A) It helps in accelerating SGD in a relevant direction          B) It helps SGD in avoiding local minima
   C) It helps in faster convergence                                D) All of these
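The dimension arithmetic behind question vii reduces to one standard formula, out = floor((n + 2p − k)/s) + 1; below is a minimal Python sketch (the helper name conv_out_size is ours, and the assumed "same"-padding rule for an odd kernel is p = (k − 1)/2 per side):

    def conv_out_size(n, k, p, s=1):
        """Spatial output size of a convolution: floor((n + 2p - k) / s) + 1."""
        return (n + 2 * p - k) // s + 1

    # "Same" padding with stride 1 preserves the spatial size:
    # a 24x24 input convolved with a 7x7 kernel stays 24x24.
    assert conv_out_size(24, 7, p=(7 - 1) // 2) == 24
    # With no padding ("valid"), the same input shrinks to 18x18.
    assert conv_out_size(24, 7, p=0) == 18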
2. Answer as TRUE or FALSE. No reasoning or justification required. (+1 for right answer, -0.5 for wrong) [1*10=10]
       i) Convolutional networks generally have more parameters than their equivalent fully connected networks.
      ii)  Autoencoders are able to compress data and thus can be used as a generic data compression algorithm.
    iii)  The output of the autoencoder will not be exactly the same as the input, and thus they are “lossy”.
     iv)  Autoencoders are considered a supervised learning technique since they produce the reconstructed image
          using the original image as an input.
       v)  An autoencoder can be forced to learn useful features by adding random noise to its inputs and making it
           recover the original noise-free data. (An illustrative sketch of this follows the question set.)
      vi)  Apart from being an optimization technique, batch normalization also acts as a regularizer and often eliminates
           the need for using Dropout.
    vii)  Regularization is intended to reduce the training error as well as the generalization error.
   viii)  Pooling layers involve many fixed computations and hence they slow down the computation in a neural
          network.
      ix)  The basic concept behind RNNs is that they use recurrent features from the dataset to find the best optimization.
       x)  In general, training a GAN involves alternating periods where the discriminator trains for one or more epochs,
           followed by the generator being trained for one or more epochs.
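Statement (v) describes a denoising autoencoder. A minimal PyTorch training-step sketch of that idea (the layer sizes, noise level, and MSE objective here are illustrative assumptions, not taken from this paper):

    import torch
    import torch.nn as nn

    # Tiny denoising autoencoder: corrupt the input, reconstruct the clean target.
    autoencoder = nn.Sequential(
        nn.Linear(784, 64), nn.ReLU(),    # encoder: compress to a 64-d code
        nn.Linear(64, 784), nn.Sigmoid()  # decoder: reconstruct the input
    )
    opt = torch.optim.Adam(autoencoder.parameters(), lr=1e-3)

    x = torch.rand(32, 784)                  # stand-in for a batch of images
    x_noisy = x + 0.1 * torch.randn_like(x)  # add random noise to the input ...
    loss = nn.functional.mse_loss(autoencoder(x_noisy), x)  # ... and recover the clean data
    opt.zero_grad()
    loss.backward()
    opt.step()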
3. If the input is of size 256x256x6 and the neural network structure is as indicated in the first column below, calculate the
   output feature map dimensions for each layer. [1*8=8]
            The notation follows the convention:
             • CONV-K-N denotes a convolutional layer with N filters, each of them of size KxK. The padding and stride
             parameters are always 0 and 1, respectively.
            • POOL-K indicates a KxK pooling layer with stride K and padding 0.
            • FC-N stands for a fully-connected layer with N neurons.
    Write your answer in the space provided in the table below. (A small dimension-arithmetic sketch follows the table.)
                                          Layer           Feature map dimensions
                                          INPUT           256x256x6
                                          CONV-57-64
                                          POOL-2
                                          CONV-5-32
                                          POOL-2
                                          CONV-5-64
                                          POOL-2
                                          POOL-2
                                          FC-9
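The notation above reduces to two size rules, sketched below in Python (illustrative helpers with our own names; this does not fill in the table):

    def conv_out(n, k):
        """CONV-K-N: padding 0, stride 1, so spatial size n shrinks to n - k + 1;
        the channel count becomes N (the number of filters)."""
        return n - k + 1

    def pool_out(n, k):
        """POOL-K: stride K, padding 0, so spatial size n divides to n // k;
        the channel count is unchanged."""
        return n // k

    # e.g. CONV-5 on a 96x96 map gives 92x92, and POOL-2 then gives 46x46.
    assert conv_out(96, 5) == 92 and pool_out(92, 2) == 46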
                          BIRLA INSTITUTE OF TECHNOLOGY AND SCIENCE, PILANI (RAJ.)
                             First Semester 2023-24           CS F425             Deep Learning
                                     COMPREHENSIVE EXAMINATION [Closed Book]
              Date: 20th December 2023        Weightage: 35 %      Max. Marks: 70        Duration: 180 minutes
                                                Part B [CLOSED BOOK] [22 Marks]
    This is Part B, Closed Book. Together with Part A, you are recommended to finish this in two hours. Once you submit
    Parts A and B, you can collect Part C [Open Book].
 Q.1. Consider the following types of sequence modelling scenarios represented using an unfolded recurrent neural network
 (RNN) over time-steps:                                                                             [1+1+1+1=4]
            [Figure: four unfolded RNN configurations over time-steps, labelled A, B, C, and D, each with a different pattern of inputs and outputs.]
     Categorize each of the following applications as exactly one of the above types, i.e. A, B, C, or D. No reasoning or
     explanation is required. (Note: no marks will be awarded if more than one answer is written for an application.)
    i). Image captioning, ii). Sentiment prediction, iii). Machine translation, iv). Video frame classification.
Q.2. Fill in the blanks in the following graph with regard to the regularization method “Early stopping”.              [3]
 Q.3. You use vanilla (batch) gradient descent to optimize your loss function, but you realise you are getting a poor training
    loss. You notice that you are not shuffling the training data and suspect that this might be the cause. Would shuffling
    the training data help in this regard? Give a clear YES or NO as an answer, followed by a 1-2 line justification.    [2]
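Whether data order can matter to a full-batch gradient is easy to probe numerically; a small NumPy sketch (the least-squares loss and all names here are illustrative assumptions):

    import numpy as np

    rng = np.random.default_rng(0)
    X, y = rng.normal(size=(100, 3)), rng.normal(size=100)
    w = np.zeros(3)

    def batch_grad(X, y, w):
        """Gradient of the mean squared error over the FULL batch."""
        return 2 * X.T @ (X @ w - y) / len(y)

    # A mean over all points is order-independent, so permuting the data
    # leaves the full-batch gradient unchanged (up to floating-point round-off).
    perm = rng.permutation(len(y))
    assert np.allclose(batch_grad(X, y, w), batch_grad(X[perm], y[perm], w))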
Q.4. Suppose we train two different deep CNNs to classify images: (i) using ReLU as the activation function, and (ii) using
      sigmoid as the activation function. For each of them, we try initializing the weights with three different initialization
      methods, while the biases are always initialized to all zeros. We plot the validation accuracies against training
      iterations below:                                                                                                  [3]
      Which weight initialization method was used in each of plots A, B, and C: zero initialization, Xavier initialization,
      or Kaiming He initialization? (Answer with exactly one initialization method for each of A, B, and C.)
 Q.5. You are solving the binary classification task of classifying images as "car vs. no car". You design a CNN with a single
    output neuron. The final output of your network, ŷ, is given by:
    ŷ = σ(ReLU(z))                 (where z, as usual, is w·x + b)
    You classify all inputs with a final value ŷ ≥ 0.5 as car images. What problem are you going to encounter? Justify. [2]
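The behaviour of the composition σ(ReLU(z)) can be checked in a few lines of Python (an illustrative sketch; the sigmoid is written out by hand):

    import math

    def sigmoid(t):
        return 1.0 / (1.0 + math.exp(-t))

    # ReLU(z) = max(0, z) is never negative, and sigmoid(t) >= 0.5 whenever
    # t >= 0, so sigmoid(ReLU(z)) >= 0.5 for every possible input z.
    for z in (-100.0, -1.0, 0.0, 1.0, 100.0):
        assert sigmoid(max(0.0, z)) >= 0.5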
 Q.6. Given N training data points {(x_i, y_i)}, i = 1:N, with x_i ∈ R^d and labels y_i ∈ {1, −1}, we need a linear classifier
    f(x) = sign(w·x) (read as "w dot x") optimizing the loss function L(z) = e^(−z), for z = y(w·x). In the accompanying
    plot, one marker represents data points of class 1 and the other marker represents data points of class −1.   [6+2=8]
    a). Explain the penalties given by this loss function for the different data points (1 to 6) shown in the plot.
    b). Derive the stochastic gradient descent update Δw for L(z).
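For part b), one standard route is the chain rule applied to z = y(w·x); a worked LaTeX sketch (η, the learning rate, is our added symbol, not defined in the question):

    \[
    \nabla_w L \;=\; \frac{dL}{dz}\,\nabla_w z
               \;=\; -e^{-z}\, y\, x
               \;=\; -\,y\, e^{-y\,(w \cdot x)}\, x,
    \qquad
    \Delta w \;=\; -\eta\, \nabla_w L \;=\; \eta\, y\, e^{-y\,(w \cdot x)}\, x .
    \]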