100% found this document useful (2 votes)

372 views3 pages

PA12

This document contains practice questions for a reinforcement learning assignment. It includes 5 multiple choice questions about partially observable Markov decision processes and solving them. The questions cover topics like recovering optimal policies from Q-learning, calculating probabilities in grid worlds with noisy observations, and the value of including actions in histories for partially observable systems.

Uploaded by

udayraj singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (2 votes)

372 views3 pages

PA12

Uploaded by

udayraj singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Practice Assignment 12

Reinforcement Learning
Prof. B. Ravindran
Instructions: In the following questions, one or more choices may be correct. Select all that apply.

1. Suppose that we solve a POMDP using a Q-MDP like solution discussed in the lectures - where
we assume that the MDP is known and solve it to learn Q values for the true (state, action)
pairs. Which of the following are true?
(a) We can recover a policy for execution in the partially observable environment
P by weighting
Q values by the belief distribution bel so that π(s) = argmaxa s bel(s)Q(s, a).
(b) We can recover an optimal policy for the POMDP from the Q values that have been
learnt for the true (state, action) pairs.
(c) Policies recovered from Q-MDP like solution methods are always better than policies
learnt by history based methods.
(d) None of the above
Sol. (a)
(a) is true. This strategy can be used to recover a policy for execution.
(b) is false. When learning the Q values, we assumed that state was available. However this is
not true for the partially observable environment, so it will typically not be possible to recover
a policy that is optimal for the POMDP from the learnt Q values.
(c) is false.
2. Consider the below grid-world:

In the figure above, black squares are blocked. Assume the agent can see one step in the 4
cardinal directions. Assume that the agent’s observations are always correct and that there is
no prior information given regarding the states.
Assertion: If the observation is that there are no obstruction to the East or West, but are
present to the North and South, the belief that the agent is in the green shaded square is 0.5.
Reason: Only the green and blue shaded squares have obstructions to the North and South,
but not to the East or West.

1
(a) Assertion and Reason are both true and Reason is a correct explanation of Assertion.
(b) Assertion and Reason are both true and Reason is not a correct explanation of Assertion.
(c) Assertion is true but Reason is false.
(d) Assertion and Reason are both false.
Sol. (d)
The square 3 units to the north of the green square also has the same property, invalidating
the assertion and the reason.
3. Consider the grid world shown below. Walls and obstacles are colored gray. An agent is
dropped into one of the unoccupied cells of the environment uniformly at random. The agent
is equipped with a sensor that can detect the presence of walls or obstacles immediately to its
North, South, East or West. However the sensor is noisy, and an observation made in each
direction may be wrong with a probability of 0.1. Given that the agent senses no obstacles in
any direction, what is the probability that it was dropped into the cell marked ‘x’ ?

(a) 1/5
(b) 82/91
(c) 164/173
(d) None of the above.

4. In the same environment as Question 3, what is the probability that the agent was not dropped
onto the cell marked ‘x’, if the observation made is that there are obstacles present only to the
North and to the South?

(a) 4/5
(b) 82/91
(c) 164/173
(d) None of the above.

2
Sol. (c)
Applying Bayes Rule,
P (A|B) = P (B|A)PP(A)+P
(B|A)P (A)
(B|¬A)P (¬A) with A being the event that the agent was “not dropped
into the cell marked x” and B being “Obstacles observed only to the North and to the South.”
[0.4×0.1×0.93 ]+[0.4×0.13 ×0.9] 164
P (A|B) = [0.4×0.1×0.93 ]+[0.4×0.13 ×0.9]+(0.92 ×0.12 )0.2 = 173

5. Asserion: In partially observable systems, histories that include both the sequence of ob-
servations and the sequence of actions are typically able to disambiguate the true state of an
agent better than histories that include only the sequence of observations.
Reason: Different sequences of actions can lead to different interpretations of the sequence
of sensor observations.
(a) Both Assertion and Reason are true, and Reason is a correct explanation of the Assertion.
(b) Both Assertion and Reason are true, but Reason is not a correct explanation of the
Assertion.
(c) Assertion is true, Reason is false
(d) Both Assertion and Reason are false
Sol. (a)
Both Assertion and Reason are true, and Reason is correct explanation for Assertion. Refer
to the lecture on Solving POMDPs.

Reinforcement Learning - Week 12
No ratings yet
Reinforcement Learning - Week 12
3 pages
Reinforcement Learning - Unit 13 - Week 10
No ratings yet
Reinforcement Learning - Unit 13 - Week 10
3 pages
Assignment 4: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Assignment 4: Reinforcement Learning Prof. B. Ravindran
4 pages
Assignment 6 (Sol.) : Reinforcement Learning
No ratings yet
Assignment 6 (Sol.) : Reinforcement Learning
4 pages
Practice Assignment 5: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Practice Assignment 5: Reinforcement Learning Prof. B. Ravindran
2 pages
Non-Graded: Assignment 1: (Https://swayam - Gov.in)
No ratings yet
Non-Graded: Assignment 1: (Https://swayam - Gov.in)
37 pages
Introduction To Machine Learning Assignment-Week 4
No ratings yet
Introduction To Machine Learning Assignment-Week 4
5 pages
Assignment 8: Reinforcement Learning Prof. B. Ravindran
100% (2)
Assignment 8: Reinforcement Learning Prof. B. Ravindran
4 pages
Assignment 7 (Sol.) : Reinforcement Learning
0% (1)
Assignment 7 (Sol.) : Reinforcement Learning
3 pages
Assignment 9: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Assignment 9: Reinforcement Learning Prof. B. Ravindran
3 pages
Practice Assignment 6: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Practice Assignment 6: Reinforcement Learning Prof. B. Ravindran
24 pages
Assignment 3: Reinforcement Learning Prof. B. Ravindran
100% (1)
Assignment 3: Reinforcement Learning Prof. B. Ravindran
4 pages
Reinforcement Learning - Unit 14 - Week 11
No ratings yet
Reinforcement Learning - Unit 14 - Week 11
3 pages
Week3 Assignment
No ratings yet
Week3 Assignment
6 pages
Reinforcement Learning Assignment Solutions
100% (1)
Reinforcement Learning Assignment Solutions
4 pages
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
100% (2)
Assignment 11: Introduction To Machine Learning Prof. B. Ravindran
3 pages
Assignment Week 5
100% (1)
Assignment Week 5
5 pages
Machine Learning Quiz for Students
No ratings yet
Machine Learning Quiz for Students
5 pages
Assignment 1
No ratings yet
Assignment 1
7 pages
Reinforcement Learning - Unit 6 - Week 4
0% (1)
Reinforcement Learning - Unit 6 - Week 4
3 pages
Assignment 2
No ratings yet
Assignment 2
7 pages
Reinforcement Learning Quiz
No ratings yet
Reinforcement Learning Quiz
2 pages
Machine Learning Quiz for Students
No ratings yet
Machine Learning Quiz for Students
45 pages
Introduction To Machine Learning - IITKGP - Unit 4 - Week 2
No ratings yet
Introduction To Machine Learning - IITKGP - Unit 4 - Week 2
5 pages
Machine Learning MCQ Assignment
No ratings yet
Machine Learning MCQ Assignment
56 pages
Assignment 7
100% (1)
Assignment 7
3 pages
Introduction To Machine Learning - Unit 4 - Week 2
100% (1)
Introduction To Machine Learning - Unit 4 - Week 2
3 pages
NPTEL Introduction To Machine Learning Assignment 10 Answers
100% (1)
NPTEL Introduction To Machine Learning Assignment 10 Answers
7 pages
Assignment 11: Reinforcement Learning Prof. B. Ravindran
No ratings yet
Assignment 11: Reinforcement Learning Prof. B. Ravindran
4 pages
DEEP LEARNING IIT Kharagpur Assignment - 5 - 2024
No ratings yet
DEEP LEARNING IIT Kharagpur Assignment - 5 - 2024
9 pages
Deep Learning - IIT Ropar - Unit 6 - Week 3
No ratings yet
Deep Learning - IIT Ropar - Unit 6 - Week 3
4 pages
Understanding Machine Learning Solution Manual: 2 Gentle Start
No ratings yet
Understanding Machine Learning Solution Manual: 2 Gentle Start
67 pages
Assignment 4: Introduction To Machine Learning Prof. B. Ravindran
0% (1)
Assignment 4: Introduction To Machine Learning Prof. B. Ravindran
2 pages
ML Assignment 2 2019 Nptel
No ratings yet
ML Assignment 2 2019 Nptel
34 pages
IML-IITKGP - Assignment 5 Solution
No ratings yet
IML-IITKGP - Assignment 5 Solution
7 pages
Assignment 1: Reinforcement Learning Prof. B. Ravindran
100% (2)
Assignment 1: Reinforcement Learning Prof. B. Ravindran
4 pages
Machine 2020 Jul-Dec
No ratings yet
Machine 2020 Jul-Dec
45 pages
Week 5
100% (1)
Week 5
3 pages
Week 12
No ratings yet
Week 12
59 pages
Machine Learning Quiz Solutions
No ratings yet
Machine Learning Quiz Solutions
3 pages
2023 ML Assignment
No ratings yet
2023 ML Assignment
57 pages
Deep Learning - IIT Ropar - Unit 4 - Week 1
No ratings yet
Deep Learning - IIT Ropar - Unit 4 - Week 1
8 pages
Introduction To Machine Learning - Unit 3 - Week 1 - Non - Graded
100% (1)
Introduction To Machine Learning - Unit 3 - Week 1 - Non - Graded
3 pages
DEEP LEARNING IIT Kharagpur Assignment - 2 - 2024 - Updated
No ratings yet
DEEP LEARNING IIT Kharagpur Assignment - 2 - 2024 - Updated
6 pages
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
100% (1)
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
4 pages
Assignment 1
No ratings yet
Assignment 1
4 pages
Machine Learning Assignment Solutions
No ratings yet
Machine Learning Assignment Solutions
46 pages
Assignment 6 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
No ratings yet
Assignment 6 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
10 pages
Machine 2021 Jan-Apr Practice
No ratings yet
Machine 2021 Jan-Apr Practice
26 pages
Assignment 6 2024
No ratings yet
Assignment 6 2024
11 pages
Assignment 2: Introduction To Machine Learning Prof. B. Ravindran
100% (1)
Assignment 2: Introduction To Machine Learning Prof. B. Ravindran
3 pages
Week 5: Logistic Regression & SVM Quiz
100% (1)
Week 5: Logistic Regression & SVM Quiz
4 pages
Assignment 5
No ratings yet
Assignment 5
3 pages
2022 ML Assignments
100% (1)
2022 ML Assignments
45 pages
Assignment 10 2024
No ratings yet
Assignment 10 2024
5 pages
Assignment 6 (COPY)
No ratings yet
Assignment 6 (COPY)
6 pages
Machine Learning Week 1 Quiz
No ratings yet
Machine Learning Week 1 Quiz
3 pages
Sample Questions Answers
No ratings yet
Sample Questions Answers
8 pages
CS2351 AI Question Paper May June 2014
No ratings yet
CS2351 AI Question Paper May June 2014
4 pages
Pai 4
No ratings yet
Pai 4
37 pages

PA12

Uploaded by

PA12

Uploaded by

Practice Assignment 12

You might also like