Reinforcement Learning
Programming Assignment 1
          Due: Friday, Nov 15, 2024 (11:59pm)
Instructions
  • This assignment is to be done in groups of 3. It is fine to work in groups of 2
    or individually, but I prefer groups of 3.
  • This assignment is to be submitted online via LMS on or before Nov 15, 2024
    (11:59pm). Please submit two programs, grid-world-1.py and grid-world-2.py,
    and a single PDF file containing your written answer to Question 1 and the
    policy and value function images for Questions 3 and 4.
  • This assignment is for 100 points. It is worth 10% of your final grade.
Problems
The objective of this assignment is to give you practical experience with the
MDP algorithms we learned in class. We will work with the Grid World problem in
Example 3.5 (page 61) from Chapter 3 of the textbook. Please download the entire
project from this link: Python code for examples in the textbook. We will take
the program grid-world.py in the folder chapter03 from this project and make
changes to it.
  1. (15 points) Describe in 1-2 sentences what these functions do in grid-world.py.
        • figure_3_2_linear_system()
        • figure_3_2()
        • figure_3_5()
  2. (25 points) Make a copy of grid-world.py and rename it as grid-world-1.py. Add
     a function get_epsilon_greedy_policy(value_vector, epsilon) that computes and
     returns an ϵ-greedy policy π_ϵ^V with respect to the value vector V. The
     function should take V and ϵ as arguments.
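     As a starting point, here is a minimal sketch of one possible interface. It
     assumes the helpers step(state, action), ACTIONS, WORLD_SIZE, and DISCOUNT
     from grid-world.py, and that value_vector is the WORLD_SIZE x WORLD_SIZE
     array of state values; this is a sketch, not the required implementation.

         import numpy as np

         def get_epsilon_greedy_policy(value_vector, epsilon):
             # Returns a (WORLD_SIZE, WORLD_SIZE, len(ACTIONS)) array of action
             # probabilities: every action gets epsilon / |A| of the mass, and
             # the greedy action w.r.t. value_vector gets 1 - epsilon on top.
             policy = np.zeros((WORLD_SIZE, WORLD_SIZE, len(ACTIONS)))
             for i in range(WORLD_SIZE):
                 for j in range(WORLD_SIZE):
                     # One-step lookahead: value of each action from (i, j).
                     action_values = []
                     for action in ACTIONS:
                         (ni, nj), reward = step([i, j], action)
                         action_values.append(reward + DISCOUNT * value_vector[ni, nj])
                     best = np.argmax(action_values)
                     policy[i, j, :] = epsilon / len(ACTIONS)   # exploration mass
                     policy[i, j, best] += 1.0 - epsilon        # greedy mass
             return policy

     Note that with ϵ = 0 this reduces to the plain greedy policy, which is the
     ϵ = 0.0 case you are asked to test in Question 3.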
  3. (30 points) Add to grid-world-1.py a function policy_iteration(epsilon) that
     computes and returns an optimal ϵ-greedy policy π_ϵ^* using the policy iter-
     ation algorithm. The function should take ϵ as an argument. Use the func-
     tions draw_image(image) and draw_policy(optimal_values) to save the pol-
     icy and value function to the Images folder. You need to make a copy of
     figure_3_2_linear_system() and change it to take a policy as an argument and
     evaluate it. Test the function with ϵ values 0.2 and 0.0.
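     The overall loop is ordinary policy iteration with ϵ-greedy improvement. A
     minimal sketch, assuming get_epsilon_greedy_policy() from Question 2 and a
     hypothetical helper evaluate_policy(policy) (your modified copy of
     figure_3_2_linear_system()) that solves the linear system for the given
     policy and returns its value function as a WORLD_SIZE x WORLD_SIZE array:

         def policy_iteration(epsilon):
             value = np.zeros((WORLD_SIZE, WORLD_SIZE))     # initial guess V = 0
             policy = get_epsilon_greedy_policy(value, epsilon)
             while True:
                 value = evaluate_policy(policy)            # policy evaluation
                 new_policy = get_epsilon_greedy_policy(value, epsilon)  # improvement
                 if np.allclose(new_policy, policy):        # policy stable: done
                     return new_policy, value
                 policy = new_policy

     Because the ϵ-greedy improvement step is deterministic given V, comparing
     successive policies with np.allclose is a reasonable stopping test.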
  4. (30 points) Make a copy of grid-world-1.py and rename it as grid-world-2.py.
     Change the function step(state, action) and all the other functions to make
     actions stochastic as defined next. For example, for action left the next state
     should be the square to the left with probability 0.85, the square to the right
     with probability 0.05, the square above with probability 0.05, and the square
     below with probability 0.05. If the action takes you off the grid, do what the
     existing code does to handle that. The other 3 actions are defined similarly.
     Test all these functions for the new action definition (listed below; a sketch
     of the stochastic step follows the list).
        • figure_3_2_linear_system()
        • figure_3_2()
        • figure_3_5()
        • policy_iteration(epsilon) with ϵ values 0.2 and 0.0
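     A minimal sketch of the stochastic step, under two assumptions: the original
     deterministic step(state, action) is kept under the hypothetical name
     deterministic_step(), so off-grid moves are still handled by the existing
     code, and ACTIONS holds the four moves as in grid-world.py:

         def step(state, action):
             # Returns a list of (probability, next_state, reward) outcomes:
             # the commanded action happens with probability 0.85, and each
             # of the other three directions with probability 0.05.
             outcomes = []
             for move in ACTIONS:
                 prob = 0.85 if np.array_equal(move, action) else 0.05
                 next_state, reward = deterministic_step(state, move)
                 outcomes.append((prob, next_state, reward))
             return outcomes

     Every place that previously used a single (next_state, reward) pair then
     computes an expectation over the outcomes, i.e. the sum of
     prob * (reward + DISCOUNT * V[next_state]) over all four entries.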