rl-notes

Reward shaping demo for 1.041/1.200.

Dependencies

Python 3.11

    conda install notebook

Credits: Original code courtesy Introduction to Reinforcement Learning by Tim Miller, The University of Queensland.