`tinyrl`

A minimal reinforcement learning toolkit built from scratch. Provides simple RL environments and training utilities for learning and experimenting with RL algorithms.

See the docs here to get started.

Built with the help of claude code and a little bit of "taste"

Install

pip install git+https://github.com/rosikand/tinyrl.git

Usage

from tinyrl import GridWorld, Runner, RandomPolicy

env = GridWorld()
runner = Runner(env)
policy = RandomPolicy(n_actions=env.n_actions)

# run an episode
result = runner.run_episode(policy)
print(result.reward, result.steps)

# with trajectory
result = runner.run_episode(policy, return_trajectory=True)
print(result.trajectory.obs.shape)

# plot training stats
runner.plot()

Package structure

tinyrl/
  core/          # Environment, Policy, Runner, TrainingMonitor, types
  envs/          # GridWorld, ...
  algorithms/    # RandomPolicy, ...

Environments

GridWorld — 5x5 grid, agent navigates from (0,0) to goal at (4,4). Actions: up, right, down, left. Reward: -1 per step, +10 at goal.

Adding a new environment

Subclass Environment and implement reset, step, _get_obs, and render:

from tinyrl import Environment

class MyEnv(Environment):
    def __init__(self):
        self.state_dim = ...
        self.n_actions = ...   # for discrete
        self.action_dim = ...  # for continuous
        self.max_steps = ...

    def reset(self): ...
    def step(self, action): ...
    def _get_obs(self): ...
    def render(self, action=None, step_num=0): ...

Adding a new policy

Subclass Policy and implement __call__:

from tinyrl import Policy, PolicyOutput

class MyPolicy(Policy):
    def __call__(self, obs):
        action = ...
        return PolicyOutput(action=action, logprob=-0.5, entropy=1.2)

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
docs		docs
examples		examples
src/tinyrl		src/tinyrl
.gitignore		.gitignore
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

`tinyrl`

Install

Usage

Package structure

Environments

Adding a new environment

Adding a new policy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tinyrl

Install

Usage

Package structure

Environments

Adding a new environment

Adding a new policy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`tinyrl`

Packages