Mjlab Cartpole example

This is an example of a simple CartPole environment using mjlab for training. This repository is both a pedagogical example using mjlab and an example of organization using mjlab as an external dependency.

Installing

Simply run:

uv sync

Running the training

To run the training, use:

uv run train Mjlab-Cartpole

Playing the environment

Run:

uv run play Mjlab-Cartpole --checkpoint-file [path-to-checkpoint]

The checkpoint will typically appear in logs/rsl_rl/exp1/[date]/model_499.pt

Organization

The structure is as follow:

src/mjlab_cartpole/
- robot/: Robot model
  - xmls/cartpole.xml: MuJoCo MJCF spec
  - cartpole_constants.py: Describing how to load the robot entity (mostly loading the XML spec here)
- tasks/: Task definition
  - __init__.py: Environments are registered here
  - cartpole_env_cfg.py: Environment configurations (actions, rewards, termination, reset etc.)

Here, mjlab is used as an external dependency.

Environment

Action

Actions is the force $f$ applied to the cart (Newton).

Observation

Agent observation is the pole angle and velocity and the cart position and velocity.

Reward

The goal is for the pole not to fall, while navigating the cart to the center, the reward is as follows:

$$ r = 5 \times cos(\theta) + exp(-\frac{x^2}{\sigma^2}) - 10^{-2} ({\frac{f}{20}})^2 $$

Where:

$\theta$ is the pole angle (0 is upright)
$x$ is the cart position
$\sigma = 0.3$ is a deviation to the center
$f$ is the applied cart force (command)

Termination & truncation

A termination is issued when $|\theta| > 30 \space deg$ and a timeout happens after 10s.

References

This is inspired by the Creating a New Task markdown from mjlab

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
imgs		imgs
src/mjlab_cartpole		src/mjlab_cartpole
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mjlab Cartpole example

Installing

Running the training

Playing the environment

Organization

Environment

Action

Observation

Reward

Termination & truncation

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Mjlab Cartpole example

Installing

Running the training

Playing the environment

Organization

Environment

Action

Observation

Reward

Termination & truncation

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages