This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Python 32 3 Updated Oct 26, 2022

ikostrikov / jaxrl2

Jupyter Notebook 54 17 Updated Jan 20, 2023

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,732 1,318 Updated Apr 12, 2026

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,081 2,648 Updated Apr 17, 2026

Farama-Foundation / Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 508 63 Updated Jan 10, 2026

google / vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,639 110 Updated Feb 17, 2026

kayburns / furniture_sim

1 1 Updated Mar 25, 2023

SridharPandian / Holo-Dex

Official Implementation of Holo-Dex: Teaching Dexterity with Immersive Mixed Reality

Python 54 6 Updated Oct 25, 2022

zach-lawless / gym-wordle

Gym environment for playing Wordle with RL agents

Python 42 8 Updated Feb 8, 2022

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,520 32,902 Updated Apr 17, 2026

allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences

Python 2,387 203 Updated Mar 1, 2024

Farama-Foundation / miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 379 55 Updated Apr 6, 2026

Ilya Kostrikov ikostrikov

Stars

openai / evals

google-research / robopianist

mansimov / chatgpt_cli

ikostrikov / rlpd

tysam-code / hlb-CIFAR10

karpathy / nanoGPT

openai / point-e

openai / tiktoken

conglu1997 / v-d4rl

ZohaibAhmed / ChatGPT-Google

google-deepmind / mujoco_mpc

openai / openai-cookbook

peract / peract

microsoft / torchscale

nat / natbot

cohere-ai / sandbox-toy-semantic-search

jax-ml / jax-triton

maxreciprocate / offline

Asap7772 / PTR