Skip to content
View anair13's full-sized avatar

Block or report anair13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A list of Offline to Online RL papers (continually updated)

67 Updated Nov 27, 2025

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 771 106 Updated Oct 27, 2025

hand importer and dataset creation tool for poker

Python 1 Updated Jan 28, 2024

The official Python library for the OpenAI API

Python 29,855 4,538 Updated Feb 6, 2026

🦜🔗 The platform for reliable agents.

Python 126,111 20,740 Updated Feb 6, 2026

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,320 165 Updated Aug 3, 2023
Python 15 1 Updated Mar 8, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,276 11,725 Updated Dec 15, 2025

Example notebooks for Reverse Engineering the Neural Tangent Kernel

Jupyter Notebook 9 Updated Jun 17, 2022

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,846 1,390 Updated Nov 4, 2024

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

313 17 Updated Nov 21, 2022

🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet

186 11 Updated Dec 16, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,051 981 Updated Jul 8, 2025

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Python 331 20 Updated Jun 13, 2022

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Python 348 33 Updated Jul 22, 2020

CNN for rope state estimation

Python 2 Updated Jan 17, 2020

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++ 1,580 164 Updated Jan 27, 2026

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,346 208 Updated May 19, 2025

Rich Image Visualization with Minimum Dependency (no OpenCV, Matplotlib)

Python 261 29 Updated Jan 28, 2026

Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World

Python 40 7 Updated Jul 24, 2023

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 364 59 Updated Mar 21, 2023

Advantage weighted Actor Critic for Offline RL

Python 52 8 Updated Aug 27, 2022

Tasks to get you started with MineRL

Python 39 7 Updated Jan 6, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,474 3,094 Updated Aug 15, 2024

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

281 29 Updated Jun 16, 2024

An offline deep reinforcement learning library

Python 1,639 263 Updated Sep 10, 2025

Collection of reinforcement learning algorithms

Python 2,858 568 Updated Jun 17, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,048 326 Updated Jan 29, 2026

SAPIEN Embodied AI Platform

C++ 715 64 Updated Feb 2, 2026

A PyTorch implementation of Implicit Q-Learning

Python 94 12 Updated Oct 23, 2021
Next