Skip to content
View anair13's full-sized avatar

Block or report anair13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A list of Offline to Online RL papers (continually updated)

78 Updated Mar 7, 2026

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 795 107 Updated Oct 27, 2025

hand importer and dataset creation tool for poker

Python 1 Updated Jan 28, 2024

The official Python library for the OpenAI API

Python 30,349 4,653 Updated Mar 23, 2026

The agent engineering platform

Python 130,763 21,540 Updated Mar 23, 2026

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,334 164 Updated Aug 3, 2023
Python 15 1 Updated Mar 8, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 96,493 11,918 Updated Dec 15, 2025

Example notebooks for Reverse Engineering the Neural Tangent Kernel

Jupyter Notebook 9 Updated Jun 17, 2022

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,856 1,391 Updated Nov 4, 2024

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

314 17 Updated Nov 21, 2022

🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet

187 11 Updated Dec 16, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,386 1,028 Updated Jul 8, 2025

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Python 331 20 Updated Jun 13, 2022

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Python 349 33 Updated Jul 22, 2020

CNN for rope state estimation

Python 2 Updated Jan 17, 2020

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++ 1,589 166 Updated Feb 12, 2026

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,389 213 Updated May 19, 2025

Rich Image Visualization with Minimum Dependency (no OpenCV, Matplotlib)

Python 262 29 Updated Jan 28, 2026

Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World

Python 41 7 Updated Jul 24, 2023

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 367 59 Updated Mar 21, 2023

Advantage weighted Actor Critic for Offline RL

Python 53 8 Updated Aug 27, 2022

Tasks to get you started with MineRL

Python 40 7 Updated Jan 6, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,965 3,168 Updated Aug 15, 2024

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

282 29 Updated Jun 16, 2024

An offline deep reinforcement learning library

Python 1,647 264 Updated Sep 10, 2025

Collection of reinforcement learning algorithms

Python 2,886 570 Updated Jun 17, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,102 333 Updated Mar 16, 2026

SAPIEN Embodied AI Platform

C++ 739 69 Updated Mar 10, 2026

A PyTorch implementation of Implicit Q-Learning

Python 97 13 Updated Oct 23, 2021
Next