Skip to content
View anair13's full-sized avatar

Block or report anair13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A list of Offline to Online RL papers (continually updated)

60 Updated Nov 27, 2025

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 747 103 Updated Oct 27, 2025

hand importer and dataset creation tool for poker

Python 1 Updated Jan 28, 2024

The official Python library for the OpenAI API

Python 29,519 4,471 Updated Dec 19, 2025

🦜🔗 The platform for reliable agents.

Python 122,269 20,159 Updated Dec 19, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,290 162 Updated Aug 3, 2023
Python 15 1 Updated Mar 8, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 92,154 11,546 Updated Dec 15, 2025

Example notebooks for Reverse Engineering the Neural Tangent Kernel

Jupyter Notebook 9 Updated Jun 17, 2022

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,834 1,394 Updated Nov 4, 2024

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

313 17 Updated Nov 21, 2022

🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet

186 10 Updated Dec 16, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,568 928 Updated Jul 8, 2025

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Python 332 20 Updated Jun 13, 2022

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Python 348 33 Updated Jul 22, 2020

CNN for rope state estimation

Python 2 Updated Jan 17, 2020

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++ 1,554 160 Updated Dec 10, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,282 203 Updated May 19, 2025

Image Visualization Tools (object detection, semantic and instance segmentation)

Python 257 30 Updated Nov 22, 2024

Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World

Python 39 7 Updated Jul 24, 2023

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 356 58 Updated Mar 21, 2023

Advantage weighted Actor Critic for Offline RL

Python 51 8 Updated Aug 27, 2022

Tasks to get you started with MineRL

Python 39 7 Updated Jan 6, 2025

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,170 3,036 Updated Aug 15, 2024

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

278 28 Updated Jun 16, 2024

An offline deep reinforcement learning library

Python 1,609 261 Updated Sep 10, 2025

Collection of reinforcement learning algorithms

Python 2,836 565 Updated Jun 17, 2024

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,986 321 Updated Dec 16, 2025

SAPIEN Embodied AI Platform

C++ 695 63 Updated Dec 18, 2025

A PyTorch implementation of Implicit Q-Learning

Python 93 12 Updated Oct 23, 2021
Next