anair13

Follow

Ashvin Nair anair13

Follow

Graduate student at UC Berkeley

90 followers · 6 following

Berkeley, CA
ashvin.me

Achievements

Achievements

Stars

linhlpv / awesome-offline-to-online-RL-papers

A list of Offline to Online RL papers (continually updated)

67 Updated Nov 27, 2025

rail-berkeley / serl

SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning

Python 771 106 Updated Oct 27, 2025

Morgan-Griffiths / poker_hand_importer

hand importer and dataset creation tool for poker

Python 1 Updated Jan 28, 2024

openai / openai-python

The official Python library for the OpenAI API

Python 29,855 4,538 Updated Feb 6, 2026

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 126,111 20,740 Updated Feb 6, 2026

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,320 165 Updated Aug 3, 2023

patrickhaoy / ptp

Python 15 1 Updated Mar 8, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 94,276 11,725 Updated Dec 15, 2025

james-simon / reverse-engineering

Example notebooks for Reverse Engineering the Neural Tangent Kernel

Jupyter Notebook 9 Updated Jun 17, 2022

google / dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,846 1,390 Updated Nov 4, 2024

craffel / llm-seminar

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

313 17 Updated Nov 21, 2022

cleanlab / label-errors

🛠️ Corrected Test Sets for ImageNet, MNIST, CIFAR, Caltech-256, QuickDraw, IMDB, Amazon Reviews, 20News, and AudioSet

186 11 Updated Dec 16, 2025

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,051 981 Updated Jul 8, 2025

szymonmaszke / torchdatasets

PyTorch dataset extended with map, cache etc. (tensorflow.data like)

Python 331 20 Updated Jun 13, 2022

facebookresearch / TorchRay

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

Python 348 33 Updated Jul 22, 2020

myyan92 / TF_cloth2d

CNN for rope state estimation

Python 2 Updated Jan 17, 2020

symforce-org / symforce

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++ 1,580 164 Updated Jan 27, 2026

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,346 208 Updated May 19, 2025

wkentaro / imgviz

Rich Image Visualization with Minimum Dependency (no OpenCV, Matplotlib)

Python 261 29 Updated Jan 28, 2026

xieliang555 / SFN

Learning to Fill the Seam by Vision: Sub-millimeter Peg-in-hole on Unseen Shapes in Real World

Python 40 7 Updated Jul 24, 2023

facebookresearch / r3m

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 364 59 Updated Mar 21, 2023

hari-sikchi / AWAC

Advantage weighted Actor Critic for Offline RL

Python 52 8 Updated Aug 27, 2022

minerllabs / getting-started-tasks

Tasks to get you started with MineRL

Python 39 7 Updated Jan 6, 2025

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 23,474 3,094 Updated Aug 15, 2024

montrealrobotics / DeepRLInTheWorld

From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..

281 29 Updated Jun 16, 2024

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,639 263 Updated Sep 10, 2025

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,858 568 Updated Jun 17, 2024

google / brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 3,048 326 Updated Jan 29, 2026

haosulab / SAPIEN

SAPIEN Embodied AI Platform

C++ 715 64 Updated Feb 2, 2026

gwthomas / IQL-PyTorch

A PyTorch implementation of Implicit Q-Learning

Python 94 12 Updated Oct 23, 2021