Skip to content
View ikostrikov's full-sized avatar

Organizations

@VisualComputingInstitute

Block or report ikostrikov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,223 2,929 Updated Apr 14, 2026

[CoRL '23] Dexterous piano playing with deep reinforcement learning.

Python 716 58 Updated Nov 2, 2024

Lightweight wrapper of the official ChatGPT API in your terminal

Shell 42 2 Updated Mar 10, 2023
Python 393 42 Updated Feb 13, 2023

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,300 80 Updated Dec 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,789 9,717 Updated Nov 12, 2025

Point cloud diffusion for 3D model synthesis

Python 6,877 800 Updated Jul 4, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,937 1,433 Updated Mar 27, 2026

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Python 112 9 Updated Apr 16, 2026

Chrome Extension that Integrates ChatGPT (Unofficial) into Google Search

JavaScript 519 53 Updated Dec 3, 2022

Real-time behaviour synthesis with MuJoCo, using Predictive Control

C++ 1,603 257 Updated Mar 20, 2026

Examples and guides for using the OpenAI API

Jupyter Notebook 72,798 12,275 Updated Apr 17, 2026

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

Python 487 69 Updated May 9, 2024

Foundation Architecture for (M)LLMs

Python 3,137 224 Updated Apr 11, 2024

Drive a browser with GPT-3

Python 1,933 272 Updated Jun 9, 2024

A demonstration of how a toy (but usable!) semantic search engine can be quickly built using Cohere's platform.

Python 117 6 Updated Jul 25, 2023

jax-triton contains integrations between JAX and OpenAI Triton

Python 444 57 Updated Apr 15, 2026

Offline RL experiments

Python 15 Updated Oct 1, 2022

This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Python 32 3 Updated Oct 26, 2022
Jupyter Notebook 54 17 Updated Jan 20, 2023

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,732 1,318 Updated Apr 12, 2026

Train transformer language models with reinforcement learning.

Python 18,081 2,648 Updated Apr 17, 2026

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 508 63 Updated Jan 10, 2026

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,639 110 Updated Feb 17, 2026

Official Implementation of Holo-Dex: Teaching Dexterity with Immersive Mixed Reality

Python 54 6 Updated Oct 25, 2022

Gym environment for playing Wordle with RL agents

Python 42 8 Updated Feb 8, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,520 32,902 Updated Apr 17, 2026

A modular RL library to fine-tune language models to human preferences

Python 2,387 203 Updated Mar 1, 2024

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 379 55 Updated Apr 6, 2026
Next