Skip to content
View ikostrikov's full-sized avatar

Organizations

@VisualComputingInstitute

Block or report ikostrikov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,183 2,919 Updated Apr 6, 2026

[CoRL '23] Dexterous piano playing with deep reinforcement learning.

Python 716 58 Updated Nov 2, 2024

Lightweight wrapper of the official ChatGPT API in your terminal

Shell 42 2 Updated Mar 10, 2023
Python 392 41 Updated Feb 13, 2023

Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)

Python 1,300 80 Updated Dec 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,555 9,664 Updated Nov 12, 2025

Point cloud diffusion for 3D model synthesis

Python 6,875 799 Updated Jul 4, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 17,876 1,431 Updated Mar 27, 2026

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Python 112 9 Updated May 27, 2024

Chrome Extension that Integrates ChatGPT (Unofficial) into Google Search

JavaScript 519 53 Updated Dec 3, 2022

Real-time behaviour synthesis with MuJoCo, using Predictive Control

C++ 1,599 257 Updated Mar 20, 2026

Examples and guides for using the OpenAI API

Jupyter Notebook 72,695 12,268 Updated Apr 11, 2026

Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation

Python 486 69 Updated May 9, 2024

Foundation Architecture for (M)LLMs

Python 3,136 225 Updated Apr 11, 2024

Drive a browser with GPT-3

Python 1,934 273 Updated Jun 9, 2024

A demonstration of how a toy (but usable!) semantic search engine can be quickly built using Cohere's platform.

Python 117 6 Updated Jul 25, 2023

jax-triton contains integrations between JAX and OpenAI Triton

Python 444 57 Updated Mar 26, 2026

Offline RL experiments

Python 15 Updated Oct 1, 2022

This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.

Python 32 3 Updated Oct 26, 2022
Jupyter Notebook 54 17 Updated Jan 20, 2023

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,694 1,317 Updated Apr 12, 2026

Train transformer language models with reinforcement learning.

Python 18,011 2,636 Updated Apr 12, 2026

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Python 507 63 Updated Jan 10, 2026

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,639 111 Updated Feb 17, 2026

Official Implementation of Holo-Dex: Teaching Dexterity with Immersive Mixed Reality

Python 54 6 Updated Oct 25, 2022

Gym environment for playing Wordle with RL agents

Python 42 8 Updated Feb 8, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,248 32,845 Updated Apr 11, 2026

A modular RL library to fine-tune language models to human preferences

Python 2,384 203 Updated Mar 1, 2024

MiniWoB++: a web interaction benchmark for reinforcement learning

HTML 377 55 Updated Apr 6, 2026
Next