Stars
Graphs that teach > graphs that impress. Turn any code into an interactive knowledge graph you can explore, search, and ask questions about. Works with Claude Code, Codex, Cursor, Copilot, Gemini C…
Standardize benchmark wrapping so the community can wrap various otherwise-incompatible benchmarks uniformly and use them everywhere.
Drive OSS standards and tools for data curation and evaluation creation for state-of-the-art AI agents
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
Connect your Claude Code to SilverStream's advanced tracing platform!
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
🔥 Clone and recreate any website as a modern React app in seconds
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Ebiose is a distributed artificial intelligence factory, an open-source project from the incubator of Inria (a French research lab).
Zant simplifies the deployment and optimization of neural networks on microprocessors
Simulation platform for general-purpose robotics & embodied AI learning.
The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.
A differentiable physics engine and multibody dynamics library for control and robot learning.
Scripts to recreate the D4RL datasets with Minari
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
A modular, easy to extend GFlowNet library
Benchmarking RL generalization in an interpretable way.
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Deeplake is an AI data runtime for agents. It provides serverless Postgres with a multimodal data lake, enabling scalable retrieval and training.
A JAX-based simulator for autonomous driving research.
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities
This repository contains implementations and illustrative code to accompany DeepMind publications
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC