Stars
Python bindings to the Zstandard (zstd) compression library
A feature-rich command-line audio/video downloader
Virtual whiteboard for sketching hand-drawn like diagrams
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
text window manager, shell multiplexer, integrated DevOps environment
CLI tool which enables you to login and retrieve AWS temporary credentials using a SAML IDP
Chrome extension to disable youtube video titles autotranslation
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
Web extension to set a default speed for video and audio
The official implementation of "Horizon Reduction Makes RL Scalable"
OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
MR.Q is a general-purpose model-free reinforcement learning algorithm.
Minimal reproduction of DeepSeek R1-Zero
Really Fast End-to-End Jax RL Implementations
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Python tool for converting files and office documents to Markdown.
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)
The matrix cookbook, proved in the Lean theorem prover