- Amsterdam
Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
⚡ A Fast, Extensible Progress Bar for Python and CLI
A generative world for general-purpose robotics & embodied AI learning.
Fully open reproduction of DeepSeek-R1
SGLang is a fast serving framework for large language models and vision language models.
State-of-the-Art Text Embeddings
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
✨✨ Latest Advances on Multimodal Large Language Models
Train transformer language models with reinforcement learning.
verl: Volcano Engine Reinforcement Learning for LLMs
A beautiful, simple, clean, and responsive Jekyll theme for academics
Minimal reproduction of DeepSeek R1-Zero
Open source annotation tool for machine learning practitioners.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
A Collection of Variational Autoencoders (VAE) in PyTorch.
An open source library for deep learning end-to-end dialog systems and chatbots.
Lean 4 programming language and theorem prover
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Open-source implementation of AlphaEvolve
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
An open-source framework for training large multimodal models.
Environments for LLM Reinforcement Learning
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection