Lists (1)
Sort Name ascending (A-Z)
Stars
OpenAI Guardrails Python (Preview)
Pretraining data reconstruction scripts for Apertus
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Opensource benchmark evaluating web operators/agents performance
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A library for making RepE control vectors
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
Democratizing Reinforcement Learning for LLMs
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
SkyRL: A Modular Full-stack RL Library for LLMs
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Minimal reproduction of DeepSeek R1-Zero
Scalable RL solution for advanced reasoning of language models
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
A lightweight LMM-based Document Parsing Model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Solve Visual Understanding with Reinforced VLMs
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Environments for LLM Reinforcement Learning
A playbook for systematically maximizing the performance of deep learning models.
🤗 smolagents: a barebones library for agents that think in code.
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Inference and training library for high-quality TTS models.
verl: Volcano Engine Reinforcement Learning for LLMs