Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
verl: Volcano Engine Reinforcement Learning for LLMs
An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Fully open data curation for reasoning models
Official Repo for Open-Reasoner-Zero
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Training Large Language Model to Reason in a Continuous Latent Space
A Python package for causal inference in quasi-experimental settings
Training Sparse Autoencoders on Language Models
Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input
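Welcome to LLM-Dojo, an open-source learning ground for large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, and others), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓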
Pretraining and inference code for a large-scale depth-recurrent language model
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Tool for data extraction and interacting with Lean programmatically.
Verify the precision of all Kimi K2 API vendors
Official implementation of X-Master, a general-purpose tool-augmented reasoning agent.
[TMLR 2025] Efficient Reasoning Models: A Survey
Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
Automated Hypothesis Testing with Agentic Sequential Falsifications
A language agent gym with challenging scientific tasks
TART: A plug-and-play Transformer module for task-agnostic reasoning