Stars
Awesome VLM-CL: Continual Learning for VLMs — a survey and taxonomy beyond forgetting
📖 A repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Use PEFT or full-parameter training to run CPT/SFT/DPO/GRPO on 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
SGLang is a fast serving framework for large language models and vision language models.
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
Use Kimi's latest model (kimi-k2-0711-preview) to drive your Claude Code.
verl: Volcano Engine Reinforcement Learning for LLMs
Train transformer language models with reinforcement learning.
📚 LeetCUDA: modern CUDA learning notes with PyTorch for beginners 🐑, 200+ CUDA kernels, Tensor Cores, HGEMM, FA-2 MMA. 🎉
Universal LLM Deployment Engine with ML Compilation
CUDA Templates and Python DSLs for High-Performance Linear Algebra
FlashInfer: Kernel Library for LLM Serving
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A framework for few-shot evaluation of language models.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
🚀🚀 Train a 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Introduction to Machine Learning Systems
A high-throughput and memory-efficient inference and serving engine for LLMs
TinyChatEngine: On-Device LLM Inference Library
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Development repository for the Triton language and compiler
Fast and memory-efficient exact attention
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Andr…
Chinese NLP solutions (LLMs, data, models, training, inference)