Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,398 1,303 Updated Mar 28, 2026

datawhalechina / happy-llm

📚 从零开始构建大模型

Jupyter Notebook 28,034 2,588 Updated Mar 16, 2026

datawhalechina / self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

Jupyter Notebook 29,330 2,885 Updated Mar 27, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 12,473 1,798 Updated Nov 3, 2025

yakhyo / uniface

UniFace: A Unified Face Analysis Library in Python built on ONNX Runtime | Actively being maintained by @yakhyo

Python 641 87 Updated Mar 27, 2026

yakhyo / gaze-estimation

MobileGaze: Real-Time Gaze Estimation models using ResNet 18/34/50, MobileNet v2 and MobileOne s0-s4 | In PyTorch >> ONNX Runtime Inference

Python 180 37 Updated Feb 14, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,381 13,641 Updated Mar 26, 2026

zhangtianhong-1998 / LLM_infra_from_scratch

这是一个基于C++实现的从零开始的大模型推理框架

C++ 10 1 Updated Nov 18, 2024

hahnyuan / LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 630 83 Updated Sep 11, 2024

ZhangGe6 / onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

JavaScript 1,618 198 Updated Nov 19, 2025

xlite-dev / ffpa-attn

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda 254 14 Updated Feb 13, 2026

gabime / spdlog

Fast C++ logging library.

C++ 28,557 5,096 Updated Mar 14, 2026

NVIDIA / cudnn-frontend

cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it

Python 699 146 Updated Mar 27, 2026

YuxueYang1204 / CudaDemo

Implement custom operators in PyTorch with cuda/c++

Python 77 11 Updated Jan 1, 2023

mlc-ai / xgrammar

Fast, Flexible and Portable Structured Generation

C++ 1,602 134 Updated Mar 27, 2026

vllm-project / vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

Python 1,843 986 Updated Mar 28, 2026

herumi / fmath

fast log and exp functions for AVX2/AVX-512

Python 243 37 Updated Mar 12, 2025

zjhellofss / KuiperLLama

校招、秋招、春招、实习好项目，带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++ 522 134 Updated Oct 28, 2025

huggingface / course

The Hugging Face course on Transformers

MDX 3,803 1,289 Updated Mar 17, 2026

huggingface / optimum-onnx

🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime

Python 128 40 Updated Mar 12, 2026

huggingface / optimum

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,341 628 Updated Mar 13, 2026

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 24,393 2,854 Updated Oct 10, 2025

ModelTC / LightLLM

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 3,978 313 Updated Mar 28, 2026

abseil / abseil-cpp

Abseil Common Libraries (C++)

C++ 17,147 2,986 Updated Mar 26, 2026

jarro2783 / cxxopts

Lightweight C++ command line option parser

C++ 4,722 640 Updated Mar 28, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 6,290 843 Updated Mar 22, 2026

2017ZYS

Lists (32)

AI框架与一些工具项目

c++项目

CNN网络

CUDA相关项目

NLP项目

OCR相关项目

opencv相关项目

pdf解析器

tensorrt项目

transformer相关项目

关键点检测项目

动手学深度学习

图像分割项目

图像分类相关项目

图像检索

多模态

对比学习

数据结构与算法刷题

数据集

文本数据合成

文本检测项目

文本识别项目

文档矫正

杂项

深度学习论文精度

目标检测相关项目

直线检测相关仓库

自编码模型

表格识别项目

视频分类

视频切割

骨干网络

Stars