Skip to content
View fvliang's full-sized avatar

Highlights

  • Pro

Block or report fvliang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,292 107 Updated Jan 29, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 1,515 121 Updated Apr 3, 2026

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 756 193 Updated Apr 2, 2026

Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).

Python 50 1 Updated Feb 8, 2026

LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…

JavaScript 31,410 5,974 Updated Jan 9, 2026