Nanjing University/cs phd
-
Nanjing University
- Nanjing
-
18:27
(UTC -12:00) - fvliang.github.io
- https://www.nju.edu.cn/en/
Highlights
- Pro
Stars
Minimal and annotated implementations of key ideas from modern deep learning research.
分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Official Implementation of DART (DART: Diffusion-Inspired Speculative Decoding for Fast LLM Inference).
LLM API 管理 & 分发系统,支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型,统一 API 适配,可用于 key 管理与二次分发。单可执行文件,提供 Docker 镜像,一键部署,开箱即用。LLM API management & k…