Starred repositories
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
This project shares technical principles and hands-on experience with large models (LLM engineering and real-world LLM application deployment).
Kubernetes-native AI serving platform for scalable model serving.
A Datacenter Scale Distributed Inference Serving Framework
Community maintained hardware plugin for vLLM on Ascend
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
A high-throughput and memory-efficient inference and serving engine for LLMs
vLLM Kunlun (vllm-kunlun) is a community-maintained hardware plugin designed to seamlessly run vLLM on the Kunlun XPU.
Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm and application engineers.
The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.
System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge
AI fundamentals - GPU architecture, CUDA programming, LLM basics, and AI Agent topics.
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
SGLang is a high-performance serving framework for large language models and multimodal models.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
tanjunchen / open-webui
Forked from open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
AIInfra (AI infrastructure) covers the AI system stack, from underlying hardware such as chips up through the software layers that support large-model training and inference.
My learning notes for ML SYS.
Large-model knowledge sharing that anyone can understand - a must-read before LLM interviews in spring/autumn recruiting, so you can discuss the material confidently with interviewers.
FlashMLA: Efficient Multi-head Latent Attention Kernels