yukavio

KavioYu yukavio

Works at Tencent-WXG. Focuses on model inference optimization, including inference engines and model compression.

20 followers · 2 following

Shanghai

Stars

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 62,146 stars · 11,046 forks · Updated Nov 5, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python · 19,761 stars · 3,273 forks · Updated Nov 5, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 stars · 286 forks · Updated May 15, 2025

PaddlePaddle / Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge

C++ · 7,171 stars · 1,629 forks · Updated May 22, 2025

PaddlePaddle / PaddleClas

A treasure chest for visual classification and recognition powered by PaddlePaddle

Python · 5,750 stars · 1,193 forks · Updated Oct 27, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust · 5,427 stars · 673 forks · Updated Nov 5, 2025

fla-org / flash-linear-attention

Efficient implementations of state-of-the-art linear attention models

Python · 3,745 stars · 292 forks · Updated Nov 5, 2025

PaddlePaddle / PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

Python · 1,609 stars · 353 forks · Updated Oct 27, 2025

ByteDance-Seed / Triton-distributed

Distributed Compiler based on Triton for Parallel Systems

Python · 1,213 stars · 104 forks · Updated Oct 17, 2025

xckevin / books

some interesting books

259 stars · 105 forks · Updated Sep 4, 2025

Tencent / POINTS-Reader

184 stars · 7 forks · Updated Sep 16, 2025

PaddlePaddle / CINN

Compiler Infrastructure for Neural Networks

C++ · 147 stars · 114 forks · Updated Jul 18, 2023
