Stars
HAMi-core compiles libvgpu.so, which enforces hard GPU resource limits inside containers
A workload for deploying LLM inference services on Kubernetes
SGLang is a fast serving framework for large language models and vision language models.
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
verl: Volcano Engine Reinforcement Learning for LLMs
My learning notes and code for ML SYS.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Production-Grade Container Scheduling and Management
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Wan: Open and Advanced Large-Scale Video Generative Models
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
Chat2Graph: Graph Native Agentic System.
Cost-efficient and pluggable Infrastructure components for GenAI inference
FlashInfer: Kernel Library for LLM Serving
KCL Programming Language (CNCF Sandbox Project). https://kcl-lang.io
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Heterogeneous AI Computing Virtualization Middleware (project under CNCF)
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
FlashMLA: Efficient Multi-head Latent Attention Kernels
HugeSCM - A next-generation cloud-based version control system
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
Efficient and easy multi-instance LLM serving
The Triton TensorRT-LLM Backend
AI fundamentals - GPU architecture, CUDA programming, large language model basics, and AI Agent topics