Stars
Production-Grade Container Scheduling and Management
A high-throughput and memory-efficient inference and serving engine for LLMs
Distributed reliable key-value store for the most critical data of a distributed system
Consul is a distributed, highly available, and data center aware solution to connect and configure applications across dynamic, distributed infrastructure.
This project shares the technical principles behind large language models along with practical experience (LLM engineering and deploying LLM applications in production).
SGLang is a fast serving framework for large language models and vision language models.
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
NVIDIA Linux open GPU kernel module source
verl: Volcano Engine Reinforcement Learning for LLMs
Wan: Open and Advanced Large-Scale Video Generative Models
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
The official GitHub page for the survey paper "A Survey of Large Language Models".
FlashMLA: Efficient Multi-head Latent Attention Kernels
vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …
CUDA Templates and Python DSLs for High-Performance Linear Algebra
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
Cost-efficient and pluggable Infrastructure components for GenAI inference
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
My learning notes and code for ML systems (ML SYS).
FlashInfer: Kernel Library for LLM Serving