AnnaYue

5 followers · 24 following

Ant Group
Shanghai

Stars

59 results for source starred repositories

kubernetes / kubernetes

Production-Grade Container Scheduling and Management

Go 118,430 41,650 Updated Nov 6, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,339 11,080 Updated Nov 6, 2025

etcd-io / etcd

Distributed reliable key-value store for the most critical data of a distributed system

Go 50,708 10,208 Updated Nov 6, 2025

hashicorp / consul

Consul is a distributed, highly available, and data center aware solution to connect and configure applications across dynamic, distributed infrastructure.

Go 29,490 4,544 Updated Nov 6, 2025

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 28,091 3,266 Updated Jun 26, 2025

liguodongiot / llm-action

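This project aims to share the technical principles of large models and hands-on experience with them (large-model engineering and real-world application deployment).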

HTML 21,719 2,545 Updated Oct 19, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,866 3,288 Updated Nov 6, 2025

eosphoros-ai / DB-GPT

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Python 17,580 2,456 Updated Nov 6, 2025

temporalio / temporal

Temporal service

Go 16,471 1,169 Updated Nov 6, 2025

NVIDIA / open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

C 16,329 1,517 Updated Nov 4, 2025

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 15,966 1,523 Updated Jan 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,179 2,438 Updated Nov 6, 2025

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,640 2,111 Updated Jul 17, 2025

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,332 2,264 Updated Nov 6, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,053 1,844 Updated Nov 6, 2025

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,946 932 Updated Mar 11, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,848 896 Updated Sep 30, 2025

loft-sh / vcluster

vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it …

Go 10,677 535 Updated Nov 6, 2025