wangyikewxgm

wyike wangyikewxgm

20 followers · 23 following

alibabacloud
Beijing

Stars

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,208 85 Updated Aug 28, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 9,847 1,238 Updated Nov 3, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,455 475 Updated Dec 20, 2025

BBuf / tvm_mlir_learn

A collection of compiler learning resources.

Python 2,618 362 Updated Mar 19, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 4,713 299 Updated Dec 19, 2025

wangshusen / DRL

Deep Reinforcement Learning

4,365 652 Updated Dec 10, 2022

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,649 2,859 Updated Dec 20, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,720 2,370 Updated Dec 20, 2025

google-research / vision_transformer

Jupyter Notebook 12,140 1,431 Updated Mar 6, 2025

volcengine / veScale

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 910 53 Updated Nov 27, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 924 45 Updated Oct 29, 2025

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,488 2,293 Updated Dec 11, 2025

ollama / ollama

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 157,971 13,971 Updated Dec 19, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,826 3,814 Updated Dec 20, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,086 869 Updated Dec 17, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 29,870 3,153 Updated Dec 20, 2025

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,087 31,495 Updated Dec 20, 2025

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,309 2,130 Updated Dec 18, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,831 12,091 Updated Dec 20, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 21,199 2,232 Updated Dec 20, 2025

kubernetes / community

Kubernetes community content

Jupyter Notebook 12,686 5,330 Updated Dec 19, 2025

kserve / kserve

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Shell 4,924 1,323 Updated Dec 19, 2025

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 180,399 46,188 Updated Dec 20, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,257 8,584 Updated Nov 12, 2025

langchain-ai / langchain

🦜🔗 The platform for reliable agents.

Python 122,315 20,162 Updated Dec 20, 2025

kubevela / kubevela

The Modern Application Platform.

Go 7,641 961 Updated Dec 16, 2025

rakyll / openai-go

Go client libraries for OpenAI

Go 450 34 Updated Nov 29, 2023

alibaba / higress

🤖 AI Gateway | AI Native API Gateway

Go 7,102 927 Updated Dec 20, 2025

gocrane / crane

Crane is a FinOps Platform for Cloud Resource Analytics and Economics in Kubernetes clusters. The goal is not only to help users to manage cloud cost easier but also ensure the quality of applicati…

Go 2,017 401 Updated Dec 20, 2024

h8r-dev / heighliner

An app development platform using cloud native stacks

Go 135 13 Updated Jul 15, 2022
