Tencent - Shenzhen, China
Stars
Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
Dynamic Memory Management for Serving LLMs without PagedAttention
Persist and reuse KV Cache to speed up your LLM.
DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Ongoing research training transformer models at scale
HuggingFace conversion and training library for Megatron-based models
TeRM: Extending RDMA-Attached Memory with SSD [FAST'24]
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and expert parallelism (EP, e.g., GPU-driven)
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
A multi-agent LLM framework for Chinese financial trading - an enhanced Chinese edition of TradingAgents
Speed-up of over 50% on average vs. the traditional memcpy in GCC 4.9 or VC2012
A plugin that lets EC2 developers use libfabric as the network provider when running NCCL applications.
Efficient GPU communication over multiple NICs.
Documentation of NVIDIA chip/hardware interfaces
[NSDI'25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training
AI-based command-line tool to quickly generate standardized commit messages.
SGLang is a fast serving framework for large language models and vision language models.
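
As a hedged illustration of how a serving framework like SGLang is typically driven, here is a minimal offline-generation sketch using SGLang's Python engine API. The model path and sampling parameters are placeholder assumptions, not taken from this page.

```python
# Minimal SGLang offline-generation sketch.
# Assumptions: the model path and sampling parameters below are
# illustrative placeholders; any HF-compatible model path works.
import sglang as sgl

if __name__ == "__main__":
    # Spin up an in-process engine backed by the given model.
    llm = sgl.Engine(model_path="meta-llama/Meta-Llama-3.1-8B-Instruct")

    prompts = ["The capital of France is"]
    sampling_params = {"temperature": 0.8, "top_p": 0.95}

    # generate() returns one dict per prompt; "text" holds the completion.
    outputs = llm.generate(prompts, sampling_params)
    for prompt, output in zip(prompts, outputs):
        print(prompt, "->", output["text"])

    llm.shutdown()  # release GPU resources held by the engine
```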