Zhang Cao Tom-CaoZH

👋

Focusing

63 followers · 123 following

China
https://tom-caozh.github.io/

Achievements

Highlights

Lists (5)

Sort

Starred repositories

Infini-AI-Lab / vortex_torch

Vortex: Programmable Sparse Attention for Agents as Algorithm Designers

Python 62 8 Updated Jun 8, 2026

uccl-project / uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,421 158 Updated Jun 22, 2026

BBuf / AI-Infra-Auto-Driven-SKILLS

Python 593 52 Updated Jun 20, 2026

uccl-project / mKernel

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 239 22 Updated Jun 21, 2026

PKU-SEC-Lab / SPEX

Python 8 1 Updated May 12, 2026

lightseekorg / tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,477 164 Updated Jun 22, 2026

Dao-AILab / gram-newton-schulz

Fast Polar Decomposition for Muon

Python 157 13 Updated May 2, 2026

tanishqkumar / ssd

A lightweight inference engine supporting speculative speculative decoding (SSD).

Python 956 72 Updated May 10, 2026

MiroMindAI / MiroThinker

MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7, achieves 74.0 and 75.3 on the BrowseComp and BrowseComp Zh, respectively.

Python 8,308 641 Updated Apr 25, 2026

osayamenja / FlashMoE

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 268 38 Updated May 5, 2026

open-tinker / OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

Python 675 63 Updated Mar 21, 2026

hao-ai-lab / DistCA

Efficient Long-context Language Model Training by Core Attention Disaggregation

Python 105 7 Updated Apr 7, 2026

Dao-AILab / sonic-moe

Accelerating MoE with IO and Tile-aware Optimizations

Python 719 90 Updated Jun 15, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,441 707 Updated May 17, 2026

Anionex / banana-slides

一个基于nano banana pro🍌的原生AI PPT生成应用，迈向＂Vibe PPT＂; 支持上传任意模板图片，上传任意素材&智能解析，一句话/大纲/页面描述自动生成PPT，口头修改指定区域、一键导出可编辑ppt - An AI-native slides generator based on nano banana pro🍌

Python 15,009 1,749 Updated Jun 22, 2026