MaoZiming

🔭

Thinking

Ziming Mao MaoZiming

🔭

Thinking

PhD student @ UC Berkeley at @ucbrise and @NetSys, Bytedance Seed. CS @ Yale, @Yale-LILY, @Thesys-lab @ CMU. Prev. @databricks

171 followers · 57 following

UC Berkeley
Berkeley, CA
01:12 (UTC -07:00)
https://maoziming.github.io/
@ziming_mao
in/maoziming

Achievements

x2 x2 x3

Achievements

x2 x2 x3

Organizations

Stars

uccl-project / rdmatop

htop-like TUI for real-time RDMA network monitoring.

Rust 48 3 Updated Jun 23, 2026

uccl-project / CommBench

Can LLMs Write Correct and Efficient GPU Communication Code?

Python 36 1 Updated Jun 9, 2026

StarTrail-org / PixelRAG

The end of web parsing. The beginning of scalable pixel-native search.

Python 3,617 324 Updated Jun 23, 2026

uccl-project / uccl-project.github.io

Astro 3 3 Updated Jun 14, 2026

uccl-project / mKernel

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 241 22 Updated Jun 21, 2026

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 1,026 99 Updated Sep 10, 2025

svg-project / flash-kmeans

Fast and memory-efficient exact kmeans

Python 662 39 Updated Jun 3, 2026

FStarLang / FStar

A Proof-oriented Programming Language

F* 3,047 256 Updated Jun 23, 2026

fmagent-project / FM-Agent

Python 412 28 Updated Jun 22, 2026

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 855 58 Updated Jun 22, 2026

NVIDIA / SOL-ExecBench

A benchmark of real-world DL kernel problems

Python 236 26 Updated May 28, 2026

Multi-V-VM / GPUOS

Share your GPU without MIG or MPS

Python 50 4 Updated Jan 27, 2026

googleworkspace / cli

Google Workspace CLI — one command-line tool for Drive, Gmail, Calendar, Sheets, Docs, Chat, Admin, and more. Dynamically built from Google Discovery Service. Includes AI agent skills.

Rust 27,224 1,431 Updated Jun 10, 2026

caoshiyi / K-Search

Automated High-Performance GPU Kernel Generation

Python 116 22 Updated Jun 1, 2026

test-time-training / discover

Python 591 86 Updated May 24, 2026

HazyResearch / HipKittens

Fast and Furious AMD Kernels

C++ 434 69 Updated Jun 21, 2026

deepreinforce-ai / CUDA-L1

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Python 303 105 Updated Nov 3, 2025

ovg-project / GVM

Shell 22 3 Updated Jan 18, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,405 1,059 Updated Jun 4, 2026

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,329 223 Updated Jun 22, 2026

Tencent / SelfEvolvingAgent

Research works from Tencent AI Lab regarding self-evolving agents

Python 98 5 Updated Jan 30, 2026

specula-org / SysMoBench

SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems

Python 21 3 Updated Jun 23, 2026

meta-pytorch / KernelAgent

Autonomous GPU Kernel Generation & Optimization via Deep Agents

Python 456 76 Updated Jun 6, 2026

flashinfer-ai / flashinfer-bench

Building the Virtuous Cycle for AI-driven LLM Systems

Python 250 41 Updated May 1, 2026

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,331 105 Updated Aug 28, 2025

tile-ai / tilescale

Forked from tile-ai/tilelang

Tile-based language built for AI computation across all scales

C++ 170 8 Updated Jun 16, 2026

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,459 92 Updated Jun 8, 2026

microsoft / OpenRCA

[ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?

Python 366 44 Updated Jun 19, 2026

osayamenja / FlashMoE

Distributed MoE in a Single Kernel [NeurIPS '25]

Cuda 268 39 Updated May 5, 2026

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,313 1,321 Updated Jun 22, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ziming Mao MaoZiming

Achievements

Achievements

Organizations

Block or report MaoZiming

Stars

uccl-project / rdmatop

uccl-project / CommBench

StarTrail-org / PixelRAG

uccl-project / uccl-project.github.io

uccl-project / mKernel

zhuzilin / ring-flash-attention

svg-project / flash-kmeans

FStarLang / FStar

fmagent-project / FM-Agent

SandAI-org / MagiAttention

NVIDIA / SOL-ExecBench

Multi-V-VM / GPUOS

googleworkspace / cli

caoshiyi / K-Search

test-time-training / discover

HazyResearch / HipKittens

deepreinforce-ai / CUDA-L1

ovg-project / GVM

deepseek-ai / DeepGEMM

mirage-project / mirage

Tencent / SelfEvolvingAgent

specula-org / SysMoBench

meta-pytorch / KernelAgent

flashinfer-ai / flashinfer-bench

bytedance / flux

tile-ai / tilescale

tile-ai / TileRT

microsoft / OpenRCA

osayamenja / FlashMoE

kvcache-ai / ktransformers