Organizations

@Y-Hack @Yale-LILY @yale-nova @skypilot-org @berkeley-cs168 @Trinity-data-store @uccl-project

Python 22 2 Updated Oct 9, 2025

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,600 105 Updated Oct 9, 2025

Fast OS-level support for GPU checkpoint and restore

C++ 238 26 Updated Sep 28, 2025

CloudSim: A Framework For Modeling And Simulation Of Cloud Computing Infrastructures And Services

Java 942 536 Updated Sep 25, 2025

Collaborative Datacenter Simulation and Exploration for Everybody

Kotlin 98 63 Updated Oct 3, 2025

Perplexity GPU Kernels

C++ 482 62 Updated Sep 19, 2025

Borg cluster traces from Google

TeX 990 204 Updated Aug 14, 2025

Library for text-to-text regression, applicable to any input string representation, that allows pretraining and fine-tuning over multiple regression tasks.

Python 267 36 Updated Oct 8, 2025

Model Context Protocol Servers

TypeScript 69,882 8,305 Updated Oct 9, 2025
C++ 303 26 Updated Oct 1, 2025

m3fs (Make 3FS) is a toolset designed to deploy a 3FS cluster.

Go 51 9 Updated Jul 18, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,317 275 Updated Sep 30, 2025

The P programming language.

C# 3,442 203 Updated Sep 30, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,363 945 Updated Sep 23, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,547 574 Updated Oct 9, 2025

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 3,089 314 Updated Oct 9, 2025

Open-source implementation of AlphaEvolve

Python 4,082 593 Updated Oct 9, 2025

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 100 17 Updated Oct 3, 2025

Microsoft Collective Communication Library

C++ 362 31 Updated Sep 20, 2023

Analyze computation-communication overlap in V3/R1.

1,102 143 Updated Mar 21, 2025

DeepSeek-V3/R1 inference performance simulator

Jupyter Notebook 169 21 Updated Mar 27, 2025

A High-Throughput Parallel Lossless Compressor for Scientific Data

C++ 71 16 Updated Jan 22, 2023

Expert Parallelism Load Balancer

Python 1,275 196 Updated Mar 24, 2025

llm-d enables high-performance distributed LLM inference on Kubernetes

Makefile 1,858 189 Updated Oct 9, 2025

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,232 172 Updated Aug 19, 2025

Unified Collective Communication Library

C 277 118 Updated Oct 9, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,501 627 Updated Oct 9, 2025

Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)

C 1,467 489 Updated Oct 7, 2025

PyTorch Single Controller

Rust 435 78 Updated Oct 10, 2025