-
UC Berkeley
- Berkeley, CA
-
18:41
(UTC -07:00) - https://maoziming.github.io/
- @ziming_mao
- in/maoziming
Stars
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
Fast OS-level support for GPU checkpoint and restore
CloudSim: A Framework For Modeling And Simulation Of Cloud Computing Infrastructures And Services
Collaborative Datacenter Simulation and Exploration for Everybody
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks.
Model Context Protocol Servers
m3fs(Make 3FS) is the toolset designed to deploy 3FS cluster.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Open-source implementation of AlphaEvolve
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Analyze computation-communication overlap in V3/R1.
DeepSeek-V3/R1 inference performance simulator
A High-Throughput Parallel Lossless Compressor for Scientific Data
llm-d enables high-performance distributed LLM inference on Kubernetes
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
Supercharge Your LLM with the Fastest KV Cache Layer
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)