Skip to content
View dywsjtu's full-sized avatar
:shipit:
Focusing
:shipit:
Focusing

Highlights

  • Pro

Organizations

@SysML-Princeton

Block or report dywsjtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup

Python 36 5 Updated Jan 9, 2023

Aequitas enables RPC-level QoS in datacenter networks.

C++ 18 2 Updated Jul 19, 2022

Microsoft Azure Traces

Jupyter Notebook 1,111 178 Updated Dec 6, 2025

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,005 61 Updated Mar 3, 2026

An extremely fast Python package and project manager, written in Rust.

Rust 83,356 2,950 Updated Apr 16, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,867 5,383 Updated Apr 16, 2026

Large Language Model (LLM) Systems Paper List

1,923 98 Updated Mar 24, 2026

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

Cuda 2,194 195 Updated Apr 14, 2026

Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]

Python 24 2 Updated Nov 21, 2024

Measure and optimize the energy consumption of your AI applications!

Python 350 43 Updated Mar 29, 2026

Infiniswap enables unmodified applications to efficiently use disaggregated memory.

C 256 51 Updated Sep 26, 2020

Fine-grained GPU sharing primitives

Python 147 18 Updated Jul 28, 2025

Tiresias is a GPU cluster manager for distributed deep learning training.

Python 166 50 Updated May 7, 2020

Hydra adds resilience and high availability to remote memory solutions.

C 33 6 Updated Feb 22, 2022

FedScale is a scalable and extensible open-source federated learning (FL) platform.

Python 414 121 Updated Dec 18, 2023

Justitia provides RDMA isolation between applications with diverse requirements.

C 43 9 Updated May 25, 2022

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,346 1,854 Updated Jul 3, 2024

A naive kernel.

C 17 6 Updated Aug 12, 2022

Lecture notes for Chris Peikert's graduate-level Theory of Cryptography course

TeX 190 37 Updated Sep 15, 2025

EECS 489: Computer Networks @ the University of Michigan

Python 267 126 Updated May 19, 2025