Stars
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
A lightweight design for computation-communication overlap.
A framework for generating realistic LLM serving workloads
DeepSeek-V3/R1 inference performance simulator
DeepEP: an efficient expert-parallel communication library
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Summary of some awesome work for optimizing LLM inference
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
[SIGCOMM'23] DONS: Fast and Affordable Discrete Event Network Simulation with Automatic Parallelization.
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
A Solidity starter template for developing smart contracts.
Web-based Traffic and Security Network Traffic Monitoring
Realtime Robust Malicious Traffic Detection via Frequency Domain Analysis
Parallel sparse direct solver for circuit simulation
A C++ project template for quick start.