Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A practitioner's handbook for production LLM serving.
A datacenter-scale distributed inference serving framework
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
My learning notes for ML SYS.
SGLang is a high-performance serving framework for large language models and multimodal models.
A framework for efficient model inference with omni-modality models
An MCP server for interacting with Google Colab
My project while learning ML from Udemy
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Dedicated resources for low-level system design. Learn how to design and implement large-scale systems. Prep for the system design interview.
LeetCode Python solutions and explanations. Also a guide to preparing for software engineer interviews.
Advanced data structures and algorithms for system design: the algorithms you need to know for system design
Ansible playbook for provisioning a secured YARN cluster
A middle-to-high level open source algorithm book designed with the coding interview at heart!
A complete computer science study plan to become a software engineer.
A curated list of Site Reliability and Production Engineering resources.
A curated list of Chaos Engineering resources.
Helps with visualising the schedule for GHC 2018
Python solutions to Cracking the Coding Interview (6th edition)
Elasticsearch stats exporter for Prometheus
Presentation given at New York Python Meetup
A curated list of cheatsheets for SRE