Skip to content
View hxu296's full-sized avatar
📚
Designing Machine Learning Systems - Chip Huyen
📚
Designing Machine Learning Systems - Chip Huyen

Organizations

@CS559 @Project-Mendota @pypose

Block or report hxu296

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Objective-C++ 12 3 Updated Aug 26, 2025

A generative speech model for daily dialogue.

Python 38,099 4,132 Updated Jul 6, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,308 5,731 Updated Aug 16, 2024

Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)

TypeScript 2,741 518 Updated Oct 10, 2025

An Open Source implementation of Notebook LM with more flexibility and features

TypeScript 9,546 943 Updated Nov 5, 2025

A step by step guide to fine-tuning the DeepSeek R1 Distilled models on Apple Silicon machines.

Python 58 9 Updated Feb 1, 2025

A privacy-first distributed training framework built on MLX for Apple Silicon, enabling secure and efficient AI model training across multiple devices while preserving data privacy.

Python 11 1 Updated Nov 25, 2024

slime is an LLM post-training framework for RL Scaling.

Python 2,372 241 Updated Nov 5, 2025

This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025

Python 6,844 477 Updated May 5, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,727 3,272 Updated Nov 5, 2025
Python 17 2 Updated Sep 26, 2025

kernels, of the mega variety

Python 597 26 Updated Sep 28, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,106 1,902 Updated Nov 1, 2025

ryOS, made with Cursor

TypeScript 792 125 Updated Nov 5, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,382 1,367 Updated Jul 9, 2025

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 56,763 7,596 Updated Nov 13, 2024

Playwright MCP server

TypeScript 22,830 1,832 Updated Nov 4, 2025

Large Language Model Text Generation Inference

Python 10,623 1,234 Updated Sep 17, 2025

Andrej Karpathy's micrograd library implemented in Go

Go 16 Updated May 13, 2025

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 1,835 107 Updated Jul 22, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,167 213 Updated Nov 4, 2025

A lightweight design for computation-communication overlap.

Cuda 183 8 Updated Oct 10, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

11,570 1,893 Updated Aug 31, 2023

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,385 228 Updated Nov 2, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,505 3,758 Updated Nov 2, 2025

My learning notes/codes for ML SYS.

Python 4,065 247 Updated Oct 6, 2025

100 numpy exercises (with solutions)

Python 13,437 6,371 Updated Aug 26, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,898 690 Updated Nov 5, 2025

CUDA Python: Performance meets Productivity

Python 3,019 217 Updated Nov 4, 2025
Next