hgt312

Huang, Guangtai hgt312

81 followers · 77 following

AWS
San Jose
UTC -07:00

Stars

uw-syfi / vibe-serve

Can AI Agents Build Bespoke LLM Serving Systems?

Python · 73 stars · 13 forks · Updated Jun 21, 2026

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python · 1,456 stars · 91 forks · Updated Jun 8, 2026

zhang677 / AccelOpt

[MLSys 2026] AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization

Python · 56 stars · 7 forks · Updated Jun 18, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python · 55,325 stars · 7,593 forks · Updated May 5, 2026

aws-neuron / nkipy

NKIPy: Rapid Prototyping on Trainium

Python · 28 stars · 9 forks · Updated Jun 19, 2026

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python · 4,442 stars · 707 forks · Updated May 17, 2026

apache / tvm-ffi

Open ABI and FFI for Machine Learning Systems

C++ · 418 stars · 80 forks · Updated Jun 21, 2026

sgl-project / sglang-jax

JAX backend for SGL

Python · 289 stars · 107 forks · Updated Jun 22, 2026

google / torchax

torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JAX-Pytorch interoperability, meaning, one can mix JAX & Pytor…

Python · 228 stars · 34 forks · Updated Jun 17, 2026

vllm-project / tpu-inference

TPU inference for vLLM, with unified JAX and PyTorch support.

Python · 360 stars · 219 forks · Updated Jun 22, 2026

MoonshotAI / Moonlight

Muon is Scalable for LLM Training

1,494 stars · 89 forks · Updated Aug 3, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python · 5,247 stars · 562 forks · Updated Jun 22, 2026

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python · 14,572 stars · 1,007 forks · Updated Jun 13, 2026

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python · 2,674 stars · 125 forks · Updated May 24, 2026

openai / simple-evals

Python · 4,533 stars · 491 forks · Updated Apr 22, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust · 7,313 stars · 1,263 forks · Updated Jun 22, 2026

jianzhnie / awesome-instruction-datasets

A collection of prompt and instruction datasets (awesome-prompt-datasets, awesome-instruction-dataset) covering a wide variety of instruction data for training ChatLLM models such as ChatGPT.

734 stars · 41 forks · Updated Jun 17, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python · 22,081 stars · 4,109 forks · Updated Jun 22, 2026

deepseek-ai / DeepSeek-V3

Python · 103,782 stars · 16,733 forks · Updated Aug 28, 2025

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python · 29,530 stars · 6,657 forks · Updated Jun 22, 2026

Mintplex-Labs / anything-llm

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

JavaScript · 61,936 stars · 6,752 forks · Updated Jun 19, 2026

mosaicml / composer

Supercharge Your Model Training

Python · 5,487 stars · 463 forks · Updated Apr 29, 2026

mosaicml / llm-foundry

LLM training code for Databricks foundation models

Python · 4,413 stars · 588 forks · Updated Mar 25, 2026

nv-legate / cupynumeric

NumPy and SciPy on Multi-Node Multi-GPU systems

Python · 978 stars · 86 forks · Updated Jun 18, 2026

patrick-kidger / equinox

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python · 2,909 stars · 202 forks · Updated Jun 13, 2026

aws-neuron / neuronx-distributed

Python · 66 stars · 22 forks · Updated Apr 9, 2026

bloomberg / memray

Memray is a memory profiler for Python

Python · 15,127 stars · 453 forks · Updated Jun 19, 2026

apple / axlearn

An Extensible Deep Learning Library

Python · 2,367 stars · 406 forks · Updated May 16, 2026

volcengine / veScale

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python · 1,026 stars · 62 forks · Updated Mar 3, 2026

EstrellaXD / Auto_Bangumi

AutoBangumi - a fully automated anime-tracking tool

Python · 8,096 stars · 435 forks · Updated Apr 19, 2026