Skip to content
View hgt312's full-sized avatar
  • AWS
  • San Jose
  • 10:15 (UTC -07:00)

Organizations

@awslabs @aws-samples @dmlc

Block or report hgt312

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Can AI Agents Build Bespoke LLM Serving Systems?

Python 73 13 Updated Jun 21, 2026

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,456 91 Updated Jun 8, 2026

[MLSys 2026] AccelOpt: Self-improving Agents for AI Accelerator Kernel Optimization

Python 56 7 Updated Jun 18, 2026

The best ChatGPT that $100 can buy.

Python 55,325 7,593 Updated May 5, 2026

NKIPy: Rapid Prototyping on Trainium

Python 28 9 Updated Jun 19, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 4,442 707 Updated May 17, 2026

Open ABI and FFI for Machine Learning Systems

C++ 418 80 Updated Jun 21, 2026

JAX backend for SGL

Python 289 107 Updated Jun 22, 2026

torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JAX-Pytorch interoperability, meaning, one can mix JAX & Pytor…

Python 228 34 Updated Jun 17, 2026

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 360 219 Updated Jun 22, 2026

Muon is Scalable for LLM Training

1,494 89 Updated Aug 3, 2025

🚀 Efficient implementations for emerging model architectures

Python 5,247 562 Updated Jun 22, 2026

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,572 1,007 Updated Jun 13, 2026

Muon is an optimizer for hidden layers in neural networks

Python 2,674 125 Updated May 24, 2026
Python 4,533 491 Updated Apr 22, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,313 1,263 Updated Jun 22, 2026

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

734 41 Updated Jun 17, 2026

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 22,081 4,109 Updated Jun 22, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 29,530 6,657 Updated Jun 22, 2026

Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience

JavaScript 61,936 6,752 Updated Jun 19, 2026

Supercharge Your Model Training

Python 5,487 463 Updated Apr 29, 2026

LLM training code for Databricks foundation models

Python 4,413 588 Updated Mar 25, 2026

NumPy and SciPy on Multi-Node Multi-GPU systems

Python 978 86 Updated Jun 18, 2026

Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/

Python 2,909 202 Updated Jun 13, 2026

Memray is a memory profiler for Python

Python 15,127 453 Updated Jun 19, 2026

An Extensible Deep Learning Library

Python 2,367 406 Updated May 16, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,026 62 Updated Mar 3, 2026

AutoBangumi - 全自动追番工具

Python 8,096 435 Updated Apr 19, 2026
Next