Skip to content
View jason-huang03's full-sized avatar

Organizations

@thu-nics @thu-ml

Block or report jason-huang03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient, Python-native, end-to-end MoE training in ~10K lines of code.

Python 64 6 Updated Apr 28, 2026

high-performance linear attention kernel library built on TileLang

Python 355 25 Updated Apr 30, 2026

Codes & examples for "CUDA - From Correctness to Performance"

C++ 128 25 Updated Oct 24, 2024

AI-assisted academic posters.

HTML 490 40 Updated Mar 15, 2026

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,763 271 Updated Jul 18, 2025

Research on Coding Agents

11,767 19,743 Updated Apr 1, 2026
Python 81 5 Updated Apr 29, 2026
Cuda 57 4 Updated Feb 24, 2026

CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-based computation patterns and optimizations targeting NVIDIA te…

C++ 951 77 Updated Apr 1, 2026

NVidia sass disassembler/inline patcher

C++ 72 12 Updated Apr 30, 2026

CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.

Python 481 50 Updated Apr 24, 2026

The Z3 Theorem Prover

C++ 12,204 1,647 Updated Apr 30, 2026

Beginner, advanced, expert level Rust training material

Rust 14,124 1,081 Updated Apr 26, 2026

A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.

Python 300 23 Updated Apr 27, 2026

Agentic Kernel Optimization for All — automated GPU kernel optimization for any kernel, any hardware, any language

Python 150 7 Updated Apr 2, 2026

Building the Virtuous Cycle for AI-driven LLM Systems

Python 227 40 Updated Apr 28, 2026
Cuda 40 6 Updated Dec 19, 2025

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,331 128 Updated Mar 19, 2026

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 1,010 61 Updated Mar 3, 2026

Official Code Implementation of Translating Flow to Policy via Hindsight Online Imitation

Python 124 Updated Mar 12, 2026

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

HTML 10,147 863 Updated Apr 30, 2026

A claude code skill to delegate prompts to codex

1,211 93 Updated Apr 27, 2026

Lightweight coding agent that runs in your terminal

Rust 79,135 11,337 Updated Apr 30, 2026

omo; the best agent harness - previously oh-my-opencode

TypeScript 55,205 4,467 Updated Apr 30, 2026

Glamourous agentic coding for all 💘

Go 23,694 1,595 Updated Apr 30, 2026
Jupyter Notebook 219 3 Updated Dec 19, 2025

原 [chatlog]项目(一个微信数据库解密读取及提供mcp服务、http服务的开源软件),现已支持通过微信clawbot接口推送消息,可以实时转发全部或指定消息到clawbot

Go 839 502 Updated Apr 26, 2026
Next