BobHuang sBobHuang

AI Kernel Engineer @zai-org | Bridging DSLs and Hardware with Triton, MLIR, and LLVM.

163 followers · 98 following

Achievements

x3 x2

Lists (1)

✨ Inspiration

1 repository

Stars

recursive-org / first-steps-toward-automated-ai-research

Research artifacts from Recursive's automated AI research system

Python 110 10 Updated Jun 11, 2026

Dogacel / auto-gpu-kernel

Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x

Python 127 10 Updated Jun 10, 2026

tile-ai / TileRT

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,400 86 Updated Jun 8, 2026

awslabs / agentcore-rl-toolkit

Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.

Python 43 4 Updated Jun 11, 2026

lucifer1004 / VeloQ

Agent-friendly GPU profile-query CLI

Rust 81 2 Updated Jun 12, 2026

mlc-ai / pith-train

Compact and Agent-Native MoE Training System

Python 195 15 Updated Jun 13, 2026

hicccc77 / WeFlow

WeFlow - a local app for exporting WeChat chat history and generating annual reports

11,631 2,842 Updated Jun 3, 2026

THU-KEG / LongTraceRL

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Python 37 Updated Jun 1, 2026

mit-han-lab / ncu-report-skill

Python 122 16 Updated May 24, 2026

mit-han-lab / KernelWiki

Python 252 27 Updated Jun 9, 2026

mit-han-lab / kernel-design-agents

592 48 Updated Jun 2, 2026

mit-han-lab / mlsys2026-flashinfer-contest

Python 84 3 Updated Jun 13, 2026

NVIDIA / CompileIQ

An Optimizer for Nvidia Compilers.

Python 95 5 Updated Jun 15, 2026

uccl-project / mKernel

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 233 22 Updated Jun 8, 2026

anysphere / kernel-optimization-results

Python 8 Updated Apr 14, 2026

flashinfer-ai / flashinfer-bench

Building the Virtuous Cycle for AI-driven LLM Systems

Python 249 41 Updated May 1, 2026

open-lm-engine / coda-kernels

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 212 22 Updated Jun 14, 2026

gevico / machina

A modular full-system emulator written in Rust

Rust 31 24 Updated May 27, 2026

multica-ai / multica

The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.

Go 36,787 4,522 Updated Jun 16, 2026

zai-org / Synapse

Self-hosted AI workspace with shareable AI teammates, shared conversations, memory, and governed access to plugins, MCP tools, and local devices.

TypeScript 50 5 Updated May 20, 2026

jiazhihao / agentic-compiler

An Agentic Compiler for CUDA

9 Updated May 17, 2026

BBuf / KDA-Pilot

Python 182 29 Updated Jun 15, 2026

zai-org / GLM-V

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,335 173 Updated May 16, 2026

zai-org / SCAIL

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)

Python 992 57 Updated May 6, 2026

zai-org / CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,786 1,307 Updated Nov 4, 2025

zai-org / Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 445 25 Updated Jul 5, 2024

zhao008 / cubin_cfg_html

Python 11 1 Updated May 8, 2026

CalvinXKY / InfraTech

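Sharing AI Infra knowledge and coding exercises: getting started with the PyTorch/vLLM/SGLang frameworks ⚡️, performance acceleration 🚀, large-model fundamentals 🧠, AI software and hardware 🔧, and more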

Jupyter Notebook 2,604 233 Updated May 30, 2026

NVlabs / cuda-oxide

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 2,761 185 Updated Jun 16, 2026

lightseekorg / tokenspeed

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,440 157 Updated Jun 16, 2026
