Stars
An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale
CUDA kernels for linear attention variants, written in CuTe DSL and CUTLASS C++.
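Linear attention variants like the ones above share one core trick: replace the softmax kernel exp(q·k) with a factorizable feature map φ, so attention can be reassociated from O(N²·d) to O(N·d²). A minimal non-causal NumPy sketch (the feature map φ and all names here are illustrative, not taken from the repo):

```python
import numpy as np

def linear_attention(Q, K, V, phi=lambda x: np.maximum(x, 0.0) + 1e-6):
    """Linear attention sketch: exp(q·k) is replaced by phi(q)·phi(k),
    then the sum is reassociated so keys/values are summarized once."""
    Qf, Kf = phi(Q), phi(K)
    KV = Kf.T @ V             # (d, d_v): one pass over keys/values
    Z = Kf.sum(axis=0)        # (d,): normalizer accumulated the same way
    return (Qf @ KV) / (Qf @ Z)[:, None]
```

Because φ is non-negative, each output row is a convex combination of value rows, mirroring what softmax attention produces, but the sequence dimension is only ever traversed once.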
A plug-and-play compiler that delivers free-lunch optimizations for both inference and training.
Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.
OpenClaw-RL: Train any agent simply by talking
AI agents that automatically run research on single-GPU nanochat training
A lightweight inference engine supporting speculative speculative decoding (SSD).
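Speculative decoding, in any variant, follows the same verify-and-accept loop: a cheap draft model proposes k tokens, the target model scores them, and each draft token t is accepted with probability min(1, p_target(t)/p_draft(t)), with a residual resample on rejection. A toy single-round sketch (all function names and the toy distributions are illustrative assumptions, not this engine's API):

```python
import random

def sample(dist):
    """Sample a token from a {token: prob} dict."""
    r, acc = random.random(), 0.0
    for tok, p in dist.items():
        acc += p
        if r <= acc:
            return tok
    return tok  # guard against floating-point shortfall

def speculative_step(target_p, draft_p, ctx, k):
    """One speculative round: draft proposes k tokens, target verifies."""
    proposed, c = [], list(ctx)
    for _ in range(k):
        t = sample(draft_p(c))
        proposed.append(t)
        c.append(t)
    accepted, c = [], list(ctx)
    for t in proposed:
        q = draft_p(c)[t]
        p = target_p(c).get(t, 0.0)
        if random.random() < min(1.0, p / q):
            accepted.append(t)
            c.append(t)
        else:
            # Rejected: resample from the renormalized residual max(0, p - q).
            tp, dp = target_p(c), draft_p(c)
            resid = {u: max(0.0, tp.get(u, 0.0) - dp.get(u, 0.0)) for u in tp}
            Z = sum(resid.values())
            if Z > 0:
                accepted.append(sample({u: v / Z for u, v in resid.items()}))
            break
    else:
        # Every draft token accepted: take one bonus token from the target.
        accepted.append(sample(target_p(c)))
    return accepted
```

The payoff is that one target-model pass can emit up to k+1 tokens while provably preserving the target distribution.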
A lightweight, AI-native training framework for large language models. Designed for fast iteration, reproducible experiments, and modular configuration across SFT, RLVR, and evaluation workflows.
A simple, fast and robust program-aware agentic inference system.
FlashInfer Bench @ MLSys 2026: Building AI agents to write high performance GPU kernels
Building the Virtuous Cycle for AI-driven LLM Systems
A rejection-sampling-based distribution alignment method for RL training with extreme actor-policy mismatch
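Classical rejection sampling is the mechanism underneath such alignment: samples drawn from a mismatched actor are accepted with probability p_target(x) / (M · p_actor(x)), where M bounds the density ratio, and the accepted samples are exactly distributed as the target policy. A generic sketch (the function names and toy policies are illustrative, not the repo's method):

```python
import random

def rejection_align(actor_sample, actor_pdf, target_pdf, M, n):
    """Draw n samples from the target policy using only actor samples.
    M must satisfy target_pdf(x) <= M * actor_pdf(x) for all x; accepted
    samples are then exactly target-distributed."""
    out = []
    while len(out) < n:
        x = actor_sample()
        if random.random() < target_pdf(x) / (M * actor_pdf(x)):
            out.append(x)
    return out
```

The catch, and why "extreme mismatch" is hard, is that the acceptance rate is 1/M: the more the actor and target policies diverge, the larger M must be and the more actor samples are discarded.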
FlashTile is a CUDA Tile IR compiler compatible with NVIDIA's tileiras, targeting NVIDIA GPUs from SM70 through SM121.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
DFlash: Block Diffusion for Flash Speculative Decoding
A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
Accelerating MoE with IO- and tile-aware optimizations
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.