Skip to content
View sBobHuang's full-sized avatar

Block or report sBobHuang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Research artifacts from Recursive's automated AI research system

Python 110 10 Updated Jun 11, 2026

Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x

Python 127 10 Updated Jun 10, 2026

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 1,400 86 Updated Jun 8, 2026

Toolkit for Seamlessly Enabling RL Training on Any Agent with Bedrock AgentCore.

Python 43 4 Updated Jun 11, 2026

Agent-friendly GPU profile-query CLI

Rust 81 2 Updated Jun 12, 2026

Compact and Agent-Native MoE Training System

Python 195 15 Updated Jun 13, 2026

WeFlow - 一个本地的微信聊天记录导出和年度报告应用

11,631 2,842 Updated Jun 3, 2026

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Python 37 Updated Jun 1, 2026
Python 252 27 Updated Jun 9, 2026

An Optimizer for Nvidia Compilers.

Python 95 5 Updated Jun 15, 2026

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 233 22 Updated Jun 8, 2026

Building the Virtuous Cycle for AI-driven LLM Systems

Python 249 41 Updated May 1, 2026

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 212 22 Updated Jun 14, 2026

A modular full-system emulator written in Rust

Rust 31 24 Updated May 27, 2026

The open-source managed agents platform. Turn coding agents into real teammates — assign tasks, track progress, compound skills.

Go 36,787 4,522 Updated Jun 16, 2026

Self-hosted AI workspace with shareable AI teammates, shared conversations, memory, and governed access to plugins, MCP tools, and local devices.

TypeScript 50 5 Updated May 20, 2026

An Agentic Compiler for CUDA

9 Updated May 17, 2026
Python 182 29 Updated Jun 15, 2026

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,335 173 Updated May 16, 2026

SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations (CVPR 2026 Findings)

Python 992 57 Updated May 6, 2026

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,786 1,307 Updated Nov 4, 2025

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 445 25 Updated Jul 5, 2024
Python 11 1 Updated May 8, 2026

分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等

Jupyter Notebook 2,604 233 Updated May 30, 2026

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 2,761 185 Updated Jun 16, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,440 157 Updated Jun 16, 2026
Next