The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 219,221 33,602 Updated Jun 21, 2026

NVlabs / rcm

rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale

Python 704 26 Updated Jun 5, 2026

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,536 265 Updated Jun 17, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 5,226 1,150 Updated Jun 21, 2026

Tongyi-MAI / Z-Image

Python 11,590 790 Updated Feb 9, 2026

Tencent-Hunyuan / flex-block-attn

flex-block-attn: an efficient block sparse attention computation library

Jupyter Notebook 130 14 Updated Dec 26, 2025

kohya-ss / musubi-tuner

Python 1,868 282 Updated Jun 19, 2026

moonmath-ai / LiteAttention

Transforming Video Diffusion with Temporal Sparse Attention

Python 49 5 Updated Apr 8, 2026

pytorch / helion

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 890 154 Updated Jun 21, 2026

Bluear7878 / H2-Cache-A-Hierarchical-Dual-Stage-Cache

Python 22 2 Updated Nov 3, 2025

meta-pytorch / torchcomms

torchcomms: a modern PyTorch communications API

C++ 372 153 Updated Jun 21, 2026

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 687 97 Updated Jun 21, 2026

thu-ml / SLA

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 313 19 Updated Feb 24, 2026

SandAI-org / MagiAttention

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 852 58 Updated Jun 21, 2026

Shenyi-Z / Cache4Diffusion

Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.

Python 104 11 Updated Oct 23, 2025

ModelTC / LightX2V

Lightweight Image Video Action Generation Inference Framework

Python 2,428 220 Updated Jun 21, 2026

HKUSTDial / flash-sparse-attention

Trainable fast and memory-efficient sparse attention

Python 709 52 Updated Jun 21, 2026

flagos-ai / FlagGems

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 1,030 417 Updated Jun 21, 2026

vipshop / cache-dit

A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.

Python 1,204 75 Updated Jun 16, 2026

tianweiy / CausVid

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,370 82 Updated Aug 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yh8899

Block or report yh8899

Starred repositories

Tencent-Hunyuan / UniRL

nv-tlabs / PiD

meta-pytorch / monarch

open-lm-engine / coda-kernels

NVlabs / AnyFlow

vvvvvjdy / D-OPSD

multica-ai / andrej-karpathy-skills

ai-dynamo / aitune

SandAI-org / MagiCompiler

shareAI-lab / learn-claude-code

affaan-m / ECC