Skip to content
View helloyongyang's full-sized avatar
  • SenseTime
  • Shanghai, China

Block or report helloyongyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"

Python 209 3 Updated Feb 3, 2026

Advancing Open-source World Models

Python 2,407 183 Updated Feb 2, 2026

High Performance LLM Inference Operator Library

C++ 646 56 Updated Feb 3, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,689 545 Updated Feb 4, 2026

Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…

Python 1,290 108 Updated Dec 23, 2025

Combine all open source AI image-generated models and video-generated models, to generate AI videos in predefined workflow easily.

TypeScript 4 1 Updated Jan 31, 2026

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,747 271 Updated Feb 3, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 830 96 Updated Feb 4, 2026

Nano vLLM

Python 11,459 1,510 Updated Nov 3, 2025

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 924 83 Updated Dec 31, 2025

PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.

Python 3,217 275 Updated Jan 26, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,303 226 Updated Jan 29, 2026

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 564 28 Updated Jan 26, 2026
Python 290 14 Updated Jul 29, 2025

Enjoy the magic of Diffusion models!

Python 11,696 1,124 Updated Feb 4, 2026

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 2,040 130 Updated Apr 3, 2025

Allow torch tensor memory to be released and resumed later

Python 216 37 Updated Jan 13, 2026

Official implementation of BLIP3o-Series

Python 1,633 78 Updated Nov 29, 2025

flex-block-attn: an efficient block sparse attention computation library

Jupyter Notebook 108 6 Updated Dec 26, 2025

StreamDiffusion, Live Stream APP

Python 346 32 Updated Feb 4, 2026

ComfyUI custom node for lightx2v

Python 75 7 Updated Feb 3, 2026

Tile primitives for speedy kernels

Cuda 3,119 234 Updated Feb 4, 2026
Python 43 8 Updated Nov 3, 2025

Code for Draft Attention

Python 99 2 Updated May 22, 2025

Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"

Python 60 3 Updated Jul 1, 2025

LTX-Video Support for ComfyUI

Python 3,083 313 Updated Jan 29, 2026

A minimal implementation of DeepMind's Genie world model

Python 1,139 93 Updated Nov 22, 2025

📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉

Python 516 25 Updated Jan 18, 2026
Next