Stars
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Post-training with Tinker
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
Lynx: Towards High-Fidelity Personalized Video Generation
Chrome DevTools for coding agents
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
A community driven registry service for Model Context Protocol (MCP) servers.
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
A nearly-live implementation of OpenAI's Whisper.
Render any git repo into a single static HTML page for humans or LLMs
We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a re…
[SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off
Official implementation of "Normalized Attention Guidance"
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
🔥 Clone and recreate any website as a modern React app in seconds
A unified inference and post-training framework for accelerated video generation.
Text-audio foundation model from Boson AI
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
The ultimate training toolkit for finetuning diffusion models
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
kingbri1 / flash-attention
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
A curated list of recent diffusion models for video generation, editing, and various other applications.