-
18:03
(UTC +08:00)
Lists (2)
Sort Name ascending (A-Z)
Stars
AI PPT赛道终结者,史上最最最强 PPT Skill!!! 使用GPT生成豪华的图片格式PPT,然后转换为完全可编辑的PPTX文件。
A programmable, explicit world model on Godot — worlds are pure JSON run by a fixed primitives + interpreter engine. Built by Claude, for Claude.
Official implementation of LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing
[CVPR 2026 Oral] Official implementation for ChordEdit: One-Step Low-Energy Transport for Image Editing
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Multimodal RL training framework for diffusion & omni models
[TOG 2024]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
Long-form audio-visual generation evaluation framework.
[AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models"
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
[ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based generation.
Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework for efficient and causal video generation using adversarial s…
Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation
Lightweight coding agent that runs in your terminal
【Accepted by TPAMI】Human Motion Video Generation: A Survey (https://ieeexplore.ieee.org/document/11106267)
Official Repo of "D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models"
Official Pytorch Code of the Paper "FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization"
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
[ICML 2026] | Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.
Unified Codebase for Advanced World Models.