Official implementation of "Towards One-Step Causal Video Generation via Adversarial Self-Distillation" (arXiv 2025). A novel framework for efficient and causal video generation using adversarial s…

19 Updated Nov 4, 2025

meituan-longcat / LongCat-Flash-Omni

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 445 24 Updated Dec 15, 2025

Soul-AILab / SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 2,768 342 Updated Dec 11, 2025

meituan-longcat / LongCat-Video

Python 1,599 209 Updated Dec 20, 2025

NVlabs / rcm

rCM: SOTA Diffusion Distillation & Few-Step Video Generation based on sCM/MeanFlow

Python 385 14 Updated Dec 12, 2025

facebookresearch / SSDD

Official implementation of SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization.

Jupyter Notebook 50 4 Updated Nov 12, 2025

character-ai / Ovi

Python 1,461 152 Updated Nov 15, 2025

Vchitect / Cut2Next

Cut2Next: Generating Next Shot via In-Context Tuning

30 Updated Aug 21, 2025

AlmondGod / tinyworlds

A minimal implementation of DeepMind's Genie world model

Python 1,062 84 Updated Nov 22, 2025

NVlabs / LongLive

LongLive: Real-time Interactive Long Video Generation

Python 921 63 Updated Dec 4, 2025

meituan-longcat / LongCat-Flash-Thinking

259 25 Updated Dec 15, 2025

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 906 87 Updated Sep 20, 2025

OpenBMB / VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

Python 3,035 328 Updated Dec 20, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,682 1,354 Updated Dec 17, 2025

Phantom-video / HuMo

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Python 1,029 178 Updated Oct 19, 2025

PKU-YuanGroup / UAE

Official repository for the UAE paper, unified-GRPO, and unified-Bench

Python 151 6 Updated Sep 12, 2025

NJU-3DV / SpatialVID

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 449 14 Updated Dec 15, 2025

NoahBishop / index-tts

Python 13 1 Updated Dec 1, 2025

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,907 2,036 Updated Dec 2, 2025

FunAudioLLM / ThinkSound

[NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,108 65 Updated Nov 25, 2025

meituan-longcat / LongCat-Flash-Chat

1,246 60 Updated Dec 15, 2025

meituan-longcat / Meeseeks

An iterative, feedback-driven benchmark for LLMs' instruction-following ability

Python 46 4 Updated Sep 24, 2025

csbhr / Vivid-VR

The code for Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration

Python 205 19 Updated Oct 30, 2025

Kunbyte-AI / OmniTry

Official Repository of "OmniTry: Virtual Try-On Anything without Masks"

Python 234 29 Updated Aug 29, 2025

