Skip to content
View ybbbbt's full-sized avatar

Highlights

  • Pro

Organizations

@zju3dv

Block or report ybbbbt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

942 122 Updated Aug 27, 2025

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,443 161 Updated Apr 16, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,825 92 Updated Nov 28, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,096 112 Updated Dec 19, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,225 110 Updated Oct 15, 2025

Lets make video diffusion practical!

Python 17,047 1,708 Updated Oct 16, 2025

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,703 185 Updated Apr 18, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 2,141 164 Updated Jun 22, 2026

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 596 8 Updated Oct 26, 2025

ComfyUI Node

Python 725 41 Updated Jun 18, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 163 11 Updated Sep 16, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 856 42 Updated Dec 17, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,303 2,892 Updated Mar 5, 2026

Collection of scripts to build small-scale datasets for fine-tuning video generation models.

Python 81 7 Updated Mar 17, 2025

Lora traing script for Lightricks LTX-video

Python 71 5 Updated Feb 12, 2025

[NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead

Python 42 2 Updated Oct 3, 2025

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

JavaScript 40,059 3,430 Updated May 30, 2026

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,428 434 Updated Jan 17, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,863 537 Updated Jun 19, 2026

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 525 32 Updated Mar 19, 2026

The ultimate training toolkit for finetuning diffusion models

Python 10,955 1,368 Updated Jun 22, 2026

Scalable and memory-optimized training of diffusion models

Python 1,358 140 Updated May 26, 2026

A pipeline parallel training script for diffusion models.

Python 1,976 278 Updated Jun 7, 2026

musubi-tuner modified to tune image2video/video infilling

Python 33 3 Updated Jan 30, 2025

Official repository for LTX-Video

Python 10,544 1,045 Updated Jan 5, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,779 2,509 Updated May 25, 2026

A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and professional-looking video outputs by incorporating iconic …

Python 48 3 Updated Dec 31, 2024

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,862 282 Updated Jun 21, 2026

Code for our paper: Learning Camera Movement Control from Real-World Drone Videos

Python 36 5 Updated Apr 16, 2025
Next