Skip to content
View ybbbbt's full-sized avatar

Highlights

  • Pro

Organizations

@zju3dv

Block or report ybbbbt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

939 121 Updated Aug 27, 2025

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,440 161 Updated Apr 16, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,821 92 Updated Nov 28, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,095 113 Updated Dec 19, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,225 110 Updated Oct 15, 2025

Lets make video diffusion practical!

Python 17,022 1,697 Updated Oct 16, 2025

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,689 183 Updated Apr 18, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 2,130 162 Updated Jun 11, 2026

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 595 9 Updated Oct 26, 2025

ComfyUI Node

Python 727 41 Updated Jun 18, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 163 11 Updated Sep 16, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 854 42 Updated Dec 17, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,256 2,855 Updated Mar 5, 2026

Collection of scripts to build small-scale datasets for fine-tuning video generation models.

Python 81 7 Updated Mar 17, 2025

Lora traing script for Lightricks LTX-video

Python 71 5 Updated Feb 12, 2025

[NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead

Python 42 2 Updated Oct 3, 2025

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

JavaScript 39,986 3,419 Updated May 30, 2026

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,418 435 Updated Jan 17, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,856 527 Updated Jun 12, 2026

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 521 32 Updated Mar 19, 2026

The ultimate training toolkit for finetuning diffusion models

Python 10,860 1,353 Updated Jun 13, 2026

Scalable and memory-optimized training of diffusion models

Python 1,361 140 Updated May 26, 2026

A pipeline parallel training script for diffusion models.

Python 1,972 278 Updated Jun 7, 2026

musubi-tuner modified to tune image2video/video infilling

Python 33 3 Updated Jan 30, 2025

Official repository for LTX-Video

Python 10,475 1,035 Updated Jan 5, 2026

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 21,642 2,496 Updated May 25, 2026

A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and professional-looking video outputs by incorporating iconic …

Python 48 3 Updated Dec 31, 2024

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,860 282 Updated Jun 13, 2026

Code for our paper: Learning Camera Movement Control from Real-World Drone Videos

Python 36 5 Updated Apr 16, 2025
Next