Stars
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
[ACM MM 2023] Official implementation of "Hierarchical Masked 3D Diffusion Model for Video Outpainting"
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
[ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745
[AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation"
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
[DEIMv2] Real Time Object Detection Meets DINOv3
🎓Automatically Update CV Papers Daily using GitHub Actions
Command-line program to download videos from YouTube.com and other video sites
High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios
Materials for the Hugging Face Diffusion Models Course
[ICLR 2026] Official implementation of "SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction"
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Chinese translation of wtfpython / ongoing 🚧... / my ability is limited, so help improving the translation is welcome
[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Project page for "Wan-Animate: Unified Character Animation and Replacement with Holistic Replication"
Benchmarking Knowledge Transfer in Lifelong Robot Learning
SAPIEN Manipulation Skill Framework, an open-source GPU-parallelized robotics simulator and benchmark
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Action Chunking Transformer implementation for low cost robot
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
A pipeline parallel training script for diffusion models.
SkyReels-A2: Compose anything in video diffusion transformers