Skip to content
View wyhsirius's full-sized avatar

Block or report wyhsirius

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Python 105 1 Updated Oct 27, 2025

[ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".

Python 116 2 Updated Feb 6, 2026

LIA-X: Interpretable Latent Portrait Animator

Python 105 12 Updated Sep 17, 2025

[ICLR2026] Video-GPT via Next Clip Diffusion.

Python 46 1 Updated Jun 2, 2025

[CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models

Python 296 23 Updated May 17, 2025

Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion (CVPR2025)

Python 149 8 Updated Oct 22, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,150 425 Updated Jun 18, 2026

[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.

Python 1,943 191 Updated Oct 30, 2025

[CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts

Python 309 12 Updated Jun 9, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,660 126 Updated Mar 23, 2026

[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Python 952 64 Updated Nov 13, 2024

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Python 967 65 Updated Nov 13, 2024

[ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation

Python 22 1 Updated Oct 25, 2023

Training-Free Condition-Guided Text-to-Video Generation

Python 62 1 Updated Oct 23, 2025

[ICML2023] Long-Term Rhythmic Video Soundtracker

Python 63 1 Updated Jul 28, 2025

An open-source tool-augmented conversational language model from Fudan University

Python 12,140 1,134 Updated May 27, 2026

Official PyTorch implementation of LongVideoGAN

Python 319 30 Updated Nov 5, 2022

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 4,043 458 Updated Oct 1, 2025
Python 3,333 358 Updated Jun 10, 2023

3D-Aware Video Generation

Python 76 3 Updated Nov 15, 2022

[ICLR 22, TPAMI 24] LIA: Latent Image Animator

Python 652 69 Updated Oct 22, 2025

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,802 281 Updated Feb 15, 2023

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,415 800 Updated Oct 7, 2024

[CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation

Python 542 53 Updated Jul 30, 2024

A curated list of awesome 3d generation papers

1,194 61 Updated Mar 9, 2023

Official PyTorch implementation of "Playable Environments: Video Manipulation in Space and Time", CVPR 2022

Python 72 10 Updated Oct 16, 2022

A curated list of resources on implicit neural representations.

2,639 145 Updated Feb 11, 2024

[WACV 2021]"Guided Attentive Feature Fusion for Multispectral Pedestrian Detection"

29 2 Updated Jan 13, 2021

Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection

Python 110 12 Updated Jan 28, 2023

[BMVC 2021 Oral] Official implementation of our paper "A Unified Framework for Real-world Skeleton-based Action Recognition" on Toyota Smarthome/Penn Action/NTU-RGB+D/Posetics datasets

Python 53 11 Updated Sep 2, 2022
Next