-
Chongqing University of Posts and Telecommunications
- Chongqing, China
-
01:37
(UTC +08:00)
Lists (1)
Sort Name ascending (A-Z)
Stars
CNCF Sandbox project: A Cloud-Native Proxyless Service Mesh based on Java Bytecode Enhancement Technology
Enjoy the magic of Diffusion models!
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Code of "Style Customization of Text-to-Vector Generation with Image Diffusion Priors"
Lets make video diffusion practical!
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
通过Anki Fsrs算法速成力扣:自动推荐题目,每日复习(支持导入外部题目:手撕、洛谷、codeforce、牛客、一题多解)。Master LeetCode via Anki Fsrs:auto-recommend problems, review daily.
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
[CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
Official implementation of FIFO-Diffusion: Generating Infinite Videos from Text without Training (NeurIPS 2024)
Mobius: Text to Seamless Looping Video Generation via Latent Shift
A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
peanutcocktail / CogVideo
Forked from zai-org/CogVideoText-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
To practice your competitive programming skills, try solving daily Codeforces problems!
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Quick scripts to calculate CLIP text-image similarity
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
Video-Infinity generates long videos quickly using multiple GPUs without extra training.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
[ICLR 2024] Code for FreeNoise based on VideoCrafter
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Open-Sora: Democratizing Efficient Video Production for All