delldu
  • Vision
  • Shenzhen, China

Starred repositories

[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner

Python 192 12 Updated May 23, 2026

The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation

Python 41 Updated May 4, 2025

Implementation of the AAAI 2025 paper "SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers".

2 Updated Mar 25, 2026

JoyAI-Echo: Pushing the Frontier of Long Audio-Visual Generation

Python 1,549 135 Updated Jun 8, 2026

Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…

Python 110 17 Updated Nov 6, 2025

The official repository of our ICLR 2026 paper "Vivid-VR: Distilling Concepts from Text-to-Video Diffusion Transformer for Photorealistic Video Restoration".

Python 239 19 Updated Feb 4, 2026

Real-time 3D full-body reconstruction from a single camera, Multiperson BVH output, Pure C++ runtime, ONNX + ggml, 70-joint skeleton with hands.

C 520 71 Updated Jun 14, 2026
Python 203 9 Updated Apr 23, 2026

[ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.

Python 116 2 Updated Apr 13, 2026

[AAAI 2024] "LDMVFI: Video Frame Interpolation with Latent Diffusion Models", Duolikun Danier, Fan Zhang, David Bull

Python 190 24 Updated Aug 13, 2023
Python 289 17 Updated May 10, 2026

[CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution

Python 127 9 Updated Apr 12, 2026

[ICCV 2025] Official PyTorch code for the paper "DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution"

Python 250 7 Updated Mar 11, 2026

[CVPR2026] ODTSR: This repo is the official implementation of "One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution"

Python 169 4 Updated Feb 21, 2026

[ICML26 Spotlight] UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Python 143 1 Updated May 1, 2026

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,440 161 Updated Apr 16, 2026

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 1 Updated May 22, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 1 Updated Jun 10, 2025

[ICCV 2025] Official code of DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Python 724 35 Updated Dec 17, 2025

[ICLR 2025 spotlight] 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation

Jupyter Notebook 259 8 Updated Jun 3, 2025

ComfyUI-OmniGen - A ComfyUI custom node implementation of OmniGen, a powerful text-to-image generation and editing model.

Python 301 19 Updated Apr 18, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,324 361 Updated Dec 4, 2025

Workflows shared by a fun programmer born in the 1980s

1,783 448 Updated Jun 15, 2026

Lightweight Python framework that provides a high-level API for creating and rendering scenes with Blender.

Python 863 20 Updated Sep 13, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1 Updated Mar 15, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 3,121 266 Updated Jun 14, 2026

🙌 OpenHands: AI-Driven Development

Python 77,078 9,796 Updated Jun 15, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 1 Updated Mar 9, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,256 2,859 Updated Mar 5, 2026

A C++ header-only HTTP/HTTPS server and client library

C++ 1 Updated Feb 12, 2025