Skip to content
View MingtaoGuo's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Sichuan University
  • Chengdu

Block or report MingtaoGuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ICLR 2025 paper X-NeMo & Project X-Portrati2

Python 91 5 Updated Aug 7, 2025
Python 7,320 424 Updated Dec 14, 2025

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 540 30 Updated Mar 12, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,464 94 Updated Sep 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,874 1,493 Updated Dec 17, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,422 359 Updated Nov 11, 2025

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 425 30 Updated Sep 1, 2025

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 2,123 139 Updated Nov 2, 2025

[CAD/Graphics 2025][Computers & Graphics] Navigating Large-Pose Challenge for High-Fidelity Face Reenactment with Video Diffusion Model

Python 5 2 Updated Sep 2, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,477 75 Updated Dec 4, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)

Python 1,706 127 Updated Jul 25, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,718 460 Updated Dec 18, 2025

FACM: Flow-Anchored Consistency Models

Python 133 2 Updated Aug 6, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,009 1,272 Updated Oct 11, 2025

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,487 2,365 Updated Apr 29, 2025

从零手搓Flow Matching(Rectified Flow)

Python 559 32 Updated Dec 10, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,651 3,968 Updated Apr 19, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,047 113 Updated Nov 12, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,503 244 Updated Oct 17, 2025

[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models

Python 315 10 Updated Apr 24, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,663 80 Updated Nov 28, 2025

Lets make video diffusion practical!

Python 16,353 1,593 Updated Oct 16, 2025

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Python 822 55 Updated Apr 27, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,330 109 Updated Mar 11, 2025

Pytorch Implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model" (SIGGRAPH 2025)

Python 215 23 Updated Jul 14, 2024

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

1,407 79 Updated Nov 6, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,938 2,207 Updated Dec 15, 2025

Enjoy the magic of Diffusion models!

Python 11,161 1,054 Updated Dec 18, 2025

[CVPR 2025] High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model

Python 57 5 Updated Jun 4, 2025
Next