Skip to content
View mdswyz's full-sized avatar

Block or report mdswyz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Elevate your AI research writing, no more tedious polishing ✨

28,541 2,223 Updated May 18, 2026
Python 351 32 Updated Feb 9, 2026

A collection of awesome video generation studies.

TeX 770 40 Updated Mar 31, 2026

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,225 110 Updated Oct 15, 2025

[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 768 51 Updated Feb 21, 2026

Enjoy the magic of Diffusion models!

Python 12,582 1,230 Updated Jun 15, 2026

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

Python 658 94 Updated Jul 2, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 28,998 6,040 Updated May 23, 2026

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,819 266 Updated Oct 17, 2025

Lets make video diffusion practical!

Python 17,027 1,701 Updated Oct 16, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,260 2,864 Updated Mar 5, 2026

Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"

Jupyter Notebook 315 11 Updated Sep 28, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,786 1,307 Updated Nov 4, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,985 380 Updated Mar 12, 2026

Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection CVPR 2025

Python 27 1 Updated Feb 28, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,592 414 Updated Nov 11, 2025
BibTeX Style 1,542 377 Updated Mar 19, 2026

AugTarget data augmentation for infrared small target detection.

Python 20 2 Updated May 19, 2023

[AAAI2025] FedCFA: Alleviating Simpson’s Paradox in Model Aggregation with Counterfactual Federated Learning

Python 24 Updated Jan 23, 2025

✨ [AAAI 2025] Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification

Python 54 2 Updated Apr 16, 2025

[AAAI 2025] Official implementation of the paper "Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation"

Python 41 Updated Dec 17, 2024

Official implementation of the paper "Attentive Eraser: Unleashing Diffusion Model’s Object Removal Potential via Self-Attention Redirection Guidance" (AAAI 2025 Oral)

Jupyter Notebook 221 10 Updated May 9, 2025

[AAAI 2025] Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

Python 34 2 Updated Dec 16, 2025

Source code for AAAI'25 paper "Component-Level Segmentation for Oracle Bone Inscription Decipherment"

Python 20 1 Updated Oct 13, 2025

[AAAI'2025] The official implementation code of SIGMA

Python 41 11 Updated Oct 14, 2025
Python 35 3 Updated Nov 22, 2024

[AAAI2025] Offical code implementation of "Context-aware Inductive Knowledge Graph Completion with Latent Type Constraints and Subgraph Reasoning"

Python 17 Updated Aug 26, 2025

Code implement for FastToG

Python 89 10 Updated Apr 13, 2025
Next