Skip to content
View mt-cly's full-sized avatar
🐒
Focusing
🐒
Focusing
  • PolyU
  • HongKong, China

Highlights

  • Pro

Block or report mt-cly

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
7 Updated Jun 15, 2026

Rethinking One-Step Image Editing through ChordEdit: Reproduction, Simplification, and New Insights

Python 5 Updated Jun 15, 2026

DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images

Python 24 Updated Jun 13, 2026

[CVPR 2026 Best Paper Finalist] Pixel Diffusion Transformers for Image Generation

Python 804 60 Updated Jun 16, 2026

Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation

42 2 Updated Mar 6, 2026

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Python 100 Updated May 27, 2026

GenClaw: Code-Driven Agentic Image Generation

222 2 Updated Jun 6, 2026

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

Python 60 4 Updated Jun 1, 2026

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Python 215 9 Updated May 30, 2026

Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence

80 1 Updated May 8, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,189 277 Updated Jun 15, 2026

[CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-World 2.0)

Python 165 8 Updated Apr 24, 2026

[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"

Python 75 2 Updated May 6, 2026

Project Lyra: Open Generative 3D World Models

Python 2,097 224 Updated Jun 11, 2026

HY-SOAR:Self-Correction for Optimal Alignment and Refinement in Diffusion Models

Python 630 64 Updated Apr 21, 2026

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 2,230 186 Updated May 27, 2026

[CVPR 2026 (Highlight)] Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction

Python 485 37 Updated May 11, 2026

FrameCrafter: Novel View Synthesis as Video Completion

Python 64 2 Updated May 19, 2026
Python 799 83 Updated May 6, 2026
Python 917 68 Updated Apr 13, 2026

Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.

635 60 Updated May 12, 2026
Python 32 Updated Apr 29, 2026

[CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution

Python 128 9 Updated Apr 12, 2026
Python 783 62 Updated Apr 16, 2026

[ICML 2026] WorldMirror: Fast and Universal 3D reconstruction model for versatile tasks

Python 1,140 114 Updated May 27, 2026

Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"

Python 22 Updated May 8, 2026

Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.

TypeScript 3,062 324 Updated Jun 15, 2026

Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the generated sequences.

Python 257 23 Updated Apr 27, 2026

[CVPR 2026] DROID-SLAM in the Wild

Python 391 44 Updated Mar 26, 2026

CheXOne: A Reasoning-Enabled Vision–Language Foundation Model for Chest X-ray Interpretation

Python 42 2 Updated Apr 12, 2026
Next