mt-cly

🐒

Focusing

Chen Liyi mt-cly

🐒

Focusing

呱呱呱

53 followers · 68 following

PolyU
HongKong, China

Achievements

Highlights

Lists (29)

Sort

Stars

PolyU-VCLab / TVEdit

7 Updated Jun 15, 2026

MinghanLi / one-step-editing

Rethinking One-Step Image Editing through ChordEdit: Reproduction, Simplification, and New Insights

Python 5 Updated Jun 15, 2026

PolyU-VCLab / DepthMaster

DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images

Python 24 Updated Jun 13, 2026

NVlabs / PixelDiT

[CVPR 2026 Best Paper Finalist] Pixel Diffusion Transformers for Image Generation

Python 804 60 Updated Jun 16, 2026

ysy31415 / EffectMaker

Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation

42 2 Updated Mar 6, 2026

ali-vilab / DiffusionOPD

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Python 100 Updated May 27, 2026

yejy53 / GenClaw

GenClaw: Code-Driven Agentic Image Generation

222 2 Updated Jun 6, 2026

PolyU-VCLab / GGT-100K

GGT-100K: Generative Ground Truth for Generalizable Real-World Image Restoration

Python 60 4 Updated Jun 1, 2026

yyfz / Warp-as-History

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Python 215 9 Updated May 30, 2026

JavisVerse / Awesome-AVI

Awesome Audio-Visual Intelligence, Survey of Audio-Visual Intelligence

80 1 Updated May 8, 2026

OpenSenseNova / SenseNova-U1

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,189 277 Updated Jun 15, 2026

FuchengSu / WorldStereo

[CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-World 2.0)

Python 165 8 Updated Apr 24, 2026

shiyi-zh0408 / Meta-CoT

[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"

Python 75 2 Updated May 6, 2026

nv-tlabs / lyra

Project Lyra: Open Generative 3D World Models

Python 2,097 224 Updated Jun 11, 2026

Tencent-Hunyuan / HY-SOAR

HY-SOAR:Self-Correction for Optimal Alignment and Refinement in Diffusion Models

Python 630 64 Updated Apr 21, 2026

Tencent-Hunyuan / HY-World-2.0

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 2,230 186 Updated May 27, 2026

zju3dv / Scal3R

[CVPR 2026 (Highlight)] Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction

Python 485 37 Updated May 11, 2026

szqwu / FrameCrafter

FrameCrafter: Novel View Synthesis as Video Completion

Python 64 2 Updated May 19, 2026

inspatio / worldfm

Python 799 83 Updated May 6, 2026

inspatio / inspatio-world

Python 917 68 Updated Apr 13, 2026

brooks376 / Happy-Horse-1.0

Information collection for the Happy Horse AI video generator model. Official demo and updates at happyhorses.io.

635 60 Updated May 12, 2026

leeruibin / hybrid-forcing

Python 32 Updated Apr 29, 2026

cswry / VOSR

[CVPR2026] VOSR: A Vision-Only Generative Model for Image Super-Resolution

Python 128 9 Updated Apr 12, 2026

apple / ml-ssd

Python 783 62 Updated Apr 16, 2026

Tencent-Hunyuan / HunyuanWorld-Mirror

[ICML 2026] WorldMirror: Fast and Universal 3D reconstruction model for versatile tasks

Python 1,140 114 Updated May 27, 2026

zjr2000 / SPES

Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"

Python 22 Updated May 8, 2026

ResearAI / DeepScientist

Now, Stronger AI Pushes Frontiers, Stronger Our Shared Future.

TypeScript 3,062 324 Updated Jun 15, 2026

lukasHoel / video_to_world

Our method reconstructs 3D worlds from video diffusion models using non-rigid alignment to resolve inherent 3D inconsistencies in the generated sequences.

Python 257 23 Updated Apr 27, 2026

MoyangLi00 / DROID-W

[CVPR 2026] DROID-SLAM in the Wild

Python 391 44 Updated Mar 26, 2026

YBZh / CheXOne

CheXOne: A Reasoning-Enabled Vision–Language Foundation Model for Chest X-ray Interpretation

Python 42 2 Updated Apr 12, 2026

Chen Liyi mt-cly

Highlights

Lists (29)

2d edit

3D

3D edit

4D

agent

attention encoder

COT

cross-modality

depth

detection

diffusion

diffusion+3D

frameworks

GAN

gaussian splatting

interactive segmentation

LV-models

mamba

multi-modal

multi-modalities

NLP

open-vocabulary

others

PEFT

RL

segmentation

tracking

video

WSSS

Stars