KHao123

Kanghao Chen KHao123

Ph.D student in AI Thrust, HKUST(GZ)

17 followers · 9 following

HKUST(GZ)
Guangdong, China
KHao123.github.io
@KaneChen9707

Achievements

Highlights

Lists (3)

Sort

Starred repositories

EnVision-Research / LatentMorph

LatentMorph: Morphing Latent Reasoning into Image Generation

Python 25 Updated Feb 3, 2026

EthanLiang99 / EvLight

The source code for "Towards Robust Event-guided Low-Light Image Enhancement: A Large-Scale Real-World Event-Image Dataset and Novel Approach" (CVPR24 Oral & TPAMI25)

Python 98 6 Updated Feb 2, 2026

EnVision-Research / DualCamCtrl

Official Implementation of Paper [DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation]

Python 74 1 Updated Dec 29, 2025

EnVision-Research / TiViBench

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Python 64 1 Updated Nov 27, 2025

Bria-AI / FIBO

FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.

Python 303 14 Updated Jan 7, 2026

FYYDCC / IVT-LR

Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”

Python 18 1 Updated Jan 27, 2026

zhangquanchen / 3DThinker

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Python 182 5 Updated Dec 9, 2025

bytedance / mammothmoda

Python 57 4 Updated Jan 30, 2026

EnVision-Research / MTI

Official implementation of "Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention"

Python 35 Updated Jan 8, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,555 1,188 Updated Feb 5, 2026

EnVision-Research / PhysToolBench

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Python 27 3 Updated Oct 20, 2025

showlab / Paper2Video

Automatic Video Generation from Scientific Papers

Python 2,108 304 Updated Oct 20, 2025

weijiawu / Awesome-RL-for-Multimodal-Foundation-Models

📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.

410 20 Updated Feb 5, 2026

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,348 60 Updated Dec 7, 2025