  • The University of Hong Kong
  • Hong Kong SAR

Organizations

@CVMI-Lab

"LLaFEA: Frame-Event Complementary Fusion for Fine-Grained Spatiotemporal Understanding in LMMs", accepted by ICCV 2025

Python 12 Updated Jul 28, 2025
Python 5 Updated Dec 23, 2025

(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Python 66 Updated Oct 14, 2025

(ICCV 2025) How Far are AI-generated Videos from Simulating the 3D Visual World: A Learned 3D Evaluation Approach

Python 6 Updated Nov 10, 2025

[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Python 517 27 Updated Nov 29, 2025

ICCV 2025

14 Updated Mar 26, 2026

[ICLR 2026] QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 495 51 Updated Mar 30, 2026

[ICLR 2026] LongLive: Real-time Interactive Long Video Generation

Python 1,136 105 Updated Feb 26, 2026

Implementation of the paper “GV-VAD: Exploring Video Generation for Weakly-Supervised Video Anomaly Detection”

Python 8 Updated Oct 9, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,720 2,508 Updated Mar 5, 2026

(ICCV 2025) Holistic Tokenizer for Autoregressive Image Generation

Python 33 1 Updated Oct 9, 2025
Python 296 40 Updated Mar 25, 2026

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Python 1,173 177 Updated Sep 15, 2024

Official code for the CVPR 2025 paper "Navigation World Models".

Python 577 58 Updated Nov 24, 2025

The official implementation of the paper "UrbanWorld: An Urban World Model for 3D City Generation"

Python 50 5 Updated Nov 9, 2024
Python 109 9 Updated Sep 26, 2025

[NeurIPS 2025] Official repository for "Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought".

Python 21 Updated Jan 30, 2026

Official Implementation of "VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning".

Python 63 5 Updated Nov 20, 2025

[CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

Python 365 25 Updated Mar 9, 2026

[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

Python 68 1 Updated Jul 22, 2025

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,396 216 Updated May 19, 2025

OpenEQA: Embodied Question Answering in the Era of Foundation Models

Jupyter Notebook 346 28 Updated Sep 20, 2024

Universal Monocular Metric Depth Estimation

Python 1,153 108 Updated May 18, 2025

[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration

Jupyter Notebook 306 23 Updated Nov 2, 2024

[NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation

Python 173 9 Updated Sep 30, 2024

[ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Python 242 12 Updated Jul 14, 2025

[CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Python 315 13 Updated Sep 16, 2025

[ECCV 2024] SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM

Jupyter Notebook 501 46 Updated Nov 20, 2025

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 2,024 209 Updated Aug 7, 2024

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 21,190 3,068 Updated Oct 17, 2025