Skip to content
View Lxiangyue's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@HKUST-SAIL

Block or report Lxiangyue

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A 3B-active-parameter native unified multimodal model for image and video understanding, generation, and editing.

Python 1,208 80 Updated Jun 16, 2026

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 10,060 783 Updated Sep 22, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,875 2,773 Updated Aug 12, 2024
Python 10 Updated Mar 31, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 13,370 1,490 Updated May 19, 2026

[CVPR 2026]Official implementation of "UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes"

Python 197 9 Updated Mar 19, 2026

Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models"

Python 17 Updated Jun 3, 2025

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Python 874 59 Updated Sep 8, 2025

Lets make video diffusion practical!

Python 17,028 1,700 Updated Oct 16, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,238 69 Updated Feb 25, 2025

A simulation platform for versatile Embodied AI research and developments.

Python 1,261 78 Updated Sep 4, 2025

[CVPR2025] EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild

Python 108 6 Updated Feb 11, 2026
Python 28 2 Updated Dec 9, 2025

[CoRL 2024] DexGraspNet 2.0: Learning Generative Dexterous Grasping in Large-scale Synthetic Cluttered Scenes

Python 144 20 Updated Jan 23, 2025
Python 439 49 Updated Jan 6, 2025

[ICLR'25] 🍀 DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References

Python 119 7 Updated Apr 14, 2026

DressRecon: Freeform 4D Human Reconstruction from Monocular Video (3DV'25 Oral)

Python 142 6 Updated Feb 13, 2025

[3DV'25] GaussianAvatar-Editor: Photorealistic Animatable Gaussian Head Avatar Editor

Python 40 3 Updated Feb 6, 2025

A Video Tokenizer Evaluation Dataset

Python 158 12 Updated Jan 13, 2025

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

Jupyter Notebook 10,261 674 Updated Jun 15, 2026

Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"

Python 245 25 Updated Jan 15, 2025

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 1,318 79 Updated Jan 5, 2026

Depth Any Video with Scalable Synthetic Data (ICLR 2025)

Python 517 29 Updated Dec 4, 2024

RaDe-GS: Rasterizing Depth in Gaussian Splatting

C++ 683 50 Updated Mar 19, 2026

More relighting!

Python 8,443 524 Updated Feb 20, 2025

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Python 3,160 209 Updated Dec 10, 2025

[ECCV 2024] HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.

Python 216 13 Updated Apr 10, 2025

[SIGGRAPH'24] Official code of HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

Python 78 5 Updated Aug 16, 2024

[SIGGRAPH 2024] Official PyTorch Implementation of "BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry".

Python 414 53 Updated Sep 13, 2024
Next