Organizations

@EnVision-Research

Official repo for UAE

Python 72 1 Updated Dec 24, 2025

Code for the paper "Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations"

HTML 28 Updated Dec 23, 2025

A Foundation Model for Generalist Gaming Agents

Python 889 105 Updated Dec 23, 2025

End-to-end pipeline converting generative videos (Veo, Sora) to humanoid robot motions

Python 410 43 Updated Dec 18, 2025
Python 53 1 Updated Dec 23, 2025

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 355 8 Updated Dec 16, 2025

A paper list for spatial reasoning

552 32 Updated Dec 24, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 1,604 90 Updated Dec 24, 2025

Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models

Python 49 2 Updated Dec 21, 2025

Atom3d, atomising geometry, is a mesh processing toolbox specifically designed for 3D learning.

Python 52 2 Updated Dec 23, 2025

The official implementation of StereoPilot

Python 69 1 Updated Dec 19, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch

Python 1,781 181 Updated Dec 20, 2025

RealSee3D: A multi-view RGB-D dataset combining real-world captures and procedurally generated scenes, with extensible annotations for diverse 3D vision research.

Python 208 8 Updated Dec 18, 2025

A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.

Python 2,076 89 Updated Dec 15, 2025

Native and Compact Structured Latents for 3D Generation

Python 2,297 158 Updated Dec 23, 2025

Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"

34 Updated Dec 12, 2025

Official Implementation of Particulate: Feed-Forward 3D Object Articulation

Python 78 4 Updated Dec 15, 2025

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

Jupyter Notebook 1,020 66 Updated Dec 15, 2025

NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

129 4 Updated Oct 20, 2025

A Cross-Platform Backend for High-Performance Sparse Convolutions

Python 86 7 Updated Dec 20, 2025

JavaScript 3D Library.

JavaScript 110,002 36,183 Updated Dec 24, 2025

Cuda mesh utils.

Cuda 85 10 Updated Dec 24, 2025
HTML 50 1 Updated Dec 8, 2025

Official implementation of Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

Python 205 12 Updated Dec 8, 2025

Vision Bridge Transformer at Scale

Python 126 6 Updated Dec 1, 2025
37 Updated Nov 26, 2025

Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) by learning transferable latent camera pose representations.

Python 76 1 Updated Oct 22, 2025
Python 7,814 460 Updated Dec 24, 2025

Matterport3D is a pretty awesome dataset for RGB-D machine learning tasks :)

C++ 1,148 156 Updated Nov 3, 2025

Official inference repo for FLUX.2 models

Python 1,268 64 Updated Dec 1, 2025