View xiyichen's full-sized avatar
Latent Spatial Memory for Video World Models

189 5 Updated Jun 9, 2026
Python 448 21 Updated Jun 8, 2026

Facial Expression Analysis Toolbox

Python 364 92 Updated Jun 10, 2026
Python 15 3 Updated Mar 20, 2024

A collection of examples for the MediaPipe Task APIs that can run fully inside your browser.

TypeScript 40 10 Updated Jun 12, 2026

PaGeR — Unified Panoramic Geometry Estimation via Multi-View Foundation Models

Python 122 9 Updated May 29, 2026

Official Code for the CVPR 2026 Paper "MATCH: Feed-forward Gaussian Registration for Head Avatar Creation and Editing"

Python 14 Updated Jun 6, 2026

[CVPR2026] Official Implementation of Voxify3D

Python 35 1 Updated May 21, 2026

[CVPR 2026 Oral] Official implementation for ChordEdit: One-Step Low-Energy Transport for Image Editing

Python 260 9 Updated May 13, 2026

A novel multi-view feedforward network that enables direct and robust object pose estimation from a query image.

17 Updated Jun 5, 2026

Official implementation for the CVPR'23 paper: Visibility Aware Human-Object Interaction Tracking from Single RGB Camera

Python 79 3 Updated Jun 10, 2023

[CVPR 2026 Oral] 4D Primitive-Mâché: Glueing Primitives for Persistent 4D Scene Reconstruction

Python 22 1 Updated Jun 4, 2026

Awesome Unified Multimodal Models

1,281 40 Updated Mar 24, 2026

TripoSplat, developed by TripoAI, converts a single 2D image into a variable number of high-quality 3D Gaussians.

Python 641 63 Updated Jun 2, 2026

Finetune HunyuanImage 3.0, an 80B unified understanding and generation model

Python 36 3 Updated Jan 7, 2026

[CVPR 2026] Official code for BulletTime: Decoupled Control of Time and Camera Pose for Video Generation

Python 9 Updated Jun 1, 2026

[CVPR 2026] Official code for BulletTime: Decoupled Control of Time and Camera Pose for Video Generation

Python 5 Updated Jun 1, 2026

TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

Python 310 20 Updated Jun 12, 2026

Official code for MAMMA: Markerless Accurate Multi-person Motion Acquisition.

Python 512 45 Updated Jun 11, 2026

[SIGGRAPH 2026 Conference] FreeOrbit4D: Training-free Arbitrary Camera Redirection for Monocular Videos via Foreground-Complete 4D Reconstruction

Python 58 4 Updated May 14, 2026

Re-implementation Code for "Archon: A Unified Multimodal Model for Holistic Digital Human Generation", CVPR 2026

8 Updated May 29, 2026

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation (NeurIPS 2023 Spotlight)

Python 1,566 44 Updated Nov 22, 2023

A Comprehensive Survey of Interactive Video World Models

176 11 Updated Jun 12, 2026

Official implementation of No Pose, No Problem in 4D: Feed-Forward Dynamic Gaussians from Unposed Multi-View Videos

Python 58 1 Updated May 27, 2026

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 3,123 166 Updated Feb 3, 2026

Claude Code skill: read & write Overleaf projects via the git bridge. Works on Mac/Linux/WSL.

24 Updated May 7, 2026

[CVPR 2026 Oral] VGGT Omega

Python 2,928 118 Updated May 18, 2026

Implementation of Open-World Visual Odometry with Temporal Dynamics Awareness (CVPR'26)

Python 108 8 Updated May 23, 2026

[SIGGRAPH 2026] Pixal3D: Pixel-Aligned 3D Generation from Images

Python 1,742 157 Updated May 24, 2026

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Python 201 1 Updated May 26, 2026