Skip to content
View YuxuanSnow's full-sized avatar

Highlights

  • Pro

Organizations

@tum-phoenix

Block or report YuxuanSnow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
253 stars written in Python
Clear filter

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,690 445 Updated May 29, 2024

s1: Simple test-time scaling

Python 6,593 762 Updated Jun 25, 2025

[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"

Python 6,456 932 Updated May 13, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,345 469 Updated Aug 7, 2024

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5,867 716 Updated Aug 16, 2024

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Python 5,328 1,765 Updated Nov 6, 2025

Use commands in English to control Blender with OpenAI's GPT-4

Python 4,880 390 Updated Jun 5, 2024

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,867 381 Updated Apr 7, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,739 450 Updated Aug 19, 2024
Python 4,375 417 Updated Sep 14, 2025

Make human motion capture easier.

Python 4,320 523 Updated Feb 26, 2025

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Python 4,243 387 Updated Jan 2, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 4,185 371 Updated Apr 8, 2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Python 4,127 461 Updated Jan 3, 2025

Model summary in PyTorch similar to `model.summary()` in Keras

Python 4,064 415 Updated Mar 2, 2024

Witness the aha moment of VLM with less than $3.

Python 3,976 290 Updated May 19, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,650 257 Updated Feb 13, 2025

The best OSS video generation models, created by Genmo

Python 3,494 452 Updated Sep 5, 2025

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,446 280 Updated May 31, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,958 239 Updated Sep 8, 2024

Unofficial Implementation of Animate Anyone

Python 2,936 242 Updated Jul 9, 2024

Isaac Gym Reinforcement Learning Environments

Python 2,724 496 Updated Oct 26, 2024

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Python 2,588 373 Updated Mar 3, 2025

【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models

Python 2,267 140 Updated Jul 15, 2025

Next-Token Prediction is All You Need

Python 2,249 88 Updated Mar 17, 2025

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,235 94 Updated Feb 16, 2025

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Python 2,230 309 Updated Oct 4, 2023

A Unified Framework for Surface Reconstruction

Python 2,082 196 Updated Jul 11, 2024

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 1,982 148 Updated Mar 13, 2025