
Native and Compact Structured Latents for 3D Generation

Python 1,833 116 Updated Dec 17, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 11,338 1,043 Updated Nov 5, 2025

Unified Multimodal Model for image generation/editing/understanding

Python 818 38 Updated Sep 8, 2025

Official implementation of the paper "Transfer between Modalities with MetaQueries"

Python 279 9 Updated Oct 12, 2025
Python 7,489 442 Updated Dec 14, 2025

Official implementation of BLIP3o-Series

Python 1,610 72 Updated Nov 29, 2025

Native Multimodal Models are World Learners

Python 1,367 52 Updated Nov 28, 2025

[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think

Python 227 17 Updated Oct 4, 2025

Official PyTorch Implementation of "Latent Diffusion Model Without Variational Autoencoder".

Python 380 13 Updated Dec 15, 2025

[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Python 105 5 Updated Nov 3, 2025

A part-based 3D generation framework & the largest and most comprehensively annotated 3D part dataset.

Jupyter Notebook 113 4 Updated Dec 15, 2025

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

102 4 Updated Nov 28, 2025

SAM 3D Objects

Python 4,996 461 Updated Dec 16, 2025

Official code for VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator

81 Updated Oct 16, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,290 1,450 Updated Nov 28, 2025

[NeurIPS 2025] Improving Video Generation with Human Feedback

Python 378 9 Updated Sep 24, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,761 104 Updated Nov 4, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,924 1,505 Updated Dec 17, 2025

Enjoy the magic of Diffusion models!

Python 11,176 1,054 Updated Dec 20, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,437 361 Updated Dec 19, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,226 40 Updated Oct 26, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,708 128 Updated Dec 19, 2025

[NeurIPS 2025] SyncHuman: Synchronizing 2D and 3D Generative Models for Single-view Human Reconstruction.

Jupyter Notebook 58 1 Updated Oct 13, 2025
JavaScript 1 Updated Oct 22, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,702 1,265 Updated Oct 28, 2025

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 5,989 1,130 Updated Jul 25, 2024