Organizations

@Gorilla-Lab-SCUT

Sharp Monocular View Synthesis in Less Than a Second

Python 8,547 619 Updated Dec 19, 2025

Official repo for ArtiLatent (SIGGRAPH Asia 2025)

Jupyter Notebook 57 2 Updated Jun 1, 2026

🌐 3D and 4D World Modeling: A Survey

HTML 932 53 Updated May 19, 2026

(ICLR2026) ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation

Python 607 18 Updated Apr 4, 2026

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,569 164 Updated Apr 15, 2026

[3DV 2026] FastMesh: Efficient Artistic Mesh Generation via Component Decoupling

Python 132 5 Updated Nov 11, 2025

Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination

Python 958 61 Updated Sep 25, 2025

From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Python 3,577 523 Updated Oct 17, 2025

[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding

Python 569 30 Updated Oct 20, 2025

[NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention

Python 1,254 106 Updated Sep 26, 2025

[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

Python 468 17 Updated Feb 5, 2026

Open-source unified multimodal model

Python 6,017 533 Updated May 4, 2026

Official implementation of BLIP3o-Series

Python 1,658 79 Updated Nov 29, 2025

[ICCV 2025] Code release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 191 6 Updated May 21, 2025

A curated list of awesome Neural Computer-Aided Design (CAD) papers.

HTML 189 18 Updated Mar 13, 2026

[AAAI2025] CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs

55 5 Updated Jun 13, 2025

[CVPR 2025] Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"

Python 584 31 Updated Jul 2, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,775 83 Updated Jan 12, 2026

[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory

Python 367 16 Updated Feb 21, 2026

PyTorch implementation of the paper "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 428 25 Updated Jun 20, 2025

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

Python 541 39 Updated Jun 30, 2025

SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling

Python 751 57 Updated Apr 7, 2025

[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…

Python 2,535 95 Updated Mar 1, 2026

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,696 184 Updated Apr 18, 2025

OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation [SIGGRAPH 2025]

Python 204 8 Updated Sep 18, 2025

[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction

Python 434 16 Updated Jun 6, 2025

Multimodal Models in Real World

Jupyter Notebook 557 24 Updated Feb 24, 2025

Python 2,508 246 Updated Jul 16, 2025

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

826 41 Updated Oct 10, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 8,294 643 Updated Jun 16, 2026