-
Nanyang Technological University
- Singapore
-
06:19
(UTC +08:00) - https://cyw-3d.github.io/
Highlights
- Pro
Stars
Sharp Monocular View Synthesis in Less Than a Second
official repo for ArtiLatent (siggraph asia 2025)
🌐 3D and 4D World Modeling: A Survey
(ICLR2026) ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
[3DV 2026] FastMesh: Efficient Artistic Mesh Generation via Component Decoupling
Official Code Release for [SIGGRAPH 2025] RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding
[NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
[NeurIPS 2025 Spotlight] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
A curated list of awesome Neural Computer-Aided Design (CAD) papers.
[AAAI2025] CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs
[CVPR 2025] Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"
Emu Series: Generative Multimodal Models from BAAI
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
SparseFlex: High-Resolution and Arbitrary-Topology 3D Shape Modeling
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models
OctGPT: Octree-based Multiscale Autoregressive Models for 3D Shape Generation [SIGGRAPH 2025]
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer