Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 275 9 Updated Sep 30, 2025

facebookresearch / map-anything

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 1,915 103 Updated Oct 9, 2025

NJU-3DV / SpatialVID

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 378 10 Updated Oct 8, 2025

SarahWeiii / pamo

[PG2025] PaMO: Parallel Mesh Optimization for Intersection-Free Low-Poly Modeling on the GPU

Python 67 5 Updated Sep 12, 2025

ByteDance-Seed / m3-agent

Python 1,000 85 Updated Oct 9, 2025

ByteDance-Seed / seed-oss

Python 825 44 Updated Sep 15, 2025

FoundationVision / Waver

A video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

616 42 Updated Aug 27, 2025

scottpetrovic / mesh2motion-app

Import a 3D Model and automatically assign and export animations

TypeScript 1,525 119 Updated Sep 24, 2025

attention-survey / Efficient_Attention_Survey

A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention

187 4 Updated Aug 26, 2025

NVIDIA / warp

A Python framework for accelerated simulation, data generation and spatial computing.

Python 5,639 368 Updated Oct 10, 2025

Seed3D / Puppeteer

[NeurIPS 2025 Spotlight] Official repository for “Puppeteer: Rig and Animate Your 3D Models”

Python 216 13 Updated Sep 19, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 7,597 483 Updated Oct 3, 2025

nv-tlabs / vipe

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,341 103 Updated Sep 25, 2025

XunhaoLai / native-sparse-attention-triton

Efficient Triton implementation of Native Sparse Attention.

Python 235 17 Updated May 23, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,599 301 Updated Sep 30, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,756 1,838 Updated Oct 6, 2025

ByteVisionLab / DetailFlow

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

Python 156 9 Updated Jul 10, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 9,869 991 Updated Sep 19, 2025

alibaba-damo-academy / Lumos

Lumos Project: Frontier generative model research by Alibaba DAMO Academy, including Lumos-1, etc.

Python 137 3 Updated Jul 17, 2025

wzzheng / StreamVGGT

Code for Streaming 4D Visual Geometry Transformer

Python 655 27 Updated Aug 15, 2025

yyfz / Pi3

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,263 56 Updated Sep 10, 2025

liruilong940607 / prope

Cameras as Relative Positional Encoding

Python 584 7 Updated Sep 15, 2025

hkdsc / copart

CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.

Python 188 3 Updated Jul 23, 2025

Guan Luo logan0601

Stars

Relaxed-System-Lab / Flash-Sparse-Attention

Mengmouxu / SceneGen

triton-lang / triton

thu-ml / SLA

cwchenwang / physctrl

NVlabs / DiffusionNFT

Tencent-Hunyuan / Hunyuan3D-Part

HorizonWind2004 / reconstruction-alignment