[ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with Mimimal 3D Knowledge".

Python 63 6 Updated May 13, 2026

houyuanchen111 / UniVidX

[SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors".

Python 233 9 Updated May 15, 2026

Luo-Yihang / 4RC

[ICML 2026] 4RC: 4D Reconstruction via Conditional Querying Anytime and Anywhere

Python 176 3 Updated May 18, 2026

thinkwee / AwesomeOPD

Awesome List for On-Policy Distillation

653 11 Updated Jun 13, 2026

thunlp / OPD

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Python 683 43 Updated May 30, 2026

SIBench / Awesome-Visual-Spatial-Reasoning

This is a project about visual spatial reasoning.

HTML 136 5 Updated May 6, 2026

mll-lab-nu / Awesome-Spatial-Intelligence-in-VLM

A paper list for spatial reasoning

755 42 Updated Jan 19, 2026

Vegetebird / CA-MLLM

[ICLR 2026] Official implementation of the paper "📷 On the Generalization Capacities of MLLMs for Spatial Intelligence"

Python 29 1 Updated Mar 17, 2026

facebookresearch / vjepa2

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 4,185 512 Updated Mar 23, 2026

visinf / MARCO

[CVPR 2026 Oral] "MARCO: Navigating the Unseen Space of Semantic Correspondence"

Python 139 6 Updated Apr 21, 2026

NOVAglow646 / Monet

[CVPR 2026] Official codes of "Monet: Reasoning in Latent Visual Space Beyond Image and Language"

Python 199 3 Updated Mar 19, 2026

Visual-Agent / DeepEyes

Python 1,237 78 Updated Nov 20, 2025

Wakals / CoVT

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 368 20 Updated Apr 17, 2026

ZizhuoLi / CoMatch

[ICCV '25 Highlight] CoMatch: Dynamic Covisibility-Aware Transformer for Bilateral Subpixel-Level Semi-Dense Image Matching

Python 37 4 Updated Jul 25, 2025

knightnemo / Awesome-World-Models

A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.

3,032 125 Updated Jun 12, 2026

LaVi-Lab / VG-LLM

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 240 8 Updated Nov 28, 2025

H-EmbodVis / VEGA-3D

[ECCV 2026] Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding

Python 418 22 Updated Jun 18, 2026

ZhangGongjie / 2D-3D-Lifting

Python 77 2 Updated Oct 1, 2025

amap-cvlab / ABot-PhysWorld

Python 339 16 Updated Apr 24, 2026

QitaoZhao / E-RayZer

[CVPR 2026] "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.

Python 296 15 Updated May 30, 2026

yyfz / Pi3

[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning

Python 2,023 156 Updated May 18, 2026

zhangquanchen / 3DThinker

[CVPR 2026] Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Python 237 7 Updated May 7, 2026

AnjieCheng / SR-3D

[ICLR'26] This repository is the implementation of "3D Aware Region Prompted Vision Language Model"

Python 26 Updated Feb 19, 2026

usememos / memos

Open-source, self-hosted note-taking tool built for quick capture. Markdown-native, lightweight, and fully yours.

Go 60,898 4,478 Updated Jun 15, 2026

ranrhuang / SPFSplat

[ICCV 2025 Highlight] No Pose at All: Self-Supervised Pose-Free 3D Gaussian Splatting from Sparse Views

Python 152 8 Updated Dec 5, 2025

sugarfly sugar-fly

Starred repositories

active-learning

data-selection