Skip to content
View MemorySlices's full-sized avatar

Highlights

  • Pro

Block or report MemorySlices

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Python 761 37 Updated Jun 3, 2026

Official implementation of Déjà View: Looping Transformers for Multi-View 3D Reconstruction

Python 333 14 Updated Jun 1, 2026

The first "ImageNet" 3D dataset.

Python 75 Updated Jun 17, 2026
Python 443 32 Updated Mar 19, 2026

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 10,613 1,595 Updated Jun 15, 2026

GLUEMAP: Global Structure-from-Motion Meets Feedforward Reconstruction

Python 265 12 Updated May 26, 2026

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation

Python 288 10 Updated Nov 18, 2025

[CVPR 2026 Oral] VGGT Omega

Python 3,071 135 Updated May 18, 2026
Python 16 1 Updated Jun 17, 2026

[SIGGRAPH 2026] Pixal3D: Pixel-Aligned 3D Generation from Images

Python 1,784 165 Updated May 24, 2026

Tools for the Embody 3D Dataset

Python 256 11 Updated Oct 30, 2025

[CVPR 2022] Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation

Python 239 12 Updated Mar 11, 2022

Perception toolkit for sim2real training and validation in Unity

C# 994 186 Updated Nov 8, 2024

1K resolution vision transformers pretrained on 1B human images.

Python 805 52 Updated May 24, 2026

[ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI

Python 330 26 Updated Jun 2, 2026
Jupyter Notebook 169 7 Updated Jun 8, 2026

Generate images of code and terminal output 📸

Go 4,683 97 Updated Jun 2, 2026

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Python 2,247 187 Updated May 27, 2026

Official code base for LeWorldModel: Stable End-to-End Joint-Embedding Predictive Architecture from Pixels

Python 3,892 536 Updated May 26, 2026

Code for the ShapeR research paper

Python 793 59 Updated Apr 30, 2026

A ~9M parameter LLM that talks like a small fish.

Python 3,250 286 Updated Apr 15, 2026

A tutorial and a set of tools to compute depth-from-stereo with Project Aria Gen2 devices. This includes stereo image rectification as well as disparity estimation

Jupyter Notebook 96 14 Updated May 26, 2026

Reimplementation of LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Python 592 44 Updated Apr 27, 2026
Python 20 Updated May 22, 2026

Masked Depth Modeling for Spatial Perception

Python 1,223 95 Updated Jun 18, 2026

Official code for Zero-Shot Depth from Defocus (https://arxiv.org/abs/2603.26658)

Python 50 4 Updated Apr 5, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 16,273 2,021 Updated Mar 17, 2026

[NeurIPS 2025] Sekai: A Video Dataset towards World Exploration

Python 297 6 Updated Dec 31, 2025

WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG

349 3 Updated Mar 25, 2026

[ECCV 2026] WAFT-Stereo: Warping-Alone Field Transforms for Stereo Matching

Python 90 5 Updated May 5, 2026
Next