Skip to content
View BITcats's full-sized avatar
Focusing
Focusing

Block or report BITcats

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,836 108 Updated Dec 8, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,765 105 Updated Nov 4, 2025

Official implementation for "SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation"

44 Updated Dec 1, 2025

[IROS 24] Official repository of "Mind the Error! Detection and Localization of Instruction Errors in Vision-and-Language Navigation". We present the first dataset - R2R-IE-CE - to benchmark instru…

Python 18 1 Updated Jan 8, 2025

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 230 4 Updated Nov 27, 2025

Official Repo of "SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization"

Python 272 13 Updated Sep 14, 2025

Depth Anything 3

Python 3,640 311 Updated Dec 12, 2025

Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)

Python 64 5 Updated Dec 22, 2025

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

Python 2,042 155 Updated Mar 13, 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 302 10 Updated Dec 1, 2025

[CVPR 2025 Hightlight] PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes

Python 64 1 Updated Sep 22, 2025

[NeurIPS 2025] the official project page of a paper, "PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splatting"

Python 57 1 Updated Oct 24, 2025

PlaneRCNN detects and reconstructs piece-wise planar surfaces from a single RGB image

Python 602 129 Updated Oct 9, 2022

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,284 964 Updated Feb 25, 2022

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,548 94 Updated Nov 9, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,722 334 Updated Jan 21, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,314 1,449 Updated Nov 28, 2025

[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations

Python 637 34 Updated Jun 4, 2025

Python3 library for downloading YouTube Videos.

Python 1,388 173 Updated Dec 7, 2025

[NeurIPS 2025] Pixel-Perfect Depth

Python 683 28 Updated Dec 21, 2025

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,540 155 Updated Dec 18, 2025

The public CGAL repository, see the README below

C++ 5,675 1,512 Updated Dec 19, 2025

A collection of useful functions for 3D vision & graphics research in Python.

Python 227 24 Updated Dec 18, 2025

[ICLR 2025 Oral] NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields

Python 55 Updated Jul 2, 2025

A general and accurate MACs / FLOPs profiler for PyTorch models

Python 632 43 Updated Jul 29, 2025

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Python 270 17 Updated Dec 3, 2025

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

Python 745 48 Updated Oct 26, 2025

ViPE: Video Pose Engine for Geometric 3D Perception

Python 1,572 120 Updated Dec 9, 2025

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 2,136 138 Updated Nov 2, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,482 78 Updated Dec 20, 2025
Next