udaysankar01

Starred repositories

A curated list of papers & resources on anomaly detection foundation models using large language models, vision-language models, graph foundation models, time series foundation models, etc.

120 7 Updated Dec 15, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 2,576 257 Updated Aug 28, 2025

OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer

Python 222 11 Updated Dec 3, 2025

Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"

Python 91 1 Updated Nov 25, 2025

Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use

Python 25 Updated Nov 4, 2025

A fork to add multimodal model training to open-r1

Python 1,433 70 Updated Feb 8, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,293 7,793 Updated Dec 21, 2025

Code for Streaming 4D Visual Geometry Transformer

Python 758 32 Updated Oct 27, 2025

Code for FastVGGT: Training-Free Acceleration of Visual Geometry Transformer

Python 636 34 Updated Nov 15, 2025
Python 12 Updated Nov 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,887 655 Updated Nov 20, 2025

Official PyTorch implementation of "Weakly Supervised Video Scene Graph Generation via Natural Language Supervision", accepted at ICLR 2025

Jupyter Notebook 23 1 Updated Jun 13, 2025

Official repository for "AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos" (CVPR 2025)

Python 286 12 Updated May 7, 2025

[CVPR 2025 (Highlight)] Towards Unbiased and Robust Spatio-Temporal Scene Graph Generation and Anticipation

Python 12 1 Updated Jul 14, 2025

This is the official implementation of "DiffVsgg: Diffusion-Driven Online Video Scene Graph Generation" (Accepted at CVPR 2025).

10 Updated Mar 5, 2025

This is the project for 'USG'.

CSS 31 Updated Apr 7, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,827 2,571 Updated Dec 19, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,482 78 Updated Dec 20, 2025

Official implementation of VGGT-Long

Python 710 43 Updated Dec 16, 2025

This is a repository for listing papers on scene graph generation and application.

530 35 Updated Dec 11, 2025

[arXiv'25] 🌈 Unseen 3D Geometry Reasoning from a Single Image.

Python 73 2 Updated Jul 10, 2025
Python 10 1 Updated Apr 16, 2025

VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold

Python 667 61 Updated Nov 19, 2025

Paper Survey for Transformer-based SLAM

229 15 Updated Dec 21, 2025

📌 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map [RSS' 25]

Python 201 9 Updated Sep 2, 2025

Platform for Deep Learning based SLAM

Python 376 34 Updated Nov 6, 2024

Code for "LiftFeat: 3D Geometry-Aware Local Feature Matching", ICRA2025

Python 223 20 Updated Sep 23, 2025

[ICRA 2025] Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems

C++ 243 11 Updated Jul 23, 2025

[ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.

Python 495 25 Updated Apr 1, 2025