Skip to content
View ug-kim's full-sized avatar
🌟
🌟

Block or report ug-kim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 2,294 166 Updated Dec 19, 2025
Python 45 Updated Mar 14, 2025

Code accompanying our ECCV-2020 paper on 3D Neural Listeners.

C++ 137 15 Updated Jun 29, 2021

Describe Anything, Anywhere, at Any Moment (DAAAM), a novel approach to real-time, large-scale, spatio-temporal memory

117 2 Updated Dec 7, 2025

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,202 95 Updated Dec 17, 2025

Official implementation of "Emergent Outlier View Rejection in Visual Geometry Grounded Transformers"

Python 123 4 Updated Dec 8, 2025

Official repository of "Multi-view Pyramid Transformer: Look Coarser to See Broader"

Python 118 6 Updated Dec 17, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,009 881 Updated Dec 4, 2025

A simple state update rule to enhance length generalization for CUT3R

Python 543 17 Updated Oct 1, 2025

[CVPR 2025 Oral & Best Paper Finalist] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Python 970 76 Updated Jun 28, 2025

SAM 3D Objects

Python 5,026 464 Updated Dec 16, 2025

Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)

Python 64 5 Updated Dec 22, 2025

[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Python 98 5 Updated Feb 2, 2025

Official repo and evaluation implementation of VSI-Bench

Python 656 39 Updated Aug 5, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 6,312 731 Updated Dec 21, 2025

Depth Anything 3

Python 3,640 311 Updated Dec 12, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,837 108 Updated Dec 8, 2025
Python 25 1 Updated Nov 17, 2025

VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold

Python 668 61 Updated Nov 19, 2025

Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"

Python 371 11 Updated Nov 24, 2025

Open-world 3D part segmentation of point clouds

Python 107 5 Updated Jul 27, 2025

[Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding

Python 87 1 Updated Nov 4, 2025

An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.

Python 445 14 Updated Dec 2, 2025

[NeurIPS 2025] Pixel-Perfect Depth

Python 682 28 Updated Dec 21, 2025

[NeurIPS 2024] AV-Cloud: Spatial Audio Rendering Through Audio-Visual Cloud Splatting

Python 12 2 Updated Nov 22, 2025

Official implementation of DepthLM

Python 276 12 Updated Oct 7, 2025

[CVPR 2025] Towards In-the-wild 3D Plane Reconstruction from a Single Image

Python 67 Updated Oct 6, 2025

Codebase for SparseGS paper

Python 210 23 Updated May 16, 2024

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Python 2,540 155 Updated Dec 18, 2025

SpatialVID: A Large-Scale Video Dataset with Spatial Annotations

Python 449 14 Updated Dec 15, 2025
Next