Skip to content
View YonghaoXu's full-sized avatar

Block or report YonghaoXu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICCV 2025] SAM4D: Segment Anything in Camera and LiDAR Streams

Jupyter Notebook 205 8 Updated Sep 23, 2025

A curated list of papers that focus on how to represent Earth data in embedding space — spatial, temporal, or semantic — and how those embeddings behave or are applied.

41 2 Updated Nov 25, 2025

[NeurIPS 2025] DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response

Python 87 1 Updated Nov 6, 2025

[NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset

Python 39 Updated Nov 14, 2025

[CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.

Python 100 8 Updated Jun 20, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,320 2,131 Updated Dec 18, 2025

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,497 499 Updated Mar 22, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,168 6,626 Updated Dec 22, 2025

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

Python 520 48 Updated Dec 20, 2025

Comparing MOD14 and VNP14 fire products.

Python 5 Updated Mar 12, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 3,576 595 Updated Dec 22, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,415 462 Updated Dec 18, 2025

[ESSD 2025] BRIGHT: A globally distributed multimodal VHR dataset for all-weather disaster response

Python 195 27 Updated Dec 17, 2025

A ready-to-use curated list of Spectral Indices for Remote Sensing applications.

Python 1,055 168 Updated Oct 11, 2025

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 672 60 Updated Nov 28, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,922 599 Updated Jul 17, 2024

[JAG 2024] UAD-RS: Universal adversarial defense in remote sensing based on pre-trained denoising diffusion models

Python 12 Updated Nov 8, 2024

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 14,069 3,017 Updated Oct 22, 2025

ROS-Industrial Universal Robots support (https://wiki.ros.org/universal_robot)

C++ 1,356 1,086 Updated Oct 13, 2025

Prototyping robots for PyBullet (F1/10 MIT Racecar, Sawyer, Baxter and Dobot arm, Boston Dynamics Atlas and Botlab environment)

Python 534 201 Updated Nov 25, 2025

An open source implementation of CLIP.

Python 13,146 1,220 Updated Nov 4, 2025

PyTorch implementation of popular datasets and models in remote sensing

Python 405 50 Updated May 7, 2023

S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions

Python 50 6 Updated May 26, 2023

[NeurIPS 2024 Spotlight] Official repository of SynRS3D

Python 68 6 Updated May 15, 2025

A PyTorch implementation of "GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis"

Python 107 12 Updated Nov 29, 2024

The official repo for [TPAMI'25] "HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model"

Python 322 26 Updated Dec 9, 2025

Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"

Python 78 2 Updated Dec 21, 2024

RS5M: a large-scale vision language dataset for remote sensing [TGRS]

Python 291 15 Updated Mar 17, 2025

Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"

Python 193 14 Updated Dec 10, 2024
Next