Stars
[ICCV 2025] SAM4D: Segment Anything in Camera and LiDAR Streams
A curated list of papers that focus on how to represent Earth data in embedding space — spatial, temporal, or semantic — and how those embeddings behave or are applied.
[NeurIPS 2025] DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response
[NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset
[CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Using Low-rank adaptation to quickly fine-tune diffusion models.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Comparing MOD14 and VNP14 fire products.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
[ESSD 2025] BRIGHT: A globally distributed multimodal VHR dataset for all-weather disaster response
A ready-to-use curated list of Spectral Indices for Remote Sensing applications.
[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[JAG 2024] UAD-RS: Universal adversarial defense in remote sensing based on pre-trained denoising diffusion models
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning, etc.
ROS-Industrial Universal Robots support (https://wiki.ros.org/universal_robot)
Prototyping robots for PyBullet (F1/10 MIT Racecar, Sawyer, Baxter and Dobot arm, Boston Dynamics Atlas and Botlab environment)
An open source implementation of CLIP.
PyTorch implementation of popular datasets and models in remote sensing
S-CLIP: Semi-supervised Vision-Language Pre-training using Few Specialist Captions
[NeurIPS 2024 Spotlight] Official repository of SynRS3D
A PyTorch implementation of "GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis"
The official repo for [TPAMI'25] "HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model"
Official PyTorch implementation and benchmark dataset for IGARSS 2024 ORAL paper: "Composed Image Retrieval for Remote Sensing"
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
Official repo for "SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing"