Stars
The official repository of the first version of ACE-Brain foundation model.
Official repo for "Any2Any: Unified Arbitrary Modality Translation for Remote Sensing"
Official repo for "Seeing Clearly without Training: Mitigating Hallucinations in Multimodal LLMs for Remote Sensing"
Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
Official repo for "VLRS-Bench: A Vision-Language Reasoning Benchmark for Remote Sensing"
Official repo for "Degradation-Aware Metric Prompting for Hyperspectral Image Restoration"
Official repo for [CVPR 2026] "SARMAE: Masked Autoencoder for SAR Representation Learning"
Official repo for [CVPR 2026] "GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-Localization"
Official repo for [CVPR 2026] "UniGeoSeg: Towards Unified Open-World Segmentation for Geospatial Scenes"
Official repo for "GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes"
Official repo for [CVPR 2026 Findings] "DeepSketcher: Internalizing Visual Manipulation for Multimodal Reasoning"
Official repo for [CVPR 2026] "Residual Diffusion Bridge Model for Image Restoration"
[Information Fusion 2025] MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification
Official repo for "Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field"
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
Official repo for [NeurlPS 2025] "DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration"
Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"
Advancing Weakly-Supervised Change Detection in Satellite Images via Adversarial Class Prompting
Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation"
Official repo for [IEEE TGRS'26] "SPEX: A Vision-Language Model for Land Cover Extraction on Spectral Remote Sensing Images"
Official repo for "OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data"
Official repo for [NeurlPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"
Official repo for [AAAI 2026 Oral] "S5: Scalable Semi-Supervised Semantic Segmentation in Remote Sensing"
Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"
[CVPR 2025 HIghlight] XLRS-Bench: ould Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?
Official repo for [NeurlPS 2025] "RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing"
[Neural Networks 2025] Dual Selective Fusion Transformer Network for Hyperspectral Image Classification