
Official repo for "GeoZero: Incentivizing Reasoning from Scratch on Geospatial Scenes"

Python 21 1 Updated Dec 4, 2025

Official code for the paper “Look Where It Matters: Training-Free Ultra-HR Remote Sensing VQA via Adaptive Zoom Search”.

Python 24 Updated Dec 8, 2025

✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 149 11 Updated Oct 21, 2025
JavaScript 4 Updated Dec 1, 2025

[NeurIPS 2025 D&B] RSCC: A Real-World Remote Sensing Change Caption Dataset

Python 39 Updated Nov 14, 2025

[CVPR 2025 🔥] EarthDial: Turning Multi-Sensory Earth Observations to Interactive Dialogues.

Python 100 8 Updated Jun 20, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,785 1,076 Updated Dec 23, 2025

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

930 39 Updated Sep 27, 2025

VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis

Python 107 7 Updated Feb 19, 2025

Code and updates for the ScoreRS project.

Python 35 2 Updated Sep 19, 2025

Official repo for "S5: Scalable Semi-Supervised Semantic Segmentation in Remote Sensing"

Python 33 1 Updated Dec 4, 2025

Official repo for "[ICCV 2025] Harnessing Massive Satellite Imagery with Efficient Masked Image Modeling"

Python 129 11 Updated Aug 12, 2025

Official repo for "REX-RAG: Reasoning Exploration with Policy Correction in Retrieval-Augmented Generation"

Python 30 3 Updated Sep 28, 2025

Official repo for "SPEX: A Vision-Language Model for Land Cover Extraction on Spectral Remote Sensing Images"

20 Updated Aug 8, 2025

Official repo for "Advances in Radiance Field for Dynamic Scene: From Neural Field to Gaussian Field"

9 1 Updated Sep 19, 2025

Official repo for [NeurIPS 2025 Spotlight] "GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution"

Python 38 1 Updated Oct 27, 2025

[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?

Python 34 2 Updated Dec 1, 2025

Official repo for "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"

Python 21 1 Updated Sep 26, 2025

Official repo for "TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series"

27 1 Updated May 14, 2025

Official repo for [NeurIPS 2025] "DGSolver: Diffusion Generalist Solver with Universal Posterior Sampling for Image Restoration"

Python 139 1 Updated May 6, 2025

Official repo for [NeurIPS 2025] "RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing"

Python 122 16 Updated Sep 24, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,908 658 Updated Nov 20, 2025
2 Updated Sep 27, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,298 328 Updated Dec 15, 2025

Towards a Unified Copernicus Foundation Model for Earth Vision

Jupyter Notebook 116 6 Updated Oct 2, 2025

This is the implementation of the paper "DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding"

Python 82 2 Updated Jun 9, 2025

This is an official implementation for "HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery" (CVPR2025)

Python 106 11 Updated Dec 3, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,200 2,686 Updated Aug 12, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,709 2,870 Updated Dec 23, 2025

Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)

Python 59 4 Updated Apr 12, 2025