💡 Interested in evaluating your model on ReasonMap or ReasonMap-Plus?
📩 Contact us at fscnkucs@gmail.com
🙋 Please let us know if you find a mistake or have any suggestions!
🌟 If you find this resource helpful, please consider starring this repository and citing our research!
- 2025-09-30: 🚀 We released ReasonMap-Plus to support our follow-up research, RewardMap!
- 2025-05-15: 🚀 We released the evaluation code and launched our project website!
- 2025-05-15: 🚀 We released ReasonMap!
If you face any issues with the installation, please feel free to open an issue. We will try our best to help you.
Create the conda environment with:

```bash
conda env create -f reasonmap-py310.yaml
```

You can download ReasonMap and ReasonMap-Plus from HuggingFace.
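If you prefer to fetch the data programmatically, the snippet below is a minimal sketch using the `huggingface_hub` library; the repository IDs are placeholders (not the actual dataset IDs), so replace them with the ones listed on our HuggingFace pages.

```python
# Minimal sketch: download the benchmark data from the Hugging Face Hub.
# NOTE: the repository IDs below are placeholders, not the actual dataset IDs;
# use the IDs listed on the ReasonMap / ReasonMap-Plus HuggingFace pages.
from huggingface_hub import snapshot_download

for repo_id in ["<org>/ReasonMap", "<org>/ReasonMap-Plus"]:
    local_dir = snapshot_download(repo_id=repo_id, repo_type="dataset")
    print(f"Downloaded {repo_id} to {local_dir}")
```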
You can evaluate model performance on ReasonMap and ReasonMap-Plus by running the following commands:
## ReasonMap Evaluation
```bash
# open-source models
bash script/run.sh

# closed-source models
bash script/run-closed-models.sh
```
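Evaluating a closed-source model generally means sending each transit-map image and question to the provider's API. The sketch below is purely illustrative of one such request with the OpenAI Python SDK; the model name, prompt, and image path are assumptions, and `script/run-closed-models.sh` may structure its requests differently.

```python
# Purely illustrative: a single vision-language query to a closed-source model.
# The model name, prompt, and image path are assumptions, not values used by
# script/run-closed-models.sh.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

with open("maps/example_city.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": "Plan a route from Station A to Station B using this transit map."},
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
        ],
    }],
)
print(response.choices[0].message.content)
```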
## ReasonMap-Plus Evaluation
```bash
bash script/run_plus.sh
```
After running the above scripts, you can analyze the results with:

```bash
python cal_metrics.py
```

If you find this benchmark useful in your research, please consider citing our paper:
```bibtex
@article{feng2025can,
  title={Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps},
  author={Feng, Sicheng and Wang, Song and Ouyang, Shuyi and Kong, Lingdong and Song, Zikai and Zhu, Jianke and Wang, Huan and Wang, Xinchao},
  journal={arXiv preprint arXiv:2505.18675},
  year={2025}
}

% follow-up research
@article{feng2025rewardmap,
  title={RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning},
  author={Feng, Sicheng and Tuo, Kaiwen and Wang, Song and Kong, Lingdong and Zhu, Jianke and Wang, Huan},
  journal={arXiv preprint arXiv:2510.02240},
  year={2025}
}
```