- 📹 Watch Sample Video
- Watch more videos on AWS S3
This is the official repository for the paper "Can Vision Language Models Understand Mimed Actions?" and the MIME benchmark.
MIME contains 86 animated videos, each with 8 different variants for systematically analyzing recognition robustness of mimed actions.
Refer to our project page for more details: https://justin-cho.com/mime
- `wise-east/mime-cropped`: cropped MIME videos with 8 variants (what we use in our paper)
- `wise-east/mime-real-resized`: resized REAL videos with 8 variants (what we use in our paper)
- `wise-east/mime-original`: original MIME videos without cropping
- `wise-east/mime-real-original`: original REAL videos without resizing

`mime-original` and `mime-real-original` are also available on AWS S3.
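For example, any of these datasets can be loaded directly from the Hugging Face Hub (a minimal sketch using the `datasets` library; the available splits and column names are whatever the dataset card on the Hub specifies):

```python
# Minimal sketch: load a MIME dataset from the Hugging Face Hub.
# Splits and column names are not assumed here; inspect the printed
# dataset object (or the dataset card) to see the actual schema.
from datasets import load_dataset

mime = load_dataset("wise-east/mime-cropped")
print(mime)  # available splits, number of examples, and column names
```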
```bash
conda create -n mime python=3.11
conda activate mime
pip install -e .
```

```bash
mimeeval run <mcq,ff> --dataset-name <dataset_name> \
    --model-name <model_name> --model <model_type> \
    --api-key <api_key> --eval-type <eval_type> --variant <variant>

# Example
mimeeval run mcq --dataset-name wise-east/mime-cropped \
    --model-name Qwen/Qwen2.5-VL-7B-Instruct --model qwen25vl \
    --api-key none --eval-type zero-shot --variant all
```

You can use `run_eval.sh` for convenience to replicate results from the paper.
- Example: `./run_eval.sh wise-east/mime-cropped all zero-shot mcq qwen3b`
- This will run zero-shot MCQ evaluation on all variants of the MIME dataset using Qwen2.5-VL-3B-Instruct.
- See `commands.sh` for all commands that were used for the paper.
- OpenAI (gpt-4o-mini)
- Gemini (gemini-1.5-flash)
- Qwen (Qwen2.5-VL-3B-Instruct, Qwen2.5-VL-7B-Instruct, Qwen2-VL-3B-Instruct, Qwen2-VL-7B-Instruct)
- InternVL (InternVL2_5-8B)
- Phi 3.5
Refer to the implementations in src/MimeEval/models/ to add support for custom models.
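As a rough, hypothetical sketch of what such a wrapper might look like (the actual interface expected by `mimeeval` is defined by the classes in `src/MimeEval/models/` and may differ), a custom model mainly needs to load its weights or API client and map a video plus prompt to a text answer:

```python
# Hypothetical sketch of a custom model wrapper; the real base-class interface
# lives in src/MimeEval/models/ and may differ from this outline.
class MyCustomVLM:
    """Wraps an arbitrary vision-language model for mimed-action evaluation."""

    def __init__(self, model_name: str, api_key: str = "none"):
        # Load local weights or set up an API client here.
        self.model_name = model_name
        self.api_key = api_key

    def generate(self, video_path: str, prompt: str) -> str:
        # Read the video, build the multimodal input, and return the model's
        # text answer (e.g. the selected option for MCQ, free text for FF).
        raise NotImplementedError
```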
Refer to MIME_pipeline_README.md for details on how to create the datasets. It contains information on where all the digital assets (motion capture data, background images, Mixamo characters, etc.) needed for creating MIME are stored.
- Raw human evaluation results: `results/mime-cropped/human_eval.jsonl` and `results/mime-real-resized/human_eval.jsonl` (see the sketch after this list for a quick way to inspect them)
- Compute human evaluation results: `python src/MimeEval/utils/compute_human_eval_results.py`
- `aggregate.py`: aggregates results from different models and datasets to produce the tables/figures in the paper.
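The raw human evaluation files are standard JSON Lines, so they can be inspected directly (a minimal sketch; the per-record fields and the authoritative parsing logic are defined in `compute_human_eval_results.py`):

```python
# Minimal sketch for inspecting the raw human evaluation files.
# The per-record fields are not assumed here; print one record to see them.
import json

with open("results/mime-cropped/human_eval.jsonl") as f:
    records = [json.loads(line) for line in f]

print(f"{len(records)} human evaluation records")
print(records[0])  # shows the fields available for each annotation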
For any questions, reach out to Justin Cho or open an issue.
@misc{cho2025visionlanguagemodelsunderstand,
title={Can Vision Language Models Understand Mimed Actions?},
author={Hyundong Cho and Spencer Lin and Tejas Srinivasan and Michael Saxon and Deuksin Kwon and Natali T. Chavez and Jonathan May},
year={2025},
eprint={2506.21586},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2506.21586},
}