CROC
_{^{CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks}}

In this repository, we release the source code of our dataset CROC and the metric CROCScore. For usage instructions, please see the respective folder's readme files, as well as the main part of the scripts and comments in the code.

📁 Repository Structure

├── croc_hum            # Everything related to CROChum, our human supervised dataset, i.e., scripts for data generation, evaluation and human evaluation
├── croc_syn            # Everything related to CROCsyn, our synthetic dataset, i.e., scripts for data generation, evaluation and human evaluation
├── metrics             # Wrapper scripts to apply T2I metrics including CROCScore
├── pyproject.toml      # List of poetry dependencies
├── README.md           # This Project documentation
└── ...

🛠️ Setup & Installation

To set up the environment and install dependencies:

pip install uv

git clone https://github.com/Gringham/CROC.git
cd CROC

uv venv --python 3.11
uv pip install -U timm  flash-attn==2.7.3 vllm transformers==4.49 diffusers[torch]==0.34.0 --no-build-isolation --no-cache-dir 
#Note that some parts of CROC have conflicting dependencies and may require different environments.

After installing the dependencies, please set your cache directory in the following locations

- project_root.py
- croc_hum/metric_apply.sh
- croc_syn/img_gen/img_gen.sh
- further_benchmarks/3_genaibench_experiment/t2v_metrics/genai_image_eval_customscore.py
- metrics/apply_metrics.sh
- metrics/VQAScore.py

In all slurm scripts, specify the activation of your specific environment.

🚀 Usage

Please view the readme files in the respective subfolders.

📌 Notice on AI Generation

Some parts of the code were written with AI support by Github Copilot and GPT-o3/o4.

📖 Citation

If you use this work in your research, please cite it as:

@misc{leiter2025crocevaluatingtrainingt2i,
      title={CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks}, 
      author={Christoph Leiter and Yuki M. Asano and Margret Keuper and Steffen Eger},
      year={2025},
      eprint={2505.11314},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2505.11314}, 
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CROC
_{^{CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks}}

📁 Repository Structure

🛠️ Setup & Installation

🚀 Usage

Please view the readme files in the respective subfolders.

📌 Notice on AI Generation

📖 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
croc_hum		croc_hum
croc_syn		croc_syn
further_benchmarks		further_benchmarks
metrics		metrics
.gitignore		.gitignore
README.md		README.md
project_root.py		project_root.py

Folders and files

Latest commit

History

Repository files navigation

CROC CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks

📁 Repository Structure

🛠️ Setup & Installation

🚀 Usage

Please view the readme files in the respective subfolders.

📌 Notice on AI Generation

📖 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

CROC
_{^{CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks}}

Packages