GutBrain IE Challenge @ CLEF 2026

Benedikt Kantz, Peter Walder, Stefan Lengauer, Tobias Schreck

reloaded for '26 - I have become CHASTE, master of CLEANR

Constrained
enHanced
Annotator
uSing
onTological
Entities

Our approach

Extract noun phrases using spaCy
Constrain the LLM using a simple (!) grammar
Use a fine-tuned HERMES 3.2 1B or 3B model
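
To illustrate how the first two steps can fit together, here is a minimal sketch that turns a list of candidate noun phrases into a grammar in llama.cpp's GBNF format, so the model can only emit phrases that actually occur in the text. This is an illustration under assumptions, not the grammar used in this repository: the function names, the `phrase -> label` output format, and the two example labels are hypothetical, and the phrases are assumed to come from something like spaCy's `doc.noun_chunks`.

```python
def gbnf_literal(text: str) -> str:
    """Quote text as a GBNF string literal (escapes backslash, quote, newline)."""
    escaped = (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return f'"{escaped}"'


def build_entity_grammar(phrases: list[str], labels: list[str]) -> str:
    """Build a GBNF grammar that allows one 'phrase -> label' line per entity.

    Hypothetical output format, chosen only for this sketch.
    """
    if not phrases or not labels:
        raise ValueError("need at least one phrase and one label")
    # dict.fromkeys removes duplicates while keeping the original order
    phrase_alts = " | ".join(gbnf_literal(p) for p in dict.fromkeys(phrases))
    label_alts = " | ".join(gbnf_literal(l) for l in dict.fromkeys(labels))
    rules = [
        'root ::= entity ("\\n" entity)*',
        'entity ::= phrase " -> " label',
        f"phrase ::= {phrase_alts}",
        f"label ::= {label_alts}",
    ]
    return "\n".join(rules) + "\n"


# Example input: phrases as they might come out of a noun-phrase extractor,
# labels made up for illustration.
grammar = build_entity_grammar(
    ["gut microbiota", "Parkinson's disease"],
    ["microbiome", "disease"],
)
print(grammar)
```

With llama-cpp-python, a string like this can be compiled with `LlamaGrammar.from_string(grammar)` and passed as the `grammar` argument of a completion call. Restricting `phrase` to extracted spans keeps the model from inventing entity mentions that are not in the input.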

Setup

# Install dependencies
git submodule update --init --recursive
# build with CUDA support:
CMAKE_BUILD_PARALLEL_LEVEL=8 CMAKE_ARGS="-DGGML_CUDA=on" uv sync --prerelease=allow
# you might need to specify your CUDA architecture:
CMAKE_BUILD_PARALLEL_LEVEL=8 CMAKE_ARGS="-DGGML_CUDA=on -DCMAKE_CUDA_ARCHITECTURES=75" uv sync --prerelease=allow
# if you want to use Metal (Apple GPU) instead:
CMAKE_ARGS="-DGGML_METAL=on" uv sync --prerelease=allow
source .venv/bin/activate
# in case you want to edit the llama python bindings, install them in editable mode
# with CUDA:
# CMAKE_BUILD_PARALLEL_LEVEL=8 CMAKE_ARGS="-DGGML_CUDA=on" uv pip install -e llama-cpp-python
# with Metal:
# CMAKE_BUILD_PARALLEL_LEVEL=8 CMAKE_ARGS="-DGGML_METAL=on" uv pip install -e llama-cpp-python
# on a cluster you could start an interactive session to compile with nvcc:
srun --gres=gpu -c 12 -u kanben --time=1:00:00 --pty bash

# download models (make sure to set your HF token!)
tune download NousResearch/Hermes-3-Llama-3.2-3B --output-dir models/hermes-3-2-3B
huggingface-cli download meta-llama/Llama-3.2-3B original/tokenizer.model --local-dir models/hermes-3-2-3B
tune download NousResearch/Hermes-3-Llama-3.1-8B --output-dir models/hermes-3-1-8B
huggingface-cli download meta-llama/Llama-3.1-8B original/tokenizer.model --local-dir models/hermes-3-1-8B
python manage_models/quantize_all.py
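
If a later step cannot find a tokenizer, a quick check that the downloads landed where expected can help. The sketch below is not part of the repository; the helper name is hypothetical, and it assumes that `huggingface-cli download ... --local-dir` keeps the repo-relative path `original/tokenizer.model` under the target directory.

```python
from pathlib import Path

# Paths derived from the download commands above (assumption: the
# repo-relative path is preserved below --local-dir).
EXPECTED_TOKENIZERS = [
    "hermes-3-2-3B/original/tokenizer.model",
    "hermes-3-1-8B/original/tokenizer.model",
]


def missing_model_files(models_dir, expected=EXPECTED_TOKENIZERS):
    """Return the expected files that do not exist below models_dir."""
    root = Path(models_dir)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_model_files("models")
    if missing:
        print("Missing files:")
        for rel in missing:
            print(f"  models/{rel}")
    else:
        print("All expected tokenizer files are present.")
```

Run it from the repository root so that the relative `models` directory resolves correctly.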

