Adapted from: https://huggingface.co/mistralai/Devstral-Small-2-24B-Instruct-2512
Call run.py with the desired test:
$ python run.py test-1.pyOutputs to console and test-1.log.
Test 3 tries to fill up the context of 256K. I do not have a machine that can
run at that context, so I reduced it by a factor of 10. You'll need at least
28k context to run, 32k to be safe.