Skip to main content

TowerInstruct (Mistral)


NOTE To make your life easier, run these commands from the recipe directory (here recipes/wmt22_with_TowerInstruct-Mistral).

Retrieve and convert model

Set environment variables

export EOLE_MODEL_DIR=<where_to_store_models>
export HF_TOKEN=<your_hf_token>

Download and convert model

eole convert HF --model_dir Unbabel/TowerInstruct-Mistral-7B-v0.2 --output $EOLE_MODEL_DIR/TowerInstruct-Mistral-7b-v0.2 --token $HF_TOKEN

Inference

Build the prompt for translation of newstest2022-src.en

python promptize_mistral.py

Run inference

eole predict -c tower-inference.yaml -src newstest2022-src-prompt.en -output newstest2022-hyp.de

Then you can score newstest2022-hyp.de against newstest2022-ref.de with a scorer (sacrebleu or comet) or just use cometkiwi for reference-less score.