Flexibility in choosing which checkpoint to save during multi-head training. #479

Open
LarsSchaaf opened this issue Jun 21, 2024 · 1 comment

Comments

@LarsSchaaf
Collaborator

Current behaviour

If the errors on the mp-head increase while the errors on the default-head decrease, the overall loss may increase, meaning that no new checkpoint is saved. As a result, after e.g. 200 epochs of training in which the error on the fine-tuning dataset decreases significantly, the last saved checkpoint could still be from e.g. epoch 8.

Desired behaviour

Allow the saved/final model to be selected based on the loss of a chosen head (or a weighting of multiple heads), so that whenever the error on the fine-tuning validation set decreases, that model is saved. A sketch of such a selection rule is given below.
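A minimal sketch of a head-weighted selection rule, assuming per-head validation losses are available as a dict; the head names, weights, and loss history here are hypothetical illustrations, not MACE's actual training API:

```python
"""Sketch of head-weighted checkpoint selection (assumed names, not MACE's API)."""
from typing import Dict


def checkpoint_metric(head_losses: Dict[str, float],
                      head_weights: Dict[str, float]) -> float:
    """Weighted sum of per-head validation losses."""
    return sum(head_weights.get(head, 0.0) * loss
               for head, loss in head_losses.items())


# Simulated per-head validation losses: the "mp" head drifts up while the
# fine-tuning "default" head keeps improving, so the *total* loss rises.
history = [
    {"mp": 0.10, "default": 0.50},  # epoch 0: total 0.60
    {"mp": 0.40, "default": 0.30},  # epoch 1: total 0.70
    {"mp": 0.80, "default": 0.10},  # epoch 2: total 0.90
]

# Weighting only the fine-tuning head selects epoch 2, which an
# unweighted total-loss criterion would never save (total keeps rising).
weights = {"mp": 0.0, "default": 1.0}
best_epoch, best_metric = None, float("inf")
for epoch, losses in enumerate(history):
    metric = checkpoint_metric(losses, weights)
    if metric < best_metric:  # this is where a checkpoint would be saved
        best_epoch, best_metric = epoch, metric
print(f"would keep checkpoint from epoch {best_epoch} (metric={best_metric:.2f})")
```

Setting `weights={"mp": 1.0, "default": 1.0}` recovers the current total-loss behaviour, so the existing default could be kept as a special case.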

Current workaround

Save checkpoints after each epoch.
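A hedged sketch of how that workaround plays out: save every epoch, then pick the best checkpoint afterwards based on the fine-tuning head alone. `evaluate_head_loss`, the `checkpoints/` directory, and the filename pattern are all assumptions for illustration, not MACE's real layout:

```python
"""Sketch of post-hoc checkpoint selection over per-epoch checkpoints."""
from pathlib import Path


def evaluate_head_loss(checkpoint: Path, head: str) -> float:
    """Placeholder: a real version would load the checkpoint and compute
    the validation loss of `head`; here we just parse the epoch number
    and pretend later epochs score better."""
    return -float(checkpoint.stem.split("-")[-1])


# Hypothetical layout: checkpoints/epoch-0.pt, checkpoints/epoch-1.pt, ...
checkpoints = sorted(Path("checkpoints").glob("epoch-*.pt"))
if checkpoints:
    best = min(checkpoints, key=lambda ckpt: evaluate_head_loss(ckpt, "default"))
    print(f"best fine-tuning checkpoint: {best}")
```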

@ilyes319
Contributor

@LarsSchaaf Good point. I made a change yesterday to save based only on the loss of the last head (which is the fine-tuning head), but I agree that more flexibility would be welcome.
