metadata
base_model: yhavinga/ul2-large-dutch
library_name: peft
license: apache-2.0
tags:
- generated_from_trainer
model-index:
- name: ul2-large-dutch-finetuned-oba-book-search
results: []
ul2-large-dutch-finetuned-oba-book-search
This model is a fine-tuned version of yhavinga/ul2-large-dutch on the None dataset. It achieves the following results on the evaluation set:
- Loss: 4.1161
- Top-5-accuracy: 4.1679
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.3
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 10
Training results
Training Loss | Epoch | Step | Validation Loss | Top-5-accuracy |
---|---|---|---|---|
6.2541 | 0.2577 | 200 | 4.6137 | 0.0579 |
5.8635 | 0.5155 | 400 | 4.5076 | 0.1158 |
5.5301 | 0.7732 | 600 | 4.4350 | 0.1447 |
5.5298 | 1.0309 | 800 | 4.4449 | 0.1447 |
5.3296 | 1.2887 | 1000 | 4.4621 | 0.1158 |
5.3336 | 1.5464 | 1200 | 4.4232 | 0.1447 |
5.2192 | 1.8041 | 1400 | 4.3842 | 0.1447 |
5.2348 | 2.0619 | 1600 | 4.3465 | 0.1447 |
5.0988 | 2.3196 | 1800 | 4.3129 | 0.2026 |
5.1633 | 2.5773 | 2000 | 4.3007 | 0.1737 |
5.1103 | 2.8351 | 2200 | 4.2722 | 0.2026 |
5.0057 | 3.0928 | 2400 | 4.3158 | 0.1447 |
5.0554 | 3.3505 | 2600 | 4.2731 | 0.4342 |
4.9774 | 3.6082 | 2800 | 4.2467 | 0.3763 |
4.9769 | 3.8660 | 3000 | 4.2320 | 0.5789 |
4.9825 | 4.1237 | 3200 | 4.2115 | 0.8394 |
4.9692 | 4.3814 | 3400 | 4.2172 | 1.3893 |
4.9681 | 4.6392 | 3600 | 4.2093 | 1.5630 |
4.8661 | 4.8969 | 3800 | 4.2003 | 2.2865 |
4.942 | 5.1546 | 4000 | 4.2047 | 2.3734 |
4.8974 | 5.4124 | 4200 | 4.1583 | 2.8654 |
4.8827 | 5.6701 | 4400 | 4.1852 | 2.9522 |
4.8705 | 5.9278 | 4600 | 4.1661 | 3.4732 |
4.8714 | 6.1856 | 4800 | 4.1478 | 3.7916 |
4.7909 | 6.4433 | 5000 | 4.1748 | 3.6179 |
4.8357 | 6.7010 | 5200 | 4.1471 | 3.9074 |
4.8723 | 6.9588 | 5400 | 4.1518 | 4.0232 |
4.8838 | 7.2165 | 5600 | 4.1428 | 4.1389 |
4.804 | 7.4742 | 5800 | 4.1468 | 4.0232 |
4.8232 | 7.7320 | 6000 | 4.1390 | 4.1389 |
4.8571 | 7.9897 | 6200 | 4.1305 | 4.0810 |
4.7454 | 8.2474 | 6400 | 4.1297 | 4.1679 |
4.8652 | 8.5052 | 6600 | 4.1262 | 4.1968 |
4.7882 | 8.7629 | 6800 | 4.1227 | 4.1679 |
4.8025 | 9.0206 | 7000 | 4.1134 | 4.1679 |
4.8124 | 9.2784 | 7200 | 4.1211 | 4.1389 |
4.7157 | 9.5361 | 7400 | 4.1122 | 4.1389 |
4.8666 | 9.7938 | 7600 | 4.1161 | 4.1679 |
Framework versions
- PEFT 0.11.0
- Transformers 4.44.2
- Pytorch 1.13.0+cu116
- Datasets 3.0.0
- Tokenizers 0.19.1