reverse_sentence_conv
This model is a fine-tuned version of gpt2 on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 3.5315
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0001
- train_batch_size: 32
- eval_batch_size: 32
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: reduce_lr_on_plateau
- lr_scheduler_warmup_steps: 500
- num_epochs: 1
Training results
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
4.5866 | 0.0254 | 1000 | 4.4049 |
4.2274 | 0.0508 | 2000 | 4.1030 |
4.0215 | 0.0762 | 3000 | 3.9975 |
3.9753 | 0.1016 | 4000 | 3.8979 |
3.9045 | 0.1270 | 5000 | 3.8442 |
3.839 | 0.1524 | 6000 | 3.8197 |
3.8273 | 0.1778 | 7000 | 3.7559 |
3.7514 | 0.2032 | 8000 | 3.7565 |
3.7582 | 0.2286 | 9000 | 3.7251 |
3.7526 | 0.2540 | 10000 | 3.6833 |
3.6384 | 0.2794 | 11000 | 3.6979 |
3.6941 | 0.3048 | 12000 | 3.6516 |
3.6874 | 0.3302 | 13000 | 3.6499 |
3.6126 | 0.3556 | 14000 | 3.6601 |
3.6481 | 0.3810 | 15000 | 3.6137 |
3.6138 | 0.4064 | 16000 | 3.6151 |
3.5954 | 0.4318 | 17000 | 3.6257 |
3.6109 | 0.4572 | 18000 | 3.5878 |
3.5476 | 0.4826 | 19000 | 3.6129 |
3.5847 | 0.5080 | 20000 | 3.5751 |
3.6001 | 0.5334 | 21000 | 3.5630 |
3.4921 | 0.5588 | 22000 | 3.5947 |
3.5602 | 0.5842 | 23000 | 3.5612 |
3.5586 | 0.6096 | 24000 | 3.5619 |
3.5002 | 0.6350 | 25000 | 3.5781 |
3.5385 | 0.6604 | 26000 | 3.5442 |
3.5121 | 0.6858 | 27000 | 3.5469 |
3.5019 | 0.7112 | 28000 | 3.5632 |
3.5243 | 0.7366 | 29000 | 3.5321 |
3.456 | 0.7620 | 30000 | 3.5610 |
3.5086 | 0.7874 | 31000 | 3.5285 |
3.5329 | 0.8128 | 32000 | 3.5219 |
3.4271 | 0.8382 | 33000 | 3.5543 |
3.4957 | 0.8636 | 34000 | 3.5173 |
3.4912 | 0.8890 | 35000 | 3.5269 |
3.4407 | 0.9144 | 36000 | 3.5433 |
3.4831 | 0.9398 | 37000 | 3.5118 |
3.4514 | 0.9652 | 38000 | 3.5186 |
3.4561 | 0.9906 | 39000 | 3.5315 |
Framework versions
- Transformers 4.45.2
- Pytorch 2.4.1+cu121
- Datasets 3.0.1
- Tokenizers 0.20.1
- Downloads last month
- 25
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
Model tree for fpadovani/reverse_sentence_conv
Base model
openai-community/gpt2