org_modelorg_model / README.md
Stern5497's picture
mistral-org_aug_a
5387d00 verified
|
raw
history blame
2.84 kB
metadata
license: apache-2.0
library_name: peft
tags:
  - generated_from_trainer
base_model: mistralai/Mistral-7B-v0.1
model-index:
  - name: org_modelorg_model
    results: []

org_modelorg_model

This model is a fine-tuned version of mistralai/Mistral-7B-v0.1 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.0772
  • F1 Micro: 0.5885
  • F1 Macro: 0.5884
  • F1 Weighted: 0.5884

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 32
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • training_steps: 400

Training results

Training Loss Epoch Step Validation Loss F1 Micro F1 Macro F1 Weighted
1.6909 0.0064 25 1.5440 0.5095 0.5089 0.5089
1.4631 0.0127 50 1.4130 0.5483 0.5458 0.5458
1.3764 0.0191 75 1.2988 0.5543 0.5543 0.5543
1.2402 0.0255 100 1.2464 0.5623 0.5621 0.5621
1.1982 0.0318 125 1.2415 0.5580 0.5544 0.5544
1.1759 0.0382 150 1.1822 0.5732 0.5728 0.5728
1.0769 0.0446 175 1.1590 0.5788 0.5787 0.5787
1.0388 0.0510 200 1.1416 0.5821 0.5820 0.5820
1.1786 0.0573 225 1.1273 0.5815 0.5806 0.5806
1.2269 0.0637 250 1.1233 0.5823 0.5798 0.5798
1.1746 0.0701 275 1.1105 0.5833 0.5819 0.5819
1.1455 0.0764 300 1.0927 0.5868 0.5864 0.5864
1.0494 0.0828 325 1.0905 0.5873 0.5867 0.5867
1.0199 0.0892 350 1.0853 0.5845 0.5836 0.5836
1.1086 0.0955 375 1.0789 0.5852 0.5849 0.5849
1.0726 0.1019 400 1.0772 0.5885 0.5884 0.5884

Framework versions

  • PEFT 0.10.0
  • Transformers 4.40.2
  • Pytorch 2.3.0+cu118
  • Datasets 2.19.0
  • Tokenizers 0.19.1