---
license: other
library_name: peft
tags:
  - llama-factory
  - lora
  - generated_from_trainer
base_model: microsoft/Orca-2-7b
model-index:
  - name: lora
    results: []
---

# lora

This model is a fine-tuned version of [microsoft/Orca-2-7b](https://huggingface.co/microsoft/Orca-2-7b) on the Pretrain_Basic dataset. It achieves the following results on the evaluation set:

- Loss: 0.4452
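
Since this card describes a PEFT LoRA adapter rather than a full set of model weights, it is meant to be loaded on top of the base model. The snippet below is a minimal usage sketch; the `adapter_path` value is a placeholder for wherever this adapter is stored (a local directory or its Hub repo id), not a name taken from this card.

```python
# Minimal sketch: attach this LoRA adapter to the microsoft/Orca-2-7b base model.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model_id = "microsoft/Orca-2-7b"
adapter_path = "lora"  # placeholder: local adapter folder or Hub repo id

tokenizer = AutoTokenizer.from_pretrained(base_model_id)
base_model = AutoModelForCausalLM.from_pretrained(base_model_id, device_map="auto")

# Load the LoRA weights produced by this training run on top of the base model.
model = PeftModel.from_pretrained(base_model, adapter_path)
model.eval()

inputs = tokenizer("Explain LoRA fine-tuning in one sentence.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```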

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training (see the configuration sketch after this list):

- learning_rate: 5e-05
- train_batch_size: 4
- eval_batch_size: 2
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 16
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: cosine
- lr_scheduler_warmup_steps: 1000
- num_epochs: 1.0
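
These values map fairly directly onto Hugging Face `TrainingArguments`. The sketch below is an approximation for readers who want to reproduce a similar run outside LLaMA-Factory; LoRA-specific settings such as rank, alpha, and target modules are not recorded in this card, so they are omitted.

```python
# Approximate reproduction of the hyperparameters above using transformers.TrainingArguments.
# Illustrative sketch only, not the exact LLaMA-Factory configuration used for this run.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="lora",              # placeholder output directory
    learning_rate=5e-05,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=2,
    seed=42,
    gradient_accumulation_steps=4,  # 4 x per-device batch of 4 = total train batch size 16
    optim="adamw_torch",            # Adam with betas=(0.9, 0.999) and epsilon=1e-08
    lr_scheduler_type="cosine",
    warmup_steps=1000,
    num_train_epochs=1.0,
)
```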

### Training results

| Training Loss | Epoch  | Step | Validation Loss |
|:-------------:|:------:|:----:|:---------------:|
| 0.5009        | 0.1586 | 500  | 0.4964          |
| 0.4641        | 0.3172 | 1000 | 0.4591          |
| 0.4514        | 0.4758 | 1500 | 0.4516          |
| 0.4522        | 0.6344 | 2000 | 0.4482          |
| 0.4436        | 0.7930 | 2500 | 0.4459          |
| 0.4463        | 0.9516 | 3000 | 0.4452          |

### Framework versions

- PEFT 0.10.0
- Transformers 4.40.2
- Pytorch 2.0.1+cu118
- Datasets 2.17.1
- Tokenizers 0.19.1