mob2711 committed on
Commit
968d9ea
1 Parent(s): d67a01a

End of training

README.md CHANGED
@@ -2,9 +2,9 @@
2
  license: apache-2.0
3
  library_name: peft
4
  tags:
5
- - unsloth
6
  - trl
7
  - sft
 
8
  - generated_from_trainer
9
  base_model: unsloth/llama-2-7b-bnb-4bit
10
  model-index:
@@ -17,7 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  # llama3-chat_10000_500
19
 
20
- This model is a fine-tuned version of [unsloth/llama-2-7b-bnb-4bit](https://huggingface.co/unsloth/llama-2-7b-bnb-4bit) on an unknown dataset.
 
 
21
 
22
  ## Model description
23
 
@@ -48,6 +50,27 @@ The following hyperparameters were used during training:
48
  - num_epochs: 5
49
  - mixed_precision_training: Native AMP
50
 
51
  ### Framework versions
52
 
53
  - PEFT 0.10.0
 
2
  license: apache-2.0
3
  library_name: peft
4
  tags:
 
5
  - trl
6
  - sft
7
+ - unsloth
8
  - generated_from_trainer
9
  base_model: unsloth/llama-2-7b-bnb-4bit
10
  model-index:
 
17
 
18
  # llama3-chat_10000_500
19
 
20
+ This model is a fine-tuned version of [unsloth/llama-2-7b-bnb-4bit](https://huggingface.co/unsloth/llama-2-7b-bnb-4bit) on the None dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 1.1126
23
 
24
  ## Model description
25
 
 
50
  - num_epochs: 5
51
  - mixed_precision_training: Native AMP
52
 
53
+ ### Training results
54
+
55
+ | Training Loss | Epoch | Step | Validation Loss |
56
+ |:-------------:|:-----:|:----:|:---------------:|
57
+ | 1.1238 | 0.33 | 104 | 0.9666 |
58
+ | 1.0103 | 0.67 | 208 | 0.9480 |
59
+ | 1.0056 | 1.0 | 312 | 0.9424 |
60
+ | 0.921 | 1.33 | 416 | 0.9508 |
61
+ | 0.9252 | 1.66 | 520 | 0.9476 |
62
+ | 0.9219 | 2.0 | 624 | 0.9415 |
63
+ | 0.7968 | 2.33 | 728 | 0.9808 |
64
+ | 0.8012 | 2.66 | 832 | 0.9787 |
65
+ | 0.7975 | 3.0 | 936 | 0.9819 |
66
+ | 0.674 | 3.33 | 1040 | 1.0476 |
67
+ | 0.6638 | 3.66 | 1144 | 1.0509 |
68
+ | 0.6687 | 3.99 | 1248 | 1.0456 |
69
+ | 0.5858 | 4.33 | 1352 | 1.1100 |
70
+ | 0.5783 | 4.66 | 1456 | 1.1124 |
71
+ | 0.581 | 4.99 | 1560 | 1.1126 |
72
+
73
+
74
  ### Framework versions
75
 
76
  - PEFT 0.10.0
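
The added "Training results" table shows training loss falling through all five epochs while validation loss bottoms out early and then climbs. A minimal sketch for reading that off the numbers follows. The rows are copied from the table above; the names `ROWS`, `best_checkpoint`, and `final_gap` are illustrative and are not part of the training script.

```python
# Rows copied from the "Training results" table in this commit:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (1.1238, 0.33, 104, 0.9666),
    (1.0103, 0.67, 208, 0.9480),
    (1.0056, 1.0, 312, 0.9424),
    (0.921, 1.33, 416, 0.9508),
    (0.9252, 1.66, 520, 0.9476),
    (0.9219, 2.0, 624, 0.9415),
    (0.7968, 2.33, 728, 0.9808),
    (0.8012, 2.66, 832, 0.9787),
    (0.7975, 3.0, 936, 0.9819),
    (0.674, 3.33, 1040, 1.0476),
    (0.6638, 3.66, 1144, 1.0509),
    (0.6687, 3.99, 1248, 1.0456),
    (0.5858, 4.33, 1352, 1.1100),
    (0.5783, 4.66, 1456, 1.1124),
    (0.581, 4.99, 1560, 1.1126),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


def final_gap(rows):
    """Validation loss at the last logged step minus the best validation loss."""
    return rows[-1][3] - best_checkpoint(rows)[3]


if __name__ == "__main__":
    train_loss, epoch, step, val_loss = best_checkpoint(ROWS)
    print(f"lowest validation loss {val_loss} at step {step} (epoch {epoch})")
    print(f"final minus best validation loss: {final_gap(ROWS):.4f}")
```

On these rows the lowest validation loss is 0.9415 at step 624 (epoch 2.0), whereas the 1.1126 reported in the model card summary is the value at the final step, 1560.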
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:20b0984a5d532e1bbcc920a85d3acfe41fd54b32bbc5a2af0354a46161daa3ea
3
  size 159967880
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:976a8e29779fe70a19e4a09a311c92f804a81b785d03c40ccb811e468dce4476
3
  size 159967880
runs/Apr26_17-37-53_92f4037b28a3/events.out.tfevents.1714153076.92f4037b28a3.25.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:97e85519f6edb9978242cd8d79f6e0738233b8898571b67eaa6b1cbd5401cd2f
3
- size 12322
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bccfd313b6a2b9752371a36534424fd3b08ce398ab402e020fa5e95f224eb9c
3
+ size 12676
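
The two binary files in this commit appear only as Git LFS pointer files: three `key value` lines giving the spec version, a `sha256` object id, and the size in bytes. For `adapter_model.safetensors` the size stays at 159967880 while the oid changes, so the contents were rewritten at the same length. A minimal sketch, assuming only the three-line pointer layout shown above, for parsing a pointer and checking downloaded bytes against it; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(pointer, data):
    """Check that raw bytes have the size and sha256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer for the new adapter_model.safetensors, copied from the diff above.
NEW_ADAPTER_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:976a8e29779fe70a19e4a09a311c92f804a81b785d03c40ccb811e468dce4476
size 159967880
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(NEW_ADAPTER_POINTER))
```

To verify a real download, read the file in binary mode and pass the bytes to `matches_pointer`; for a file of this size a chunked hash would use less memory, which this sketch leaves out for brevity.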