viv6267 committed
Commit 27a33cc (verified)
1 Parent(s): 2fb9cf6

End of training

Files changed (2)
  1. README.md +75 -3
  2. adapter_model.safetensors +1 -1
README.md CHANGED
@@ -1,3 +1,75 @@
- ---
- license: mit
- ---
+ ---
+ library_name: peft
+ base_model: NousResearch/Llama-2-7b-hf
+ tags:
+ - generated_from_trainer
+ metrics:
+ - accuracy
+ - precision
+ - recall
+ - f1
+ model-index:
+ - name: evaluation_model
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # evaluation_model
+
+ This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.7124
+ - Accuracy: 0.4667
+ - Precision: 0.4577
+ - Recall: 0.9559
+ - F1: 0.6190
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 5e-05
+ - train_batch_size: 2
+ - eval_batch_size: 2
+ - seed: 42
+ - gradient_accumulation_steps: 8
+ - total_train_batch_size: 16
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 500
+ - num_epochs: 5
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 |
+ |:-------------:|:------:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|
+ | No log | 0.9829 | 43 | 0.9195 | 0.5467 | 0.0 | 0.0 | 0.0 |
+ | No log | 1.9943 | 87 | 0.6833 | 0.5667 | 0.5172 | 0.6618 | 0.5806 |
+ | No log | 2.9829 | 130 | 0.6898 | 0.5267 | 0.4884 | 0.9265 | 0.6396 |
+ | 0.8708 | 3.9943 | 174 | 0.6775 | 0.5667 | 0.5149 | 0.7647 | 0.6154 |
+ | 0.8708 | 4.9371 | 215 | 0.7124 | 0.4667 | 0.4577 | 0.9559 | 0.6190 |
+
+
+ ### Framework versions
+
+ - PEFT 0.13.2
+ - Transformers 4.46.2
+ - Pytorch 2.5.1+cu121
+ - Datasets 3.1.0
+ - Tokenizers 0.20.3
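
Reviewer note on the README.md diff: the figures in the added model card can be cross-checked with a little arithmetic. The sketch below recomputes F1 from the reported precision and recall for each evaluation row, derives the effective batch size, and estimates the learning rate at the last logged step. It makes several assumptions that the card does not state: a single training device, a total of 215 optimizer steps (the last step shown in the table), and a linear schedule that ramps from zero during warmup. The helper names `f1_score` and `linear_warmup_lr` are illustrative only and are not taken from the training script.

```python
# Values below are copied from the model card; everything else is an assumption.

def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def linear_warmup_lr(step: int, base_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Linear ramp from 0 to base_lr over warmup_steps, then linear decay to 0."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# (step, precision, recall, reported F1) rows from the "Training results" table.
ROWS = [
    (43, 0.0, 0.0, 0.0),
    (87, 0.5172, 0.6618, 0.5806),
    (130, 0.4884, 0.9265, 0.6396),
    (174, 0.5149, 0.7647, 0.6154),
    (215, 0.4577, 0.9559, 0.6190),
]

recomputed = {step: f1_score(p, r) for step, p, r, _ in ROWS}

# train_batch_size * gradient_accumulation_steps * devices (1 device assumed).
effective_batch = 2 * 8 * 1

# Assumed total of 215 optimizer steps; warmup is configured as 500 steps.
lr_at_last_step = linear_warmup_lr(step=215, base_lr=5e-5, warmup_steps=500, total_steps=215)

if __name__ == "__main__":
    for step, _, _, reported in ROWS:
        print(f"step {step}: reported F1 {reported:.4f}, recomputed {recomputed[step]:.4f}")
    print("effective batch size:", effective_batch)
    print("learning rate at step 215:", lr_at_last_step)
```

Under these assumptions the run ends while the schedule is still in its warmup phase, because the configured 500 warmup steps exceed the 215 steps logged, so the learning rate would never reach the configured 5e-05. Note also that the first row reports precision and recall of 0.0, which is what results when no positive predictions are made.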
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:aeadc2e60e04165e77e52beefadbdbb2eb102e42e60bfda4f95800df57cc96c3
+ oid sha256:f12771825308a90a943e704144143340484043b79eddd275a0f35859c33de809
  size 16827064
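
Reviewer note on the adapter_model.safetensors diff: only the Git LFS pointer changed, so the new weights are identified by the `oid` and `size` fields above. A minimal sketch of how a downloaded file could be checked against that pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:f12771825308a90a943e704144143340484043b79eddd275a0f35859c33de809
size 16827064
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash agree with the pointer."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

For example, `matches_pointer("adapter_model.safetensors", parse_lfs_pointer(POINTER))` would report whether a local copy (path assumed) is the file this commit points to. The unchanged `size` of 16827064 bytes is consistent with retrained weights of the same shape, though the size alone does not prove it.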