End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -6,18 +6,23 @@ tags:
 - sft
 - generated_from_trainer
 model-index:
-- name: Cold-Data-LLama-2-7B
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Cold-Data-LLama-2-7B
 This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0526
 ## Model description
@@ -36,7 +41,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.002
 - train_batch_size: 16
 - eval_batch_size: 32
 - seed: 42
@@ -47,17 +52,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 10
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.1019        | 1.992 | 249  | 0.1022          |
-| 0.0542        | 3.984 | 498  | 0.0540          |
-| 0.0508        | 5.976 | 747  | 0.0513          |
-| 0.0479        | 7.968 | 996  | 0.0515          |
-| 0.0472        | 9.96  | 1245 | 0.0537          |
 ### Framework versions
 - PEFT 0.12.0

 - sft
 - generated_from_trainer
 model-index:
+- name: Cold-Again-LLama-2-7B
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Cold-Again-LLama-2-7B
 This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 1.3661
+- eval_runtime: 90.0594
+- eval_samples_per_second: 1.11
+- eval_steps_per_second: 0.044
+- epoch: 5.76
+- step: 36
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
 - train_batch_size: 16
 - eval_batch_size: 32
 - seed: 42
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 10
 ### Framework versions
 - PEFT 0.12.0

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:38cd1ccd5681e8b6442697ddc6b05aa049d662f3fe973d76419114b54a30c5a7
 size 134235048

 version https://git-lfs.github.com/spec/v1
+oid sha256:eaae29d5c08c301696d3b97088a18cf91eeb0e129284ac2ffc2e336e18e0807c
 size 134235048

runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724193656.fastgpuserv.1412094.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0dd21beb2705c281008c444c908b3bbc99f8b40a7608ab8cc9afd0c2307affc3
+size 7406