End of training
Browse files- README.md +6 -6
- adapter_model.bin +1 -1
README.md
CHANGED
@@ -104,7 +104,7 @@ xformers_attention: null
|
|
104 |
|
105 |
This model is a fine-tuned version of [princeton-nlp/Sheared-LLaMA-1.3B](https://huggingface.co/princeton-nlp/Sheared-LLaMA-1.3B) on the None dataset.
|
106 |
It achieves the following results on the evaluation set:
|
107 |
-
- Loss: 1.
|
108 |
|
109 |
## Model description
|
110 |
|
@@ -139,11 +139,11 @@ The following hyperparameters were used during training:
|
|
139 |
| Training Loss | Epoch | Step | Validation Loss |
|
140 |
|:-------------:|:------:|:----:|:---------------:|
|
141 |
| No log | 0.0000 | 1 | 5.2423 |
|
142 |
-
| 4.
|
143 |
-
| 3.
|
144 |
-
| 1.
|
145 |
-
| 1.
|
146 |
-
| 1.
|
147 |
|
148 |
|
149 |
### Framework versions
|
|
|
104 |
|
105 |
This model is a fine-tuned version of [princeton-nlp/Sheared-LLaMA-1.3B](https://huggingface.co/princeton-nlp/Sheared-LLaMA-1.3B) on the None dataset.
|
106 |
It achieves the following results on the evaluation set:
|
107 |
+
- Loss: 1.0933
|
108 |
|
109 |
## Model description
|
110 |
|
|
|
139 |
| Training Loss | Epoch | Step | Validation Loss |
|
140 |
|:-------------:|:------:|:----:|:---------------:|
|
141 |
| No log | 0.0000 | 1 | 5.2423 |
|
142 |
+
| 4.8593 | 0.0001 | 10 | 4.5952 |
|
143 |
+
| 3.0778 | 0.0002 | 20 | 2.4800 |
|
144 |
+
| 1.6502 | 0.0003 | 30 | 1.5654 |
|
145 |
+
| 1.197 | 0.0004 | 40 | 1.1690 |
|
146 |
+
| 1.213 | 0.0004 | 50 | 1.0933 |
|
147 |
|
148 |
|
149 |
### Framework versions
|
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 30103498
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2f9d7419235f572930207c2ac3039526ab97041a69871500eead7f79e69d89cf
|
3 |
size 30103498
|