jysssacc
/

bloomz-560m_adalora_lr5e-05_bs4_epoch20_wd0.01

PEFT

Safetensors

Generated from Trainer

Model card Files Files and versions Community

jysssacc commited on Jan 9

Commit

dd1036a

•

1 Parent(s): d6a4fdf

End of training

Browse files

Files changed (2) hide show

README.md +21 -21
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigscience/bloomz-560m](https://huggingface.co/bigscience/bloomz-560m) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.2942
 ## Model description
@@ -48,26 +48,26 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.0173        | 1.0   | 157  | 4.6371          |
-| 4.7211        | 2.0   | 314  | 4.3547          |
-| 4.481         | 3.0   | 471  | 3.7286          |
-| 3.673         | 4.0   | 628  | 3.4085          |
-| 3.5614        | 5.0   | 785  | 3.3548          |
-| 3.4806        | 6.0   | 942  | 3.3300          |
-| 3.4426        | 7.0   | 1099 | 3.3148          |
-| 3.366         | 8.0   | 1256 | 3.3067          |
-| 3.4129        | 9.0   | 1413 | 3.3017          |
-| 3.3326        | 10.0  | 1570 | 3.2965          |
-| 3.2807        | 11.0  | 1727 | 3.2934          |
-| 3.2478        | 12.0  | 1884 | 3.2932          |
-| 3.2432        | 13.0  | 2041 | 3.2923          |
-| 3.1969        | 14.0  | 2198 | 3.2919          |
-| 3.2311        | 15.0  | 2355 | 3.2936          |
-| 3.1626        | 16.0  | 2512 | 3.2937          |
-| 3.2034        | 17.0  | 2669 | 3.2937          |
-| 3.2143        | 18.0  | 2826 | 3.2941          |
-| 3.2509        | 19.0  | 2983 | 3.2944          |
-| 3.1608        | 20.0  | 3140 | 3.2942          |
 ### Framework versions

 This model is a fine-tuned version of [bigscience/bloomz-560m](https://huggingface.co/bigscience/bloomz-560m) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.2939
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 5.0192        | 1.0   | 157  | 4.6374          |
+| 4.7212        | 2.0   | 314  | 4.3532          |
+| 4.4794        | 3.0   | 471  | 3.7274          |
+| 3.6826        | 4.0   | 628  | 3.4142          |
+| 3.5713        | 5.0   | 785  | 3.3583          |
+| 3.4892        | 6.0   | 942  | 3.3334          |
+| 3.4485        | 7.0   | 1099 | 3.3175          |
+| 3.3712        | 8.0   | 1256 | 3.3086          |
+| 3.4196        | 9.0   | 1413 | 3.3030          |
+| 3.3404        | 10.0  | 1570 | 3.2977          |
+| 3.2865        | 11.0  | 1727 | 3.2944          |
+| 3.2568        | 12.0  | 1884 | 3.2936          |
+| 3.2502        | 13.0  | 2041 | 3.2924          |
+| 3.2052        | 14.0  | 2198 | 3.2918          |
+| 3.2404        | 15.0  | 2355 | 3.2934          |
+| 3.1706        | 16.0  | 2512 | 3.2934          |
+| 3.2111        | 17.0  | 2669 | 3.2934          |
+| 3.2227        | 18.0  | 2826 | 3.2937          |
+| 3.2601        | 19.0  | 2983 | 3.2941          |
+| 3.1703        | 20.0  | 3140 | 3.2939          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cdb18c4a68d1470c29cb7e5534f8bb21ccb01b2a3b5d51f61e50eb1914645f71
 size 4729800

 version https://git-lfs.github.com/spec/v1
+oid sha256:39f34efef29204468e83c88397092256909b004e6109acf50d353d06d5ebe6fa
 size 4729800