SleepyGorilla
committed on
SleepyGorilla/Mistral_7B
README.md
CHANGED
@@ -18,15 +18,15 @@ should probably proofread and complete it, then remove this comment. -->

 This model is a fine-tuned version of [TheBloke/OpenHermes-2-Mistral-7B-GPTQ](https://huggingface.co/TheBloke/OpenHermes-2-Mistral-7B-GPTQ) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Rewards/chosen: -
-- Rewards/rejected: -
+- Loss: 0.0132
+- Rewards/chosen: -1.4792
+- Rewards/rejected: -8.5855
 - Rewards/accuracies: 1.0
-- Rewards/margins:
-- Logps/rejected: -
-- Logps/chosen: -
-- Logits/rejected: -2.
-- Logits/chosen: -2.
+- Rewards/margins: 7.1064
+- Logps/rejected: -319.5252
+- Logps/chosen: -138.4254
+- Logits/rejected: -2.3872
+- Logits/chosen: -2.5369

 ## Model description

@@ -59,11 +59,11 @@ The following hyperparameters were used during training:

 | Training Loss | Epoch | Step | Validation Loss | Rewards/chosen | Rewards/rejected | Rewards/accuracies | Rewards/margins | Logps/rejected | Logps/chosen | Logits/rejected | Logits/chosen |
 |:-------------:|:-----:|:----:|:---------------:|:--------------:|:----------------:|:------------------:|:---------------:|:--------------:|:------------:|:---------------:|:-------------:|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 0.5575 | 0.0 | 10 | 0.4017 | 0.0150 | -0.6143 | 1.0 | 0.6293 | -239.8125 | -123.4837 | -2.4102 | -2.6084 |
+| 0.3781 | 0.0 | 20 | 0.1298 | -0.2390 | -2.2414 | 1.0 | 2.0025 | -256.0842 | -126.0231 | -2.3786 | -2.6120 |
+| 0.219 | 0.0 | 30 | 0.0410 | -0.5640 | -4.3638 | 1.0 | 3.7998 | -277.3080 | -129.2739 | -2.3879 | -2.5872 |
+| 0.038 | 0.0 | 40 | 0.0168 | -1.2083 | -7.3369 | 1.0 | 6.1286 | -307.0389 | -135.7168 | -2.3962 | -2.5566 |
+| 0.0669 | 0.0 | 50 | 0.0132 | -1.4792 | -8.5855 | 1.0 | 7.1064 | -319.5252 | -138.4254 | -2.3872 | -2.5369 |


 ### Framework versions
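The reward columns in the updated README table appear to be related by a simple identity: the margin is the chosen reward minus the rejected reward. The sketch below checks that against the five evaluation rows from this commit. It assumes the metric names follow the convention used by DPO-style preference trainers; the README itself does not name the trainer, and `reward_margin` and `check_margins` are illustrative helpers, not part of any library.

```python
# Evaluation rows copied from the updated README table in this commit:
# (step, rewards_chosen, rewards_rejected, reported_margin)
ROWS = [
    (10, 0.0150, -0.6143, 0.6293),
    (20, -0.2390, -2.2414, 2.0025),
    (30, -0.5640, -4.3638, 3.7998),
    (40, -1.2083, -7.3369, 6.1286),
    (50, -1.4792, -8.5855, 7.1064),
]


def reward_margin(chosen: float, rejected: float) -> float:
    """Assumed definition: margin = chosen reward minus rejected reward."""
    return chosen - rejected


def check_margins(rows, tol: float = 2e-4):
    """Compare recomputed margins with the reported ones.

    The tolerance allows for each logged value being rounded to four
    decimals independently before it was written to the table.
    """
    results = []
    for step, chosen, rejected, reported in rows:
        computed = reward_margin(chosen, rejected)
        results.append((step, computed, reported, abs(computed - reported) <= tol))
    return results


if __name__ == "__main__":
    for step, computed, reported, ok in check_margins(ROWS):
        print(f"step {step}: computed={computed:.4f} reported={reported:.4f} ok={ok}")
```

Under that assumption, every row agrees with the reported margin to within rounding, and the margin grows from step 10 to step 50 mainly because the rejected reward falls faster than the chosen reward.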
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:58a9849f918c2040073e354ef8b6860deae289185db34920861266a53e2e876e
 size 13648432
runs/Mar18_11-00-46_0d00f13c1b40/events.out.tfevents.1710759826.0d00f13c1b40.247.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:500e94250634a61337d19cb66fdd2913cbcfdb21a2b0c39133dd60061c3f8ca4
+size 13334
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4087bb6f460ec3cc9afe55fb6b825229b72adfd18a1c406f7ebba757842a8f84
 size 4475
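The three binary files in this commit are stored as Git LFS pointers: short text files with a `version` line, an `oid` line naming the hash algorithm and digest, and a `size` line in bytes. A minimal sketch of reading such a pointer follows. `parse_lfs_pointer` is a hypothetical helper written for illustration, and it assumes the simple `key value` line layout seen in the diffs above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields.

    Assumes each non-empty line is 'key value', as in the diffs above.
    """
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value

    # The oid has the form '<algorithm>:<hex digest>', e.g. 'sha256:4087...'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer for training_args.bin as it appears after this commit.
TRAINING_ARGS_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:4087bb6f460ec3cc9afe55fb6b825229b72adfd18a1c406f7ebba757842a8f84
size 4475
"""

if __name__ == "__main__":
    print(parse_lfs_pointer(TRAINING_ARGS_POINTER))
```

Because the pointer records only a digest and a size, a changed `oid` with an unchanged `size` (as for `adapter_model.safetensors` and `training_args.bin` here) means the file contents changed while the byte length stayed the same.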