bilkultheek committed commit 1370ce5 (1 parent: 6e55c47)
Training in progress, step 20
- README.md +9 -15
- adapter_config.json +0 -4
- adapter_model.safetensors +1 -1
- runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724180258.fastgpuserv.1412094.0 +3 -0
- runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724193656.fastgpuserv.1412094.1 +3 -0
- runs/Aug25_12-49-36_fastgpuserv/events.out.tfevents.1724572185.fastgpuserv.1980100.0 +3 -0
- runs/Aug25_12-51-29_fastgpuserv/events.out.tfevents.1724572295.fastgpuserv.1980100.1 +3 -0
- runs/Aug25_12-52-57_fastgpuserv/events.out.tfevents.1724572380.fastgpuserv.1980100.2 +3 -0
- runs/Aug25_12-54-00_fastgpuserv/events.out.tfevents.1724572443.fastgpuserv.1980100.3 +3 -0
- runs/Aug25_12-56-16_fastgpuserv/events.out.tfevents.1724572582.fastgpuserv.2002935.0 +3 -0
- runs/Aug25_12-58-05_fastgpuserv/events.out.tfevents.1724572692.fastgpuserv.2002935.1 +3 -0
- runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572873.fastgpuserv.2014604.0 +3 -0
- runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572937.fastgpuserv.2014604.1 +3 -0
- runs/Aug25_13-02-49_fastgpuserv/events.out.tfevents.1724572977.fastgpuserv.2014604.2 +3 -0
- runs/Aug25_13-08-55_fastgpuserv/events.out.tfevents.1724573339.fastgpuserv.2014604.3 +3 -0
- runs/Aug25_13-11-19_fastgpuserv/events.out.tfevents.1724573484.fastgpuserv.2039446.0 +3 -0
- runs/Aug25_13-13-03_fastgpuserv/events.out.tfevents.1724573587.fastgpuserv.2039446.1 +3 -0
- runs/Aug25_13-14-36_fastgpuserv/events.out.tfevents.1724573679.fastgpuserv.2039446.2 +3 -0
- runs/Aug25_13-16-49_fastgpuserv/events.out.tfevents.1724573812.fastgpuserv.2039446.3 +3 -0
- runs/Aug25_13-18-04_fastgpuserv/events.out.tfevents.1724573887.fastgpuserv.2039446.4 +3 -0
- runs/Aug25_13-19-17_fastgpuserv/events.out.tfevents.1724573959.fastgpuserv.2039446.5 +3 -0
- runs/Aug25_13-21-16_fastgpuserv/events.out.tfevents.1724574078.fastgpuserv.2039446.6 +3 -0
- runs/Aug25_13-22-31_fastgpuserv/events.out.tfevents.1724574155.fastgpuserv.2039446.7 +3 -0
- runs/Aug25_13-24-04_fastgpuserv/events.out.tfevents.1724574247.fastgpuserv.2039446.8 +3 -0
- runs/Aug25_13-25-56_fastgpuserv/events.out.tfevents.1724574359.fastgpuserv.2039446.9 +3 -0
- runs/Aug25_13-28-50_fastgpuserv/events.out.tfevents.1724574532.fastgpuserv.2039446.10 +3 -0
- runs/Aug25_13-29-54_fastgpuserv/events.out.tfevents.1724574596.fastgpuserv.2039446.11 +3 -0
- runs/Aug25_13-31-12_fastgpuserv/events.out.tfevents.1724574675.fastgpuserv.2039446.12 +3 -0
- runs/Aug25_13-33-47_fastgpuserv/events.out.tfevents.1724574833.fastgpuserv.2094483.0 +3 -0
- runs/Aug25_13-37-11_fastgpuserv/events.out.tfevents.1724575034.fastgpuserv.2094483.1 +3 -0
- runs/Aug25_13-39-10_fastgpuserv/events.out.tfevents.1724575153.fastgpuserv.2094483.2 +3 -0
- runs/Aug25_13-40-19_fastgpuserv/events.out.tfevents.1724575222.fastgpuserv.2094483.3 +3 -0
- runs/Aug25_13-45-34_fastgpuserv/events.out.tfevents.1724575537.fastgpuserv.2094483.4 +3 -0
- runs/Aug25_13-47-07_fastgpuserv/events.out.tfevents.1724575630.fastgpuserv.2094483.5 +3 -0
- runs/Aug26_10-52-41_fastgpuserv/events.out.tfevents.1724651569.fastgpuserv.681719.0 +3 -0
- training_args.bin +2 -2
README.md
CHANGED
@@ -6,18 +6,23 @@ tags:
 - sft
 - generated_from_trainer
 model-index:
-- name: Cold-
+- name: Cold-Again-LLama-2-7B
 results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 
-# Cold-
+# Cold-Again-LLama-2-7B
 
 This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
 It achieves the following results on the evaluation set:
--
+- eval_loss: 1.3661
+- eval_runtime: 90.0594
+- eval_samples_per_second: 1.11
+- eval_steps_per_second: 0.044
+- epoch: 5.76
+- step: 36
 
 ## Model description
 
@@ -36,7 +41,7 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 0.
+- learning_rate: 0.0001
 - train_batch_size: 16
 - eval_batch_size: 32
 - seed: 42
@@ -47,17 +52,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.03
 - num_epochs: 10
 
-### Training results
-
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.1019 | 1.992 | 249 | 0.1022 |
-| 0.0542 | 3.984 | 498 | 0.0540 |
-| 0.0508 | 5.976 | 747 | 0.0513 |
-| 0.0479 | 7.968 | 996 | 0.0515 |
-| 0.0472 | 9.96 | 1245 | 0.0537 |
-
-
 ### Framework versions
 
 - PEFT 0.12.0
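As a quick sanity check on the evaluation metrics reported in the updated README.md, the runtime and throughput figures can be multiplied out to estimate the evaluation set size and batch count. This is only a sketch and not part of the commit: the variable names are illustrative, the inputs are the rounded values from the model card, and it assumes a single device so that `eval_batch_size` is the effective batch size.

```python
import math

# Values copied from the updated README.md in this commit.
eval_runtime = 90.0594            # seconds
eval_samples_per_second = 1.11
eval_steps_per_second = 0.044
eval_batch_size = 32              # assumed to be the effective batch size

# Throughput times runtime gives approximate totals (the inputs are rounded).
approx_samples = eval_samples_per_second * eval_runtime
approx_steps = eval_steps_per_second * eval_runtime

# Cross-check: the batch count implied by the sample estimate.
implied_steps = math.ceil(round(approx_samples) / eval_batch_size)

print(f"approx. eval samples: {approx_samples:.1f}")
print(f"approx. eval steps:   {approx_steps:.2f}")
print(f"steps implied by batch size {eval_batch_size}: {implied_steps}")
```

Under those assumptions both estimates point to roughly 100 evaluation samples processed in 4 batches.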
adapter_config.json
CHANGED
@@ -15,10 +15,6 @@
 "megatron_config": null,
 "megatron_core": "megatron.core",
 "modules_to_save": [
-"classifier",
-"score",
-"classifier",
-"score",
 "classifier",
 "score"
 ],
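The adapter_config.json change drops repeated entries from `modules_to_save`. The commit does not say how the duplicates were introduced or removed; one way to normalize such a list before writing a config, shown here as a sketch with a hypothetical helper name, is an order-preserving de-duplication:

```python
def dedupe_preserving_order(items):
    """Return items with duplicates removed, keeping first occurrences in order."""
    # dict keys are unique and preserve insertion order (Python 3.7+).
    return list(dict.fromkeys(items))


# The old list from adapter_config.json: four removed entries plus two kept ones.
old_modules_to_save = [
    "classifier", "score",
    "classifier", "score",
    "classifier", "score",
]
new_modules_to_save = dedupe_preserving_order(old_modules_to_save)
print(new_modules_to_save)
```

The result matches the two entries that remain after this commit.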
adapter_model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:fd50fb82d204807b97190a86a0bdbeeda83e66d8cdb296e8122ccbecb3f803e8
 size 134267920
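The binary files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. A minimal sketch of reading such a pointer follows; the function name and the returned dictionary layout are illustrative assumptions, not part of Git LFS or of this repository.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# New pointer for adapter_model.safetensors, copied from the diff.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:fd50fb82d204807b97190a86a0bdbeeda83e66d8cdb296e8122ccbecb3f803e8
size 134267920
"""
info = parse_lfs_pointer(pointer)
print(info["hash_algo"], info["size"])
```

Here the size line is unchanged context while the oid line changed, which is consistent with the adapter weights being updated without a change in file size.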
runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724180258.fastgpuserv.1412094.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cfea3dd99266f8e2d927b3263ad7f9dc2333b546e001b052feec3f12f408c84a
+size 8700
runs/Aug20_23-57-27_fastgpuserv/events.out.tfevents.1724193656.fastgpuserv.1412094.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0dd21beb2705c281008c444c908b3bbc99f8b40a7608ab8cc9afd0c2307affc3
+size 7406
runs/Aug25_12-49-36_fastgpuserv/events.out.tfevents.1724572185.fastgpuserv.1980100.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:94bac5057816ac7b9bd02bccaff1c89290da5dabbdcfe0062596415e54e5a5db
+size 4438
runs/Aug25_12-51-29_fastgpuserv/events.out.tfevents.1724572295.fastgpuserv.1980100.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e94d3717138def6bc618ef8e5c929b48ecefd83aa5b2c8ff342374948ae39f21
+size 5598
runs/Aug25_12-52-57_fastgpuserv/events.out.tfevents.1724572380.fastgpuserv.1980100.2
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6b97f604468f1ef7f28857b5d74e6bdf9d077c084d5a6e9fb19ae42999dd2b93
+size 5598
runs/Aug25_12-54-00_fastgpuserv/events.out.tfevents.1724572443.fastgpuserv.1980100.3
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c0a91abd9d44e497de4b388cfcd3eeb161700f5a90dbbf797992e4060d85fd0d
+size 4184
runs/Aug25_12-56-16_fastgpuserv/events.out.tfevents.1724572582.fastgpuserv.2002935.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c262987c6874977559b16bc9f9cf632c92511fbfe2ab81569ac9fa6aeba34209
+size 5598
runs/Aug25_12-58-05_fastgpuserv/events.out.tfevents.1724572692.fastgpuserv.2002935.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:430307b7d4b3dfc7f07960a800f4f6c8c3b49704d19a14f0096c6793f57039fa
+size 5598
runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572873.fastgpuserv.2014604.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:41393a92903e62df14bf495a4039eccc18ea5204ffaedd62b705fddd9e69a0cf
+size 5598
runs/Aug25_13-01-05_fastgpuserv/events.out.tfevents.1724572937.fastgpuserv.2014604.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1b46e86d24bca382788d36b178fdc98484f5f5f25f0b80682f3a67b5774787d4
+size 5598
runs/Aug25_13-02-49_fastgpuserv/events.out.tfevents.1724572977.fastgpuserv.2014604.2
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a2296f480f72760cadb8ccec85b048b59941e2b707bd6c0a8d65e52b12441a75
+size 5596
runs/Aug25_13-08-55_fastgpuserv/events.out.tfevents.1724573339.fastgpuserv.2014604.3
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:1cacbe5d9ef832b1f20887456d37fadc641d787b68741c1c309061df6ad9054f
+size 4184
runs/Aug25_13-11-19_fastgpuserv/events.out.tfevents.1724573484.fastgpuserv.2039446.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:339e20ee57f60c47413524e27c5cd52de457b6f8de8fb8edc3ad397528d5f3a2
+size 6076
runs/Aug25_13-13-03_fastgpuserv/events.out.tfevents.1724573587.fastgpuserv.2039446.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:64b089d6bda5d19a795ebf1e10cdf2b6c9a42af74d8a209fa080f251c30e2e11
+size 6076
runs/Aug25_13-14-36_fastgpuserv/events.out.tfevents.1724573679.fastgpuserv.2039446.2
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e205c713fd6766b63147f4c86f273f6e443c1cae50def3f1d6978665d62a3821
+size 6076
runs/Aug25_13-16-49_fastgpuserv/events.out.tfevents.1724573812.fastgpuserv.2039446.3
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:defe742e00d150c4f7cd16f19c449d8551e0850f8f033e36814c13839328d9e1
+size 6076
runs/Aug25_13-18-04_fastgpuserv/events.out.tfevents.1724573887.fastgpuserv.2039446.4
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6da021f154c2c8db83797b311ec472783efae80f41409ac8e5c04836c4c352b4
+size 6076
runs/Aug25_13-19-17_fastgpuserv/events.out.tfevents.1724573959.fastgpuserv.2039446.5
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:787ebf30bff7b63bddb73ee484ab8a584ced63199a259e3efea9960c71c7813c
+size 6076
runs/Aug25_13-21-16_fastgpuserv/events.out.tfevents.1724574078.fastgpuserv.2039446.6
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:71f035a87b1d3f3f0e91533cdc4b67555f0f3e2e1e8d969b192fa50d7ff3cdf4
+size 6076
runs/Aug25_13-22-31_fastgpuserv/events.out.tfevents.1724574155.fastgpuserv.2039446.7
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19f27422f32a5b79f46db60c6cd1053643e2908e01c11939a858d9bfd2c29f37
+size 6076
runs/Aug25_13-24-04_fastgpuserv/events.out.tfevents.1724574247.fastgpuserv.2039446.8
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0c02bfebe7df0f3c9c72a1cca27188e32cfbebbfe151483aec81ee725a09024d
+size 6076
runs/Aug25_13-25-56_fastgpuserv/events.out.tfevents.1724574359.fastgpuserv.2039446.9
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:07dcb0dd548a5dbf42d7374ddee73f563c67cef4bfebcebb968c4de62ba37666
+size 6077
runs/Aug25_13-28-50_fastgpuserv/events.out.tfevents.1724574532.fastgpuserv.2039446.10
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:37bf42ac45b1be45a9462485e3ac1f766c670721d4149072b2ee580063dbf0ca
+size 6077
runs/Aug25_13-29-54_fastgpuserv/events.out.tfevents.1724574596.fastgpuserv.2039446.11
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:baaa5b103f99bf013c76c4c2314bad677518e5133ec10a662a84a9a258e535eb
+size 6077
runs/Aug25_13-31-12_fastgpuserv/events.out.tfevents.1724574675.fastgpuserv.2039446.12
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4084bb131681fdb9652839f5a026b30e1810b0c54e3a6f24f02fed43c7244107
+size 4184
runs/Aug25_13-33-47_fastgpuserv/events.out.tfevents.1724574833.fastgpuserv.2094483.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7b37216c58bec635d1dfb11687cd46222b42efd07b4c3ff2204a950a3fc05c96
+size 6077
runs/Aug25_13-37-11_fastgpuserv/events.out.tfevents.1724575034.fastgpuserv.2094483.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:23cecba266d80cb784f471afcfa3c8022a76a8e0f206d63fec76efcfe2f4777c
+size 6077
runs/Aug25_13-39-10_fastgpuserv/events.out.tfevents.1724575153.fastgpuserv.2094483.2
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b9810a57917dc5deacee5f3a38e7dc7a26e0c5efae0b5ad82e00e6954d7fe990
+size 6077
runs/Aug25_13-40-19_fastgpuserv/events.out.tfevents.1724575222.fastgpuserv.2094483.3
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:65a4f4b45c2ff2131a135c230eaf1a0e67f873fa5b1c06ee1eb73e38f03323d1
+size 6077
runs/Aug25_13-45-34_fastgpuserv/events.out.tfevents.1724575537.fastgpuserv.2094483.4
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:cad21eb3a2a322b320b97ec5258fd775edd1542772f0d62d6e4c4cc52d10fef5
+size 6077
runs/Aug25_13-47-07_fastgpuserv/events.out.tfevents.1724575630.fastgpuserv.2094483.5
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:d6d613233e81b43d3dfd5bfcd2e1487f2e7c7ee0fc35873bf13644d53a1aafba
+size 4436
runs/Aug26_10-52-41_fastgpuserv/events.out.tfevents.1724651569.fastgpuserv.681719.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0d6bb75db714a8b97962615252d95be365b7b9a6b67e4e011f09a0bdecf02670
+size 6501
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:aa87077c98cf82a25c6e6ca0ea31291d12cf2d89cf8409479fa0f6dfcde184fd
+size 5432